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CASE REPORT: 


AVOIDANCE CONDITIONING THERAPY OF AN INFANT 
WITH CHRONIC RUMINATIVE VOMITING * 


PETER J. LANG ann BARBARA G. MELAMED ? 


University of Wisconsin 


This paper reports the treatment of a 9-mo.-old male infant whose life was 
seriously endangered by persistent vomiting and chronic rumination. Àn aver- 
sive conditioning paradigm, employing electric shock, significantly reduced the 
frequency of this maladaptive response pattern in a few, brief treatment sessions, 
Electromyographic records were used in assessing response characteristics of the 
emesis, and in determining the shock contingencies used in therapy. Cessation 
of vomiting and rumination was accompanied by weight gains, increased 
activity level, and general responsiveness to people. 


A variety of techniques have been used in 
the treatment of persistent vomiting in in- 
fants and children. In general these therapies 
are tailored to the known or hypothesized 
causes of the disorder. Thus, the presence of 
functional disturbance in the intestinal tract 
would encourage the use of pharmacologic 
agents—“tranquilizers,” antinauseants, or 
antiemetics, If gastric, anatomical anomalies 
can be diagnosed, their surgical removal often 
proves to be the most effective treatment. 
Animal studies suggest that surgical manipu- 
lation of the central nervous system may also 
become a vehicle for emesis control (Borison, 
1959). 

When diagnosis excludes obvious, organic 
antecedents, both the etiology and treatment 
of the disorder appear less certain. However, 
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(MH-10993) from the National Institute of Mental 
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this treatment method and for his assistance during 
its application. The authors are also indebted to 
Charles Lobeck, Chairman of the Department of 
Pediatrics of University Hospitals, Madison, Wiscon- 
sin, who made facilities available for use, and to the 
assigned nursing staff without whose help and co- 
operation the present result could not have been 
accomplished. The authors also express their appre- 
ciation to Norman Greenfield and Richard Sternbach 
of the Department of Psychiatry, University of 
Wisconsin, for the loan of a polygraph, and to 
Karl G. Stoedefalke of Physical Education for pro- 
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clinical workers have described an apparently 
“psychosomatic” vomiting in children which 
is generally accompanied by a ruminative re- 
chewing of the vomitus. In reviewing the syn- 
drome, Richmond, Eddy, and Green (1958) 
adhere to the widely held psychoanalytic 
hypothesis that it results from a disruption 
in the mother-infant relationship. They sug- 
gest that the condition is brought about by 
the inability of the mother to fulfill an adult 
psychosexual role which is reflected in mari- 
tal inadequacy. She is unable to give up her 
own dependent needs and is incapable of pro- 
viding warm, comfortable, and intimate physi- 
cal care for the infant. This lack of comfort 
from without causes the infant to seek and 
recreate such gratification from within. Thus, 
in attempting to regain some satisfaction 
from the feeding situation, he regurgitates his 
food and retains it in his mouth. The recom- 
mended treatment is the interruption in the 
mother-infant relationship by hospitalization 
and the provision of a stimulating, warm en- 
vironment with a substitute mother figure. 
This method achieved success in the four 
cases reviewed. Berlin, McCullough, Lisha, 
and Szurek (1957) offer a similar psycho- 
analytic interpretation in reporting a case 
study of a 4-yr-old child hospitalized for 8 
mo. at Langley-Porter Clinic. Psychotherapy, 
involving concomitant counseling to improve 
the relationship between the parents, led to 
an alleviation of the child's vomiting reaction. 

From the point of view espoused by learn- 
ing theorists, emesis and rumination may be 
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learned habits. In point of fact, vomiting has 
been clearly demonstrated as a conditioned 
response in at least three independent studies 
(Collins & Tatum, 1925; Kleitman & Crisler, 
1927; Pavlov, 1927). This prompts the corol- 
lary hypothesis that such behavior could be 
eliminated directly by counterconditioning 
procedures. 

A number of case reports indicate that con- 
siderable success may be achieved in modify- 
ing alimentary habits in the clinic setting. 
Both Bachrach, Erwin, and Mohr (1965) 
and Meyer? successfully treated adult 
anorexic patients by making various social 
and physical reinforcers contingent on eating 
behavior or weight gain. Lang (1965) de- 
scribed the therapy of a young adult patient 
who became nauseous and vomited under so- 
cial stress. In this case, counterconditioning 
methods increased the patient’s tolerance of 
formerly aversive social situations, and thus 
markedly reduced the frequency of nausea 
and emesis. 

The only study reviewed, attempting to 
apply conditioning methods specifically in 
the treatment of ruminative vomiting was 
reported by White and Taylor (1967). Elec- 
tric shock was applied to two mentally re- 
tarded patients (23-yr.-old female, 14-yr.-old 
male) whenever throat, eye, or coughing ges- 
tures signaled rumination. They suggest that 
the shock served to distract the patient and 
he engaged in other activities rather than 
ruminating. Significant improvement occurred 
after 1 wk. of treatment, and gains were 
maintained at a 1-mo. follow-up. 

The following case report illustrates the 
efficacy of aversive conditioning in reversing 
the vomiting and rumination of a 9-mo.-old 
infant whose life was endangered by this be- 
havior. The case is of general interest because 
of the extreme youth of the patient, the 
speed of treatment, and the fact that condi- 
tioning procedures were undertaken only 
after other treatments had been either ruled 
out by diagnostic procedures, or had been 
given a reasonable trial without success. 
These data also have further implications 
for the understanding of aversive condition- 
ing procedures in clinical practice. 


3 Meyer, V. Personal communication, 1964. 


BARBARA MELAMED 


History of Problem and Family Background | 


A. T. at the age of 9 mo. was admitted to 
the University Hospital for failure to retain 
food and chronic rumination. This infant had 
undergone three prior hospitalizations for his 
persistent vomiting after eating and. failure 
to gain weight. Born in an eastern state after 
an uneventful 39-wk. pregnancy, the patient 
was bottle fed and gained steadily from a 
birth weight of 9 Ib. 4 oz. to 17 Ib. at 6 mo. of 
age. Vomiting was first noted during the fifth 
month, and increased in severity to the point 
where the patient vomited 10-15 min. after 
each meal, This activity was often associated 
with vigorous thumbsucking, placing fingers in 
his mouth, blotchiness of the face, and rumi- 
nating behavior. The mother remarked that 
the start of vomiting may have coincided with | 
her indisposition due to a broken ankle which 
forced the family to live with maternal grand- 
parents for several weeks. Some friction was 
reported between the patient’s mother and| 
her own adoptive mother concerning care of 
the child. The patient’s father is a part-time 
college student and the family received finan- 
cial assistance from the paternal grandfather, 
a successful dentist. At the time of the most 
recent hospitalization, the social worker’s re- 
port suggested that the parents were making 
a marginal marital adjustment. 

Three brief periods of hospitalization 
which included medical tests (gastrointestinal 
fluoroscopy, EEG, and neuropsychological 
testing) failed to find an organic basis fot 
this persistent regurgitation. An exploratory 
operation was performed and a cyst on the 
right kidney removed, with no discernible 
effect on his condition. The patient had m 
history of head trauma. One previous ind 
dent of persistent vomiting in a paternal 
uncle was noted to be of very short duration 
The paternal grandfather and two uncles att 
reported to suffer ulcers, 

Several treatment approaches were applied 
without success. Dietary changes (Pro-Sobet 
skim milk), the administration of antinaus* 
ants, and various mechanical maneuvers t° 
improve the feeding situation (different pos 
tions, small amounts at each feeding, burp 
ing) gave short-lived, if any, relief. As thumb 
sucking often preceded the response, !€ 
straints were tried. However, this did little 
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Fic. 1. The photograph at the left was taken during the observation period just prior to treatment. (It 
clearly illustrates the patient's debilitated condition—the lack of body fat, skin hanging in loose folds. The 
tape around the face holds tubing for the nasogastric pump. The photograph at the right was taken on 
the day of discharge from the hospital, 13 days after the first photo. The 26% increase in body weight 
already attained is easily seen in the full, more infantlike face, the rounded arms, and more substantial 


trunk.) 


to reduce the frequency of emesis. An at- 
tempt had been made to initiate intensive 
nursing care “to establish and maintain a 
one-to-one relationship and to provide the 
child with warm, friendly, and secure feel- 
ings [nurse’s chart].” This had to be aban- 
doned because it was not inhibiting the 
vomiting and some observers felt that it in- 
creased the child’s anxiety and restlessness. 

At the time the present investigators were 
called in, the infant was in critical condition, 
down to a weight of 12 lb., and being fed 
through a nasogastric pump. The attending 
Physician’s clinical notes attest that condi- 
tioning procedures were applied as a last 
attempt, “in view of the fact that therapy 
until now has been unsuccessful and the life 
of the child is threatened by continuation of 
this behavior.” 


Therapeutic Procedure and Results 


ü The patient was given a private room, con- 
uous nursing care, and assigned a special 


graduate nurse to assist in the conditioning 
procedures. The authors closely observed the 
infant for 2 days during and after normal 
feeding periods. He reliably regurgitated most 
of his food intake within 10 min. of each 
feeding and continued to bring up small 
amounts throughout the day. Observers on 
the hospital staff suggested that vomiting was 
originally induced by thumb pressure at the 
back of the throat. However, at this stage 
thumb manipulations were not a necessary 
part of the vomiting sequence. He did pro- 
test, however, if hand restraint was enforced. 
His frail appearance and general unrespon- 
siveness, made him a pathetic looking child 
as seen from a photograph taken just prior 
to treatment (Figure 1). 

In an attempt to obtain a clearer picture 
of the patterning of his response, electromyo- 
graph (EMG) activity at three sites was 
monitored on a Gilson Polygraph. Responses 
leading up to and into the vomiting sequence 
reliably coincided with the nurse's concur- 
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rent description of the sequence of behavior. 
Figure 2 illustrates the typical response pat- 
tern. The uppermost channel of information 
represents muscle potentials recorded just 
under the chin, and shows the sucking be- 
havior which usually preceded vomiting; the 
lowest channel is an integrated record taken 
from the throat muscles of the neck; the 
center channel which monitors the upper 
chest region is largely EKG artifact. It can 
be noted from this segment that the onset of 
vomiting is clearly accompanied by vigorous 
throat movements indicated by rhythmic, 
high-frequency, high-amplitude activity, in 
contrast with quiescent periods and periods 
where crying predominated. 

The authors were concerned with elimi- 
nating the inappropriate vomiting, without 
causing any fundamental disturbance in the 
feeding behavior of the child. Fortunately, 
the child did not vomit during feeding, and 
the sucking and vomiting could be distin- 
guished readily on the EMG. After 2 days 
of monitoring, conditioning procedures were 
initiated. The aversive conditioning paradigm 
called for brief and repeated shock (approxi- 
mately 1 sec. long with a 1-sec. interpulse 
interval) as soon as vomiting occurred, con- 


Fic. 2. Three channels of EMG activity are pre- 
sented. (The nurse observer's comments are written 
just above the first channel. The intense muscle 
activity on this line is associated with sucking be- 
havior, recorded from electrodes on the underside 
of the chin. The second channel is just below the 
one pulse per second, timing line, and was taken from 
electrodes on the upper chest, at the base of the 
throat. The EKG dominates this channel, with some 
local muscle activity. Electrodes straddling the 
esophagus yielded the lowest line, which in this inte- 
grated record clearly shows the rhythmic pulsing of 
the vomiting response.) 


tinuing until the response was terminated. An ` 
effort was made to initiate shock at the first 
sign of reverse peristalsis, but not during the 
preceding sucking behavior. The contingency 
was determined from the nurse's observations 
of the patient and the concurrent EMG 


ning. If EMG confirmed the judgment, shock 
was delivered. Occasionally, the EMG would 
initiate this sequence, with the observational 
judgment following.* Shock was delivered by 
means of a Harvard Inductorium to elec- 
trodes placed on the calf of the patient's leg. 
A 3,000-cps tone was temporally coincident 
with each shock presentation.’ Sessions were 
chosen following feeding to insure some fre- 
quency of response. Each session lasted less 
than 1 hour. 

After two sessions shock was rarely re- 
quired. The infant would react to the shock 
by crying and cessation of vomiting. By the 
third session only one or two brief presenta- 
tions of shock were necessary to cause cessa- 
tion of any vomiting sequence. Figure 3 
illustrates the typical sequence of a condi- 
tioning trial. 

The course of therapy is indicated in Fig- 
ure 4. Few shocks were administered after 


* Particular thanks are due to Mary Kachoyeanos, 
the nurse who assisted at all the therapy sessions. 

5Shock level was first determined by applying 
the electrodes to the Es, who judged it to be quite 
painful and unpleasant. Intensity was incremented 
slightly during the first and second sessions on the 
basis of the patient's response, but was subsequently 
unchanged. The inductorium does not permit for 
exact or wholly reliable measures of current level. 
However, under the conditions of treatment de- 
scribed here, the average current was within a range 
of from .10 to 30 ma., with a cycle frequency of 
approximately 50 cps. It should be borne in mind 
that pulses from an inductorium vary widely in 
amplitude, and the authors’ instrument produced 
some spikes over 10 ma. Electrodes were first ap- 
plied to the ball of the foot and then moved to the 
calf for reasons stated in the text. The accompany- 
ing tone was generated by a Hewlett-Packard signal 
generator and administered by a small oval speaker 
in a free field. The intensity was loud but not pain- 
ful (approximately 80-95 db.), and varied con- 
siderably because of spontaneous changes in the 
infant’s position. It was employed in order to in- 
crease the density of the reinforcer and on the possi- 
bility that the therapists might employ it alone, if 
shock proved to have negative side effects. 


records. In general, the nurse would signal 
as soon as she thought an emesis was begin- 


——— 
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Fic. 3. The electrode positions are the same as 
in Figure 2. (The top line shows the point at which 
two brief shocks were administered. It may be noted 
that they follow closely on the first pulse of the 
vomiting response and that the rhythmic regurgita- 
tion observed in Figure 2 never gets underway.) 


the first day of treatment, and both the time 
spent vomiting and the average length of 
each vomiting period were abruptly reduced. 
After only two sessions it seemed that the 
infant was anticipating the unpleasant conse- 
quences of his behavior. He would begin to 
suck vigorously using his thumb, and then 
he would remove his thumb and cry loudly. 

The data graphed (Figure 4) for the sec- 
ond treatment session represent those rein- 
forcers that the authors are certain were de- 
livered. Early in this session, it became ob- 
vious that the infant was not receiving the 
majority of the administered shocks. The 
electrodes were at that. time attached to the 
plantar surface of the foot. Observation sug- 
gested that the patient had learned to curl 
his foot, either coincident with emesis or at 
the first sensation of shock, so as to lift the 
electrodes off the skin and thus avoid the 
painful stimulus. At this point, the electrodes 
were relocated on the calf, and conditioning 
proceeded normally, If the shock adminis- 
trations prior to this procedural change are 
added to those on the graph, Day 3, afternoon 
figures for emesis period, percentage of 
emesis, and shock, respectively, are 11 sec., 
21.6% and 77. 

By the sixth session the infant no longer 
vomited during the testing procedures. He 
would usually fall asleep toward the middle 
of the hour. Figure 5 indicates the sequence 
of response demonstrating the replacement of 


—— —emeiis period 


— — percent emesis 


no. of reinforcements 


DAYS 


Fic. 4. The abscissa describes successive days 
(morning and afternoon) on which observation or 
treatment was accomplished. (“Emesis period" is the 
length of any continuous period of vomiting. “Per- 
centage of emesis” is the total time spent vomiting 
divided by the time observed. Sessions varied from 
16 min. to 60 min. Treatment began on Day 3 
which included two unshocked emesis periods, In 
Session 10 tone alone was presented on one trial. It 
is of interest to note that following therapy, nursing 
staff reported that they could now block the very 
rare vomiting periods with a sharp handclap.) 


vigorous sucking with what the nursing ob- 
servers described as a “pacifier” use of the 


thumb. 

To vary the conditions under which learn- 
ing would take place, thereby providing for 
transfer of effects, the sessions were sched- 
uled at different hours of the day, and while 
the infant was being held, playing on the 
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Fic. 5. The above segment is representative of 
behavior near the end of a conditioning session. 
(Only mild sucking activity is apparent in the upper 
EMG channel. The electrode positions are the same 
as in Figure 2.) 
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floor, as well as lying in bed. Nursing staff 
reported a progressive decrease in his rumi- 
nating and vomiting behavior during the rest 
of the day and night, which paralleled the 
reduction observed across therapy sessions. 
After three sessions in which there was no 
occurrence of vomiting, the procedure was 
discontinued. Two days later there was some 
spontaneous recovery, which included some 
vigorous sucking, with a little vomiting and 
rumination. Three additional sessions were 
initiated to maintain the reduced frequency 
of the response (see Figure 4). Except for 
a brief slackening prior to these trials, there 
was a steady, monotonic increase in his 
weight as shown in Figure 6. In general, his 
activity level increased, he became more in- 
terested in his environment, enjoyed play- 
room experience, and smiled and reached out 
to be held by the nurse and other visitors. 
The mother was reintroduced the day fol- 
lowing the last conditioning trial. She took 
over some of the patient’s caretaking needs, 
including feedings. There was no marked 
change in his ruminating behavior at this 
time. The mother responded well and her 
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Fic. 6. The infant’s body weight as determined 
from the nursing notes is plotted over time, from 
well before conditioning therapy was instituted to 
the day of discharge from the hospital. (Days on 
which conditioning sessions occurred are marked by 
circles on the curve. Reinforcers were delivered only 
on days marked by open circles. The decline in body 
weight in the few days just prior to therapy was 
probably occasioned by the discontinuance of the 
nasogastric pump, in favor of normal feeding pro- 
cedures, The marked weight gain from Day 13 to 
18 is coincident with the first 6 days of therapy. 
The temporary reduction in weight increase, asso- 
ciated with a resumption of emesis, is apparent at 
Day 19. The additional conditioning trials appear 
to have acted immediately to reinstate weight gain.) 


child reciprocated her attention. He was dis- 
charged from the hospital 5 days later, after 
exhibiting almost no ruminating behavior. 
The remarkable contrast in his physical ap- 
pearance is noted in a photograph taken on 
the day of discharge (Figure 1). 


Follow-up 


Correspondence with the mother indicated 
that there was no further need for treatment. 
A. T. was eating well and gaining weight 
regularly. She reported that any thumbsuck- 
ing or rumination was easily arrested by pro- 
viding him with other forms of stimulation. 
He was beginning to seek atttention from 
other people and enjoyed the company of 
other children. One month following dis- 
charge from the hospital, he was seen for a 
physical check-up. He appeared as a healthy 
looking 21-Ib. child and, aside from a slight 
anemic condition, was found fully recovered 
by the attending physician. His local physi- 
cian reported on a visit 5 mo. later when his 
weight was 26 lb., 1 oz. “His examinations 
were negative for any problems. . . . He was 
eating quite well . . . no vomiting had re- 
curred. He was alert, active and attentive." 
A snapshot taken by the mother a few weeks 
before this examination is reproduced in Fig- 
ure 7. One year after treatment he con- 
tinues to thrive. Mother and father are both 
pleased with his development, and no further 
treatment is indicated. 


Discussion 


The rapid recovery of this 9-mo.-old male 
infant following brief aversive conditioning 
therapy, argues for the effectiveness of be- 
havioral modification in the treatment of this 
type of psychosomatic disorder. The vomiting 
and ruminating were treated as maladaptive 
behavior patterns, and electric shock was 
used to inhibit a previously well-established 
response sequence. Elimination of the re- 
Sponse was accompanied by increase in the 
infant’s responsiveness to people, as well as 
substantial weight gains, and physiological 
improvement. 

Treatment was undertaken without analy- 
Sis of the disorder's antecedents. Neverthe- 
less, the family history of the infant could 
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Fic. 7. The patient 5 mo, after treatment. 


be construed as consistent with other cases 
in the literature. One clinical worker sug- 
gested that a feeling of hostility dominated 
this infant’s home. It is true that the parents’ 
wedding was attended by difficulties and the 
subsequent birth of the patient occurred be- 
fore the parents were fully prepared for this 
responsibility. Furthermore, the mother later 
expressed anxiety about her marriage and 
complained of the problem of balancing the 
separate demands of father and child. She also 
reported her feeling that her own step-mother 
had not provided a good maternal model. 
As a consequence she felt inadequate herself 
and uncertain in the role. 

The caseworker’s notes are thus rich in 
“dynamics,” and while one is unable to estab- 
lish the relative accuracy or significance of 


these statements, it is clear that this case is 
interpretable within traditional personality 
theories. Nevertheless, therapies generated by 
this orientation were not successful in the 
present case. In deference, it should also be 
noted that “one-to-one” care was not main- 
tained as long or as consistently as in many 
cases reported in the literature, and despite 
evidence of some marital discord, no exten- 
sive counseling of the parents was under- 
taken. However, like many psychiatric treat- 
ments, the above are expensive of professional 
personnel and prolonged in duration. The 
aversive conditioning procedures used here 
achieved success in little more than a week, 
and considering the developing danger to the 
child’s life, speed was of more than usual im- 
portance. 

No evidence of “symptom substitution” 
was observed following treatment. On the 
other hand, positive social behavior increased 
coincident with the successful conditioning 
therapy. The infant became more responsive 
to adults, smiled more frequently, and seemed 
to be more interested in toys and games 
than he had been previously. An analogous 
improvement in social behavior was noticed 
in the defective adults treated by White and 
Taylor (1967). Lovass, Freitag, Gold, 
and Kassorla (1965) and Lovaas, Schaeffer, 
and Simmons (1965) have cited similar ef- 
fects following the avoidance conditioning of 
tantrum behavior in autistic children. The 
latter investigators suggest that the Es at- 
tained secondary reinforcing value because 
of their association with shock reduction. 
This provides the basis for training the chil- 
dren to exhibit affectionate patterns toward 
adults, In the present case this contingency 
was very imprecise, and it is not clear that 
the above mechanism mediated change. What 
could be called normal infant behavior in- 
creased regularly, as the emesis decreased. 
The social environment appeared simply to 
replace ruminating as the infant’s focus of 
attention. 

Aversive conditioning has been applied 
widely in adult therapy as well as with au- 
tistic children. Eysenck and Rachman (1965) 
and Feldman (1966) describe its use in treat- 
ing alcoholic and sexual disorders. However, 
one hesitates to interpret these findings in a 
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straightforward manner. Adult patients may 
submit to aversive conditioning procedures 
from a variety of motives, and cognitive fac- 
tors may blunt the impact or distort the 
meaning of aversive stimuli. The present case 
is of particular interest because these pro- 
cedures were successful in treating an appar- 
ently normal child. Furthermore, the absence 
of language and the limited cognitive de- 
velopment achieved at this age permit one 
to interpret this change as avoidance con- 
ditioning, unmitigated by the above factors. 

Finally, it should be noted that the present 
case represents a productive use of psycho- 
physiologic recording in therapy. Not only 
did the EMG provide extensive documenta- 
tion of the response, but concurrent record- 
ing was of considerable help in guiding the 
treatment effort. Specifically, these records 
confirmed in an objective manner external 
observations of mouth and throat movements 
which seemed to precede emesis. Further- 
more, they extended these observations, help- 
ing the authors to specify those aspects of 
the response which were unique to the vomit- 
ing sequence, thus assuring that shock was 
never delivered following noncontingent be- 
havior. Finally, observation of the recordings 
during therapy probably reduced the latency 
of reinforcement, particularly during the 
early trials when the validity of external signs 
seemed less certain, and provided the clearest 
indicator of the end of the response when 
shock was promptly terminated. While the 
importance of this information to the results 
obtained cannot be unequivocally established, 
it certainly increased the confidence of the 
therapists in their method, and, in turn, the 
speed and precision with which they pro- 
ceeded. The further exploration of physio- 
logical analysis in the therapeutic setting is 
encouraged. 
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INFLUENCE OF THE PARENTS AND VERBAL 
REINFORCEMENT ON THE PERFORMANCE 
OF SCHIZOPHRENIC PATIENTS * 


AMERIGO FARINA, JULES D. HOLZBERG, ann ROBERT R. DIES? 
University of Connecticut and Connecticut Valley Hospital, Middletown 


Two studies concerned with the psychological deficits displayed by schizophrenic 
patients were conducted. The first study evaluated the role of the parents and 
the second measured the effects of praise and censure. For each study, Ss were 
two groups of male schizophrenic patients, one rated as Good and the other as 
Poor by means of the Phillips Scale, and a group of nonschizophrenic psychi- 


atric patients. Large differences were 


found between patients whose parents 


displayed high and those whose parents displayed low degrees of conflict 
toward them just prior to the task performance. Verbal rewards and punish- 
ments did not appear to affect the patient’s behavior and for neither study 
were the three types of Ss differentially influenced by the manipulations. The 
main conclusions are that the phenomenon of psychological deficits is extremely 


complex and that schizophrenic patient 


ts are less readily influenced by affective 


experiences than prior researchers have suggested. 


A very large number of studies demonstrate 
that schizophrenic patients display psycho- 
logical deficits (Buss & Lang, 1965; Lang & 
Buss, 1965), defined as inadequacies in the 
behavior of such patients in comparison to 
control Ss or in relation to expectations based 
upon education and intelligence. Usually 
measured in terms of performance on labora- 
tory tasks, these deficits are reported with 
such regularity that they would appear to be 
one of the more demonstrable and stable 
characteristics of people labeled as schizo- 
phrenic. The researchers who have done this 
work have been concerned with a variety of 
questions such as whether schizophrenic pa- 
tients are deficient because they are not 
responsive to external stimulation (Slechta, 
Gwynn, & Peoples, 1963) or because they 
have limited ability to think abstractly (Mc- 
Gaughran & Moran, 1956). The two studies 
reported in the present paper are related to 
two of the many existing questions about this 
phenomenon: (4) what role do other people, 
and especially the patients’ parents, play in 
these deficits, and (b) in what way are the 
deficits affected by censure and praise? Rele- 
vant research will be reviewed briefly. 


1 This study was financed by a research grant from 
the National Institute of Mental Health, United 
States Public Health Service (M-6167). 

2Now at Wesleyan University. 

3 Now at the University of Kentucky. 


When the tasks assigned have been varied 
with respect to presence or absence of human 
contents, the presence of such contents has 
been found to significantly increase the 
deficits displayed by schizophrenic patients 
(Brodsky, 1961; Davis & Harrington, 1957; 
Whiteman, 1954). As these findings imply, 
the physical presence of other persons also 
seems to cause schizophrenic patients to per- 
form tasks less adequately than when people 
are absent (Felice, 1961; Fischer, 1963; 
Schooler & Spohn, 1960). What is especially 
pertinent for this investigation, though, is 
that a number of studies have indicated that 
the parents play a particularly important role 
in the elicitation of behavioral deficits. Thus, 
Dunham (1959), Dunn (1954), Kreinik 
(1959), and Turbiner (1961) have all demon- 
strated that some of the deficits appear to be 
specifically associated with cues of parents or 
parents in certain kinds of activities. The 
most direct study of this association between 
parental presence and defective behavior, 
however, is that carried out by Farina and 
Dunham (1963). These authors induced a 
group of schizophrenic patients to engage in, 
a rather conflictful interaction with thei 
parents and immediately after this had them 
perform a task in the presence of cues of 
the parents. When their performance was 
compared to that of a group of schizophrenic 


patients who had not seen their parents for 
at least a week, it was found to be signifi- 
cantly poorer. One of the two studies reported 
here is quite similar to the Farina and Dun- 
ham (1963) study but employs a number 
_ of methodological improvements including a 
control group of nonschizophrenic patients. 

The second study is closely related to the 
first. Other groups of schizophrenic and con- 
trol Ss were administered a preliminary task, 
after which their performance was either 
praised, censured, or not evaluated. They were 
subsequently assigned the same tasks as were 
given Ss of the first study. Thus, this investi- 
gation was designed to facilitate the interpre- 
tation of the results of the first study. If the 
expected deficits were manifested by patients 
following interaction with the parents, the 
second study might provide meaningful cues 
about the specific facets of the interaction 
which led to such deficits. For example, 
assuming the results of the second study indi- 
cated that censure led to inadequate perform- 
ance, the findings would argue that it was 
parental disapproval and criticism which were 
responsible for the patient’s poor performance 
in the first study. 

The second investigation was also designed 
to provide data which might potentially bring 
some clarity to the confusing array of results 
regarding the effect of praise and censure on 
performance of schizophrenic patients. This 
literature, which has been reviewed recently 
(Buss & Lang, 1965), contains reports of 
censure leading to decrements (Garmezy, 
1952; Webb, 1955) as well as to improve- 
ment in adequacy of task performance (At- 
kinson & Robinson, 1961; Cavanaugh, Cohen, 
& Lang, 1960). However, the nature of the 
tasks used in these contradictory studies has 
varied considerably. Thus, Garmezy (1952) 
had patients discriminate tones varying in 
pitch while Atkinson and Robinson (1961) 
required their Ss to learn lists of paired ad- 
jectives. Since it has been shown that for 
normal Ss the effectiveness of verbally ad- 
ministered reward and punishment changes as 
a function of the complexity of the task 
(Meyer & Offenbach, 1962), it may be that 
some of the discrepant results are due to 
differences in tasks used. It is partly for this 
reason that in the present studies two tasks 
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were used, one being an extremely simple 
reaction-time (RT) task and the other a 
complex anagrams task requiring the con- 
struction of new words out of a given word. 

There is at least one more variable which 
might conceivably account for some of the 
inconsistent results. Since the category of pa- 
tients diagnosed as schizophrenic is recognized 
to be an extremely heterogeneous one, Ss of 
previous studies may have differed with re- 
gard to important personal characteristics. 
This variability among patients has led to the 
identification of a number of dimensions, such 
as process-reactive and good-poor premorbid- 
ity and to the discovery that patients differ- 
ing with respect to these dimensions behave in 
distinctly different ways (Herron, 1962). For 
this reason, the two studies used two groups 
of schizophrenic patients, one group having 
had a good premorbid history (Goods) and 
the second group a history of inadequate 
interpersonal relationships (Poors) as deter- 
mined by a Phillips Scale rating (Phillips, 
1953). This dimension was used since it is 
objectively assessed and it is highly corre- 
lated with other dimensions for categorizing 
schizophrenic patients (Garfield & Sundland, 
1966). The finding that Goods and Poors 
respond differently to praise and censure 
would imply that some of the existing con- 
fusion regarding psychological deficits may be 
due to differences in patient characteristics in 
these contradictory studies. 


Stupy 1 
Method 


The Ss of this investigation were three groups of 
hospitalized male psychiatric patients and their bio- 
logical parents. These Ss are described in detail in 
earlier reports (Farina & Holzberg, 1967; Farina & 
Holzberg, 1968). AIl Ss had to meet a series of 
rather stringent criteria, and there were no significant 
group differences for age, education, family size, or 
length of time the parents had been married. One 
group (124) consisted of nonschizophrenic psy- 
chiatric patients, while all patients in the remaining 
two groups were diagnosed as schizophrenic, one of 
these groups (z—26) being composed of patients 
rated as Goods while patients in the last group 
(1— 24) were classed as Poors. 

For each group, half the patients performed the 
tasks before interacting with their parents (TBI 
condition) While the remaining half performed these 
immediately after the interaction (TAI condition). 
The interaction procedure is fully described in the 
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earlier reports. In brief, the mother, son, and father 
were individually asked how they believed six 
hypothetical problem situations involving parents and 
children should be resolved. The three members 
were then brought together and asked to indicate 
how they would resolve the same problems as a 
family. Both the individual and joint sessions were 
tape-recorded, and the recordings were subsequently 
used to obtain measures of conflictual behavior. 

The first task given the Ss, the anagrams task, 
required them to make as many new words as pos- 
sible out of a given word within a 2-min. time 
period. They were given an initial practice word, 
“table,” and, if necessary, were helped to make new 
words such as "late" and “ale.” Following the 
practice word, they were presented with the words 
“chair,” “bossy,” “scolding,” “mother,” and “father,” 
in the order listed. A record was made of all new 
words as well as the time in the 2-min. time period 
when each was produced. The second task measured 
simple RT and required visual motor skills in con- 
trast to the thinking necessary to perform the 
anagrams task. After a standard practice period, 
each patient was given seven trials at each prepara- 
tory interval of 1, 2, 4, 8, and 16 sec., all seven 
trials at the 1-sec. interval being given first and 
then followed by the other intervals in the order 
listed. 


Results 


The results obtained are summarized in 
Table 1. The patients’ productions were 
classed as words if the word appeared in Funk 
& Wagnall's Standard College Dictionary, 
1963 Edition, otherwise they were scored as 
errors. As may be seen, the differences among 
groups and between conditions are quite 
small, not particularly consistent, and seem- 
ingly uninterpretable. Other measures not 
presented in Table 1, such as the number of 
words produced for each specific word, num- 
ber of three-letter words, etc., show a pattern 
quite similar to that of Table 1. These re- 
sults were analyzed by means of three-factor 
(groups, condition, and word or preparatory 
interval) analyses of variance as described 
by Winer (1962, pp. 337—349). There were 
no groups of condition main effects nor were 
any of the condition interactions with the 
other variables significant. Only one signifi- 
cant interaction involving the groups factor 
was found, and it was a Groups X Prepara- 
tory Interval interaction for RT. The Poors 
were relatively slower than the Goods and 
Controls on the 2- and 4-sec. preparatory 
intervals. Additional analyses revealed that 
in doing the anagrams task the Controls ar- 


TABLE 1 


PERFORMANCE ON ANAGRAMS AND REACTION-TIME 
Tasks BY EACH GROUP OF SUBJECTS AS A FUNCTION 
OF CONDITION 


(Srupv 1) 
Anagrams task Keane 
Group Mean no. | Totalno, | Mean time 
words pro- | errors made per trial in 
duced to each | to all words | 1/100ths of a 
|word presented| presented second 
Controls 
TBI 9.2 82 20.2 
TAP 8.3 7.8 18.9 
Goods 
TBI 7.1 3.5 20.0 
TAI 7.5 6.3 21.1 
Poors 
TBI 8.4 5.2 22.7 
TAI 7.6 5:3 21.1 


* Performed task before interacting with parents. 
b Performed task after interacting with parents. 


rived at the first word significantly faster 
than the Goods and Poors and that they pro- 
duced the last word significantly later in the 
2-min. period than the Goods and somewhat 
later than the Poors. Thus, the Control group 
Ss appear to have worked longer than the 
schizophrenic patients. No other differences 
were found. i 

Although these results suggest that the 
interaction with the parents did not influence 
the performance on the two tasks, it is still 
possible that some of the patients were af- 
fected by the experience. Conceivably the 
interaction was benign for some of the pa- 
tients but traumatic for others because of the 
difference in degree of conflict they encoun- 
tered in interacting with the parents. It would 
seem reasonable to expect that traumatized 
patients would manifest more deficits in their 
performance than other Ss. To test this 
hypothesis, 37 Ss in the TAI condition were 
ranked in terms of the amount of conflict 
their parents had expressed toward them 
during the interaction. Eliminating the pa- 

4The indexes used were: failure to agree (when! 
the mother and/or father disagreed with the son), 
interruptions (when the mother or father interrupted 
the son), and disagreements and aggressions (dis- 
played by either parent toward the son). Further 
details about the indexes are available in the earlier 
publications. 
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tient who was at the median, high, and low 
parental conflict groups of 18 Ss each were 
formed. Analyses of variance revealed that 
the high-conflict group made significantly 
more errors in the anagrams task (F — 10.1, 
df = 1/170, p<.01) and produced signifi- 
cantly more words (F = 214, df = 1/170, 
p «.01) than the low-conflict groups. Since 
these two dependent measures, errors and 
words, are only slightly correlated (r — .105), 
these findings suggest that a conflictual inter- 
action with the parents causes patients to 
make more errors or to produce more words 
without changing the proportion of errors. 
There was no relationship between group 
membership and either tendency to produce 
more words or to make more errors. No 
differences of any sort were found for the 
reaction time results. 


Srupv 2 
Method 


The Ss of the second study were also three groups 
of hospitalized male psychiatric patients, and, as in 
the former study, one group was composed of non- 
schizophrenic patients, the second of Good, and the 
third of Poor premorbid schizophrenic Ss. There were 

0 patients in each group, and parents were not 
involved in this study. These Ss had to satisfy the 
same criteria as those in the first study except the 
one requiring that the parents be alive, in good 
health, and living together. These criteria are de- 
scribed in the earlier publication (Farina & Holzberg, 
1967). Groups were comparable for education, occu- 
pation, and number of hospital admissions, The 
Goods were significantly older than the Controls and 
Poors, however, while the latter two groups did 
not differ from each other. In view of the results, 
this age difference does not present a problem. They 
appear to be reasonably representative of relatively 
acute state hospital patients, being about 30 yr. of 
age on the average and of low socioeconomic status. 

All Ss meeting the criteria were taken to an experi- 
mental room and given two tests. One was the WAIS 
Vocabulary subtest and the other was a measure of 
how rapidly the S could tap a standard telegraph 
key. The latter task required S to tap as rapidly as 
possible for 30 sec. after a 5-sec. practice period, the 
number of taps being automatically counted by a 
Veeder Roots counter. These tasks served as a pre- 
text for the experimental manipulation which con- 
sisted of praising or censuring the Ss. A control 
group received no evaluation. One-third of each 
group of Ss, Goods, Poors, and nonschizophrenic, 
were randomly assigned to each of the experimental 
conditions. 

Following the two initial tasks, patients in the 


praise condition were told that they appeared to 
have done very well but that the E would score 
the records and then inform them. The patients were 
allowed to wait for 3 min. while the protocols were 
apparently scored and they were then told they had 
done much better than anticipated and that they 
were indeed very good at those tasks. They were 
then instructed they would do some other things 
which were similar, and the RT and anagrams tasks 
were administered. These tasks were identical to 
those of the first study except that three additional 
words, “picture,” “kitchen,” and “curtain,” were in- 
cluded in the anagrams task. These were given after 
the word “chair.” In the censure condition the same 
procedure was followed but the patients were told 
they appeared not to have done very well and, after 
the 3-min. wait, that they had done very poorly and 
were very poor at these tasks. The nonevaluated 
group Ss were made to wait 3 min. after being told 
that the E needed to do something unrelated to the 
study, and no information about their performance 
was given. This is substantially the same procedure 
which was used by Webb (1955), who reported large 
differences in task performance between his groups 
of praised and censured schizophrenic patients. For 
each group, 5 of the 10 Ss in each condition received 
the RT task first, and the remainder received the 
anagrams task first. 


Results 


Results were examined by means of four- 
factor analyses of variance in which three of 
the variables were between Ss and one was 
within Ss. The three between-Ss variables 
were groups (Goods, Poors, and Controls), 
evaluation (praise, censure, and nonevalua- 
tion), and order of tasks (RT first and ana- 
grams first). The within-Ss variable was either 
preparatory interval or word depending on 
which of the two measures was being exam- 
ined. Results of these analyses were consistent 
and unambiguous. For number of words pro- 
duced in the anagrams task and for time 
required to respond in the RT task there were 
significant main effects for groups. The F 
Scores were 3.36 and 3.56, respectively, 
which for df — 2 are statistically significant 
( < .05). As may be seen from the mean 
Scores presented in Table 2, the Controls 
produce more words and react faster than the 
Ss of the other groups, while Poors produce 
the least number of words and are the slowest. 
No other differences were found to be signifi- 
cant for either measure except for main effects 
for preparatory interval for RT and words 
for the anagrams task which are to be ex- 
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TABLE 2 


PERFORMANCE ON ANAGRAMS AND REACTION-TIME 
Tasks By EACH GROUP OF SUBJECTS AS A Func- 
TION OF TYPE OF EVALUATION 


(Srupy 2) 
Anagrams | Reaction-time 
task: task: 
e M i 
k Mean mro | per tzial in 
duced to each | 1/100ths of a 
word presented second 
Controls 
Praised 9.10 19.07 
Censured 9.28 18.70 
Nonevaluated 8.88 19.49 
Total group average 9.08 19.09 
Goods 
Praised 7.89 23.79 
Censured 6.95 21.26 
Nonevaluated 8.00 20.54 
Total group average 7.61 21.86 
Poors 
Praised 7.88 20.82 
Censured 7.34 25.31 
Nonevaluated 6.75 27.49 
"Total group average 7.32 24.54 


pected and are not particularly informative. 
Two other dependent measures from the ana- 
grams task, number of errors and time re- 
quired to produce the first word, were simi- 
larly analyzed. For these scores, only the main 
effect for words was found to be significant. 
Thus, the results indicate that neither praise 
nor censure had any effect on the three groups 
of Ss nor did the groups perform differently 
as a function of the task administered. 


Discussion 


The clearest implication of the present 


findings is that the phenomenon of behavioral 
“deficits in schizophrenia is an extremely 


complex one. It was found that the perform- 
ance of high- and low-conflict Ss differed on 
the anagrams but not on the RT task, and 
that patients exposed to high parental con- 
flict made more errors and produced more 
words than low-conflict Ss. Moreover, while 
this marked effect was found as a function 
of the nature of the interaction, the seem- 
ingly more important variable was that of 
being exposed or not being exposed to a con- 
flictual interaction with the parents prior to 


the task. This did not appear to meaningfully 
influence the performance. These findings, 
then, do indicate that such factors as the 
nature of the tasks used, the kinds of depen- 
dent measures used, and the nature of the 
manipulations to which patients are exposed 
significantly affect results. For example, on 
the basis of the present study one might 
conclude that exposure to the parental inter- 
action had either a facilitating or a debili- 
tating influence upon the behavior of schizo- 
phrenic patients depending on the dependent 
measures considered. A recent study by Lup- 
pino (1966) further illustrates the complexity 
of this research area. Luppino showed that 
praise, administered the same number of 
times in the same way by the same E for 
doing the same task, can lead to very different 
performance depending on whether the easy 
or difficult parts of the performance are 
praised. The general conclusion to which this 
leads is that it is very premature at the 
present time to attempt to infer the nature 
of schizophrenia from studies of psychological 
deficits as is frequently done (Buss & Lang, 
1965; Lang & Buss, 1965). Additional and 
much more sophisticated studies should pre- 
cede such an effort if we are not to be badly 
misled. 

Some of the findings of the present study 
are relevant to much contemporary theorizing 
about the nature of schizophrenia and its 
etiology. Such theorizing supposes that the 
family, and especially the mother, plays an 
important role in this condition and that the 
parent-child relationships have been more 
disturbing for schizophrenics than for non- 
schizophrenics (e.g., Bateson, Jackson, Haley, 
& Weakland, 1956; Coleman, 1956, pp. 276- 
285). In contrast to these assumptions, the 
first experiment demonstrated that schizo- 
phrenic patients encountered their parents 
under quite realistic and potentially trauma- 
tizing conditions without showing any obvious 
disturbances. The degree of conflict experi- 
enced in the parental interaction greately in- 
fluenced the patients’ behavior, but high 
conflict did not lead to unambiguous deterio- 
ration of behavior as this kind of theorizing 
would predict. It was also found that there 
were no differences between schizophrenic and 
control Ss in their response to parental inter- 
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action (Table 1) nor did these groups react 
differently to the manipulation used in the 
second study. These negative findings may be 
due to the fact that in the present study the 
control groups consisted of nonschizophrenic 
psychiatric patients whereas in other studies 
the control Ss have typically not been psychi- 
atric patients. While a more typical control 
group might have yielded results more similar 
to previous ones, the present findings raise the 
possibility that the deficits which have been 
assumed to be specific to schizophrenic pa- 
tients may also be characteristic of other 
psychiatric populations. 

Although the only variable found to be 
associated with changes in psychological 
deficits was degree of parental conflict ex- 
pressed, group differences were found which 
are quite consistent with earlier studies. In 
the second study, the two groups of schizo- 
phrenic Ss were slower to react in the RT 
task and produced fewer words in the ana- 
grams task than the Controls, and the Poors 
performed less adequately than the Goods. 
While there are some discrepancies between 
the findings of the first and second studies, 
even in the first study the only significant 
group difference obtained indicated the Poors 
to be relatively less adequate at some prepara- 
tory intervals in comparison with Ss of the 
other two groups. A number of other studies 
which, like the present ones, indicate the non- 
schizophrenic Ss to be most adequate, the 
Goods next, and the Poors least adequate in 
the performance of various tasks have been 
reported (Buss & Lang, 1965; Garmezy & 
Rodnick, 1956; Lang & Buss, 1965). Thus, 
the present results, in agreement with the 
preceding ones, indicate that behavioral defi- 
cits are an important facet of psychopathol- 
ogy and merit further research attention. Tt 
should also be noted that these group dif- 
ferences in performance imply that the tasks 
used are valid measures of psychological 
deficits. Although it is possible that other 
procedures might yield much larger group 
differences than the ones used in these 
studies, the results obtained prompt the con- 
clusion that schizophrenic patients are con- 
siderably less influenced by affective experi- 
ence than prior researchers have suggested. 
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ing minimal trance depth despite concerted efforts 
to induce hypnosis. The Ss, at best, were able to 
respond only to ideomotor suggestions such as arm 
levitation and eye closure, attaining a score of < +2 
on the rating scale of trance depth. The mean and 
standard deviation of their scores, respectively, on 
the SHSS:C, were 1.79 and 148. Two Ss had experi- 
ence in simulating from other studies. 


Procedure 


In a prior independent experiment, Ss met E, 
who was to serve as the nonblind E in the present 
study. He scheduled Ss to take part in the present 
study, and administered initial instructions to hyp- 
notic and simulating Ss. All Ss were told that the 
study with the author (E;) would investigate the 
effect of hypnosis on a variety of tests, and that the 
tests would be given on 2 successive days. Simu- 
lating Ss were instructed, additionally, that they 
were to simulate the part of a hypnotized S and 
behave just as they thought a deeply hypnotized S 
would behave. They were asked, explicitly, not to 
disclose the fact that they were unaffected by hyp- 
nosis and to conceal the fact that they had not been 
hypnotized until E, saw them after the experiment 
had been completed. To motivate simulators for their 
task of deception E, told Ss that E, would stop the 
experiment if he caught on they were faking. If E; 
continued the experiment Ss were to assume that 
they were handling the deception well. Even if they 
thought they had made an error they were told to 
continue. It was stressed that intelligent Ss are able 
to handle the deception successfully. 

The study was divided into two sessions and all 
Ss were run in each of three conditions. Condition I 
established a base control set of personality measures. 
Condition II investigated the effect of the artificially 
induced conflict on personality functioning when Ss 
were amnesic for the source of the conflict, and 
Condition III examined the effect on personality 
functioning of lifting amnesia for the complex, 


First Session 


Condition I, Trance was induced in a hypnotic 
session which was oriented personally to individual 
Ss; E varied the induction procedure and hypnotic 
testing to accord with the responsiveness of the S. 
Although the form of testing differed slightly among 
Ss, the following items were administered in a 
standard order (as given) : arm levitation, eye closure, 
arm rigidity with challenge, experience of cold and 
nose itch, fantasy experience with suggested dream 


8 The independent study investigated the effect of 
motivating instructions on the hand strength of low 
and high hypnotizable Ss. At the end of the experi- 
ment Ss were informed what the next study would 
be. The time between the two studies varied for Ss, 
Two Ss went onto the present study immediately 
while others came back as much as a month later, 
The average time between studies was 13 days. 
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episode, delusion about a negatively hallucinated 
number (tested by the inability to handle arithmeti- 
cal computations), age regression with positive visual 
hallucination, posthypnotic suggestion, and suggested 
amnesia for all trance events. 

After awakening, E tested amnesia and the post- 
hypnotic suggestion and reinduced trance if either 
suggestion was not met successfully.4 After success 
on these items a battery of personality tests was 
administered to Ss in the waking state to complete 
the session. The battery contained Card 3 BM of 
the Thematic Apperception Test (Murray, 1943), the 
Rotter Incomplete Sentences Blank, College Form 
(Rotter & Rafferty, 1950), the Semantic Differential 
Test (Osgood, Suci, & Tannenbaum, 1957), and a 
word association test which was devised by the E. 
The Semantic Differential and word association tests 
were designed especially to relate to the complex. 

The Semantic Differential Test employed 10 scales 
and 10 concepts. Concepts were: “child,” “honesty,” 
“hypnosis,” “love,” “me,” “my ideal self,” “my 
mother,” “peace of mind,” “school,” and “self con- 
trol.” Scales were selected to represent Osgood’s three 
main factors of Evaluation (F1), Activity (F2), and 
Potency (F3). Scales representing each factor loaded 
highly on that factor and low on the remaining two 
factors. F1 was represented by “clean-dirty,” “good- 
bad,” and “kind-cruel”; F2 by “active-passive,” 
“excitable-calm,” and “hot-cold”; and F3 by “free- 
constrained," “hard-soft,” and "strong-weak." “In- 
tentional-unintentional” was the tenth scale. Scales 
and concepts were rotated against each other to 
minimize halo effects, and scales representing the 
same factor were alternated in polarity direction to 
prevent the formation of position preferences. The 
test was arranged in a booklet of 10 pages each 
presenting a randomly selected order of the 10 con- 
cepts paired with 10 different scales, Throughout the 
test, concepts were paired only once with each scale. 

The word association test listed 40 words. The S 
was instructed to say the first word which came to 
his mind after he read the stimulus word on a set 
of cards which were handed to him by E. Normative 
data were available for a sample of 500 male and 
500 female American university students (Palermo & 
Jenkins, 1964) on 23 of the stimulus words. When 5 
had completed the test, E ended the session and 
remarked that he would see him the next day for the 
final phase of the experiment. 


*The posthypnotic suggestion was an integral 
part of a study of posthypnotic behavior outside the 
experimental setting. The Ss were given the sug- 
gestion to respond to the cue wor "experiment" 
everytime they heard the word in the next 48 hr. 
The Ss were tested for the response, away from 
the E, by the project secretary as they left the 
building and as they reentered it on the following 
day. Details of the study are reported elsewhere 
(Orne, Sheehan, & Evans, 1968: “Occurrence of post- 
hypnotic behavior outside the experimental setting." 
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Second Session 


Condition 11. On the second day a deep trance was 
established and instructions were given to Ss to 
induce an anxiety conflict with amnesia for the 
source of the conflict.5 Following these instructions S 
was given amnesia suggestions for the entire session, 
awakened and tested for amnesia. Trance was rein- 
duced until S reported no memory of the events of 
the session. The S then was given the test battery 
to determine the effect of the induced conflict on 
personality functioning. 

Condition III. After S had finished the battery he 
was put back into trance and given the following 
instructions: 


When I wake you up now, you will remember the 
incident I told you about. You will remember the 
park and finding the red purse. You had forgotten 
it before but now you remember the incident 
completely. You recall everything just as it hap- 
pened to you. 


5 Conflict instructions to S were as follows: “On 
your way home from school many years ago, you 
crossed a park. Remember many years ago as a 
small child crossing a park on your way home from 
school. Remember it just as it happened to you; 
remember now going across a big park, green and 
round. It was a big park, green and round. Halfway 
across this big park you look down; you see a 
bright, red purse. You pick it up and find two shiny, 
big pennies. Two shiny pennies in a bright, red 
purse. You carry it with you and on your way 
home from school you pass a candy store, and you 
see some candy which you like very much. You are 
tempted to go in and buy it with the two pennies 
but you know you shouldn’t. Your mother would 
want you to give the purse to her so that she could 
find the owner. The temptation, however, is too 
strong and you go in and buy the candy with the 
two pennies. When you get home you give the 
empty purse to your mother and you tell her that 
there was nothing in it. You feel terrible; you lied 
to your mother, and you were dishonest about the 
pennies. You think your mother knew that you were 
not telling the truth and this makes you feel bad. 
She would be so hurt to think you are deceiving her. 
But you keep it all to yourself and don’t tell her. 
As you grow up from childhood, you sometimes feel 
uneasy about being dishonest and lying to your 
mother. You’ve always felt very uneasy about steal- 
ing that money from the red purse which you found 
in the park on your way home from school that day. 
It is only a small thing but you feel terrible about 
it and about not having told your mother. You feel 
very uneasy and anxious but you try to forget it. 
You're grown up now. You're no longer a little 
Child but grown up now, but you still feel very 
uneasy about that incident. You have forgotten that 
time you stole the money and lied to your mother. 
You have forgotten all about it, but it worries you, 
makes you feel uneasy.” 
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The S was awakened, asked what he remembered, 
and given the test battery again. This condition 
studied the effect of absence of amnesia on personal- 
ity functioning and is relevant to the important ques- 
tion whether the essential therapeutic step in the 
relief of neurotic conflict is recovery from amnesia 
or some other manifestation of dissociation. Mc- 
Dougall (1926) suggests that the revival of emotion, 
or reliving the emotional experience in itself, does 
not result in relief. He states that, in cases where 
relief follows, there is a relief of dissociation, and 
this is the essential therapeutic step in the alleged 
abreaction process. What is important in therapeutic 
cure is “the attempt to reintegrate into conscious- 
ness the complex that has become autonomous 
[p. 462].” In the present study, lifting the amnesia 
without removing the original source of the conflict 
allows experimental investigation of this hypothesis. 
It is suggested that in Condition III Ss will show 
less evidence of conflict than in Condition II where 
the complex has been successfully repressed. 

At the conclusion of the experiment when the 
test battery was completed for the third time, S was 
rehypnotized and told that the story he recalled was 
only vividly imagined and that the incident never 
really happened. Amnesia was lifted and S was 
awakened. A comprehensive interview was then con- 
ducted by E}. The E asked Ss for their comments 
on the experiment and their subjective reactions to 
the study. A check was made to establish that hyp- 
notic Ss were in trance throughout the session and 
that the simulators experienced no trance effects. 


RESULTS 


Results for 3 Ss in the hypnotic group 
and 1 S in the simulating group were 
excluded from analysis.) Fourteen hypnotic 
Ss and 13 simulating Ss provided data for 
analysis. 


Sentence Completion Test 


Two independent raters scored the Rotter 
Incomplete Sentences Blank for adjustment. 
Completions were scored by comparing them 
against typical items in empirically derived 
scoring manuals for men and women (Rotter 
& Rafferty, 1950) and by assigning to each 
response a scale value from O to 6. Scores 
were summed over the 40 items to give a 
total score which was taken as the test index 
of adjustment. Tests were coded to preclude 
S identification, and both scorers were un- 


9In the hypnotic group one S was given two 
sessions without conflict; the order of Conditions IT 
and III was reversed for another S; and a third S 
failed to develop any amnesia for the source incident. 
In the simulating group one S refused to simulate 
and showed evidence of entering light trance. 
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familiar with the experimental design. The 
interrater consistency of the raters was ex- 
amined for each group separately on each 
condition, and for groups and conditions as a 
whole. There was essential agreement between 
test scores for all meaningful comparisons 
between raters. The minimum correlation be- 
tween test scores for any comparison was .93; 
the maximum, .99. Table 1 sets out the means 
and variances of adjustment scores listed by 
conditions and, groups. Results are shown for 
the rater who used the wider range of scores. 
High scores represent greater degrees of mal- 
adjustment. 

Results in Table 1 were analyzed by Lind- 
quist’s Type I split plot model of analysis 
of variance (Lindquist, 1953). Classifications 
were for conditions (I, IT, IIT) and S group- 
ing (hypnosis, simulating). There was no 
significant difference in performance between 
hypnotic and simulating Ss over conditions. 
Condition I characteristically showed better 
adjustment scores than Conditions II and III. 
There was a significant main effect for con- 
ditions (F = 7.59, df = 2/50, p < 01), but 
the differences in adjustment scores among 
conditions were evident for both the simu- 
lating Ss and hypnotic Ss. 


Word Association Test 


The Ss’ association responses were analyzed 
in terms of commonality scores, which sum 
the proportional frequencies of words as ex- 
pressed by Ss in the normative population, 
and in terms of new responses which were 
original to the test sample. An original re- 
sponse was defined objectively as one which 


TABLE 1 


MEANS AND STANDARD DEVIATIONS OF ADJUSTMENT 
Scores FoR CONDITIONS AND SUBJECT GROUPS 


Condition 
Group 
Hypnotic 
x 
SD 
Simulating 
bg 
SD 
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TABLE 2 


AVERAGE Commonatity Scores (C) AND ORIGINAL 
Responses (O) ror SUBJECT GROUPS AND 
EXPERIMENTAL CONDITIONS 


Group 
Condition Measure 
Hypnotic Simulating 
I C 5.47 7.55 
o 3.79 .85 
II Cc 5.34 6.90 
o 3.31 2.08 
III Cc 5.47 6.49 
o 2.93 1.92 


was not contained anywhere in the set of 
normative responses. Table 2 lists the average 
commonality scores and numbers of original 
responses for the two experimental groups. 
Results are considered only for the 23 words 
common to the word lists of Palermo and 
Jenkins (1964). 

Results in Table 2 were analyzed by Lind- 
quist’s Type I split plot model of analysis 
of variance. Classifications were experimental 
condition (I, II, III) and S grouping (hyp- 
nosis, simulating). Analysis showed a sig- 
nificant main effect for groups (F = 4.35, , 
df = 1/25, p<.05), and a significant 
interaction between groups and conditions 
(F = 3.67, df = 2/50, p < .05). Simulators 
gave more original responses in Conditions II 
and III as compared with the control condi- 
tion, but hypnotic Ss gave fewer unusual re- 
sponses under both conditions. The latter 
finding is evidence that hypnotic Ss performed 
differently from simulating Ss. Under conflict, 
hypnotic Ss inhibited spontaneity of response, 
while simulators did not. However, in inter- 
preting these differences it is necessary to take 
into account the characteristics of the faking 
Ss introduced by the act of simulating. Simu- 
lators may have felt that, in order to prevent 
repetition of responses already given in the 
control condition, it would be appropriate to 
give original responses in Conditions II 
and III. 

Examination was made of the original re- 
sponses which were related to the complex. 
Five raters rated each original response paired by 
with its relevant stimulus. All responses were ' 
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coded to prevent identification of S grouping 
and condition. Nineteen responses were se- 
lected as complex related which were agreed 
upon by three of the five raters. Examples of 
such responses (in italics) and their stimuli 
are: RED-dislike, BUYING-candy, RED-purse, 
AFRAID-dishonest, FINGERS-found, YOUNGER- 
scared, RED-park, Boy-spend. The average 
numbers of complex related responses for hyp- 
notic Ss were 1.07 and .79 for Conditions II 
and III, respectively. Simulators showed an 
average score of .54 for both conditions. 
These differences between groups were insig- 
nificant, but it is of interest to note that 
although hypnotic Ss showed less tendency to 
give unusual responses in Conditions II and 
III while simulating Ss showed more, they 
actually gave more responses which were 
related to the induced conflict. 


Semantic Differential Test 


Semantic data were analyzed to detect in- 
stances of psychopathological perception, and 
to test for differences in meaning attributed 
to key concepts by hypnotic and simulating 
Ss. Data were arranged to represent actual 
ratings of concepts on scales, and difference 
scores (D scores) between concepts over the 
total semantic space. The D score measured 
the semantic similarity between concepts, and 
took the sum of the squared differences be- 
tween ratings on two concepts for selected 
conditions or groups, summing over all scores 
(Osgood et al., 1957). 

The Ss’ ratings of concepts were averaged 
over nine scales* and the changes in meaning 
of specific concepts were examined for hyp- 
notic and simulating Ss. Differences were 
analyzed between groups for each condition 
and between conditions for the two groups, 
separately. Of 90 comparisons made, only 
three differences were significant (involving 
the concepts hypnosis, me, and school). Con- 
sidering the number of test comparisons, the 
differences obtained can be interpreted reason- 
ably as chance occurrences, 

Distances between each concept (m) and 


* Summation did not include the scale “intentional- 
unintentional” since this scale represents a fourth, 
unknown factor in semantic space and is not inde- 
Pendent of the three main factors selected. 


every other concept were calculated and then 
entered in m X m matrices which represented 
the semantic structure of the total set of m 
concepts and gave the distance or similarity 
relations among them. Matrices of scores for 
real and simulating Ss were correlated for 
Conditions I, II, and III, separately. Analy- 
sis of results showed a high degree of simi- 
larity in performance between hypnotic and 
simulating Ss in each condition. For Condi- 
tions I, II, and III the D-score matrices of 
hypnotic Ss and simulators correlated .76, 
.56, and .58, respectively. 

Change scores from one condition to another 
were correlated for both groups. The change 
in meaning of concepts from Condition I to II 
was similar for hypnotic and simulating Ss 
(for matrices of rating scores, r — .45, df — 
88, p < 001), as was also the change in 
meaning of concepts from Condition II to TIT 
(for rating scales, r = .23, df = 88, p < .05). 
However, the change in meaning from Condi- 
tion I to III for hypnotic Ss bore no rela- 
tionship to that shown by the simulating Ss 
(for rating scores, r = .09, df = 88, p > .10; 
and for difference scores, r= .01, df = 34, 
p> .10). The latter finding is evidence that 
Condition III discriminated hypnotic and 
simulating Ss. 


Thematic Apperception Test (Card 3 BM) 


Stories were analyzed to provide compari- 
son of test protocols with normative data on 
an American population of 50 college students 
and 100 psychiatric patients (Eron, 1950). 
Two raters blind to S grouping and condition 
scored Ss on four variables selected by Eron: 
(a) the emotional tone; (5) emotional out- 
come of the stories (rated on a 5-point scale 
ranging from "very happy" to "very sad"); 
(c) theme, scored according to a comprehen- 
sive checklist provided by Eron; and (d) 
misidentification of the sex of the huddled 
figure which is ambiguously defined in the 
test card. Tests were also rated on a 5-point 
scale measuring complex relatedness. 

The emotional tone and outcome of the 
stories were rated as characteristically un- 
happy for all Ss. Frequency analysis of themes 
showed that, for both raters, hypnotic Ss gave 
different themes in the control condition from 
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TABLE 3 


FREQUENCIES OF SAME-SEX IDENTIFICATION OF 
AxBIGUOUS FIGURE BY HYPNOTIC AND 
SIMULATING SUBJECTS 


Condition 


Group 
I Il Ill 
Hypnotic 2 5 2 
Simulating 0 1 1 


simulating Ss, but emphasized the same 
themes in Condition II. In Condition II both 
groups of Ss most frequently reported "guilt- 
remorse" and "pressure from parents" which 
are clearly complex related, In Condition III 
both groups expressed “guilt-remorse,” but 
hypnotic Ss next emphasized “inadequacy” 
while simulating Ss expressed “exhaustion.” 
The hypnotic group gave three unusual 
themes in the control condition which were 
not included in the special list compiled from 
responses typically given to the card (Eron, 
1950), whereas simulators gave no unusual 
themes, On Condition II, however, hypnotic 
Ss gave no unusual themes, whereas simu- 
lators gave two atypical themes. This reflects 
the tendency, previously observed, for hyp- 
notic Ss to show reduced spontaneity under 
induced conflict. 

Nearly all Ss immediately identified the 
ambiguous figure as of the opposite sex. In 
only 10 instances did Ss identify the figure 
immediately according to their own sex. 
Table 3 sets out the frequencies of such 
occurrences. 

Table 3 shows that identification according 
to S’s own sex was characteristic only of the 
hypnotic group in Condition II. Four of the 
five Ss who perceived correctly in this condi- 
tion were observed to respond posthypnotically 
outside of the experimental setting when 
tested by the project secretary on a pre- 
arranged cue (see Footnote 4). The phenome- 
non of extraexperimental response to a post- 
hypnotic suggestion is known only to occur 
with Ss capable of deep trance. The finding 
suggests that identification with the huddled 
figure typically perceived as unhappy and 

8 Actual frequency scores differed between the two 


raters, but the pattern of results reported here was 
identical for both raters. 
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with little chance of escaping the conflict was 
associated with trance depth. Hypnotic Ss, 
who were markedly susceptible to trance, 
identified more with the figure under induced 
conflict than did other Ss. 

An analysis of variance (Lindquist’s Type 
I design) carried out on the ratings of com- 
plex relatedness ® showed significant differ- 
ences among conditions across Ss (F = 10.40, 
df = 2/50, p < .001), but there was no dif- 
ferential effect for hypnotic Ss. There was 
some evidence that Condition III discrimi- 
nated Ss. Eighty-six percent of the hypnotic 
Ss and 85% of the simulating group were 
rated as having given complex related re- 
sponses in Condition II; in Condition III, 
71% of the hypnotic Ss continued to give re- 
lated responses, but only 46% of the simula- 
tors showed any influence of the complex in 
their stories. Some simulating Ss apparently 
expected that the complex should be removed 
in Condition III, but hypnotic Ss character- 
istically behaved uniformly under both condi- 
tions of conflict. 

At the end of Session I on the first day, E» 
rated Ss on whether he thought they were 
simulators or true hypnotic Ss. The Ss’ re- 
sponses were analyzed in terms of E's per- 
ceptions to detect the possible effect of E bias 
on Ss' test performance. Data showed that 
spontaneity changes were independent of E's 
perception. The hypnotic Ss gave fewer origi- 
nal responses in the conflict condition whether 
they were perceived as real or faking, and 
simulators likewise increased new responses 
independently of E’s perception. Similarly, E's 
perception did not discriminate hypnotic or 
simulating Ss on the degree of complex re- 
latedness given in their TAT stories. How- 
ever, in Condition II, E perceived six of the 
seven Ss who identified the TAT figure ac- 
cording to their own sex as good hypnotic Ss. 
For the hypnotic Ss, then, identification with 
the ambiguous figure may be an index of their 
involvement under induced conflict, or may 
reflect E's bias that S should identify with 
the figure in a particular way. 


9 Two raters scored the stories for complex re- 
latedness. The interrater reliability for the measure 
was .78. Results are reported for the rater who, 


over all groups, rated more stories as complex- 
related. 
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Discussion 


The artificial induction of conflict had con- 
siderable effect on S's test behavior, but initial 
differences in the control testing between hyp- 
notic and simulating Ss reduced differentiation 
of Ss in the conditions of induced conflict. 
There was evidence, though, that reduction of 
spontaneity was associated with hypnotic 
rather than simulating performance. Hyp- 
notic Ss showed fewer unusual association re- 
sponses in Conditions II and III while simu- 
lating Ss increased the number of original 
responses, and hypnotic Ss ceased giving un- 
usual themes to the TAT card while simulat- 
ing Ss increased the number of such themes 
reported. Both these findings suggest that 
fluidity of thought organization may have 
been impaired for the hypnotic Ss and that 
their mental processes were interfered with by 
the conflicting, emotionally laden ideas per- 
taining to the complex. The decrease in spon- 
taneity suggests some degree of maladjust- 
ment since the nature of hypnotic Ss’ re- 
sponses to the test stimuli deviated from their 
usual style of responding. Results, however, 
are not completely unequivocal. It is conceiva- 
ble that the act of simulating accounted for 
the increase in original responses simulators 
gave under the conflict conditions. If this is 
so, the simulating S’s failure to respond in a 
similar fashion to the hypnotic S does not in- 
dicate that loss of spontaneity is a genuine 
effect of hypnosis, but, rather, the loss may be 
attributed to the psychological situation in 
which simulators were placed. 

_ The main consensus of evidence was that 
simulators complying with the demands of the 
experimental situation displayed essentially 
the same pattern of behavior as did hypnotic 
Ss in the conflict situation. In the conflict con- 
dition, simulators demonstrated similar com- 
monality Scores, emotional themes to projec- 
tive test material, and distortions in percep- 
tion of semantic material, as did hypnotic Ss. 
The pattern of results suggests that previous 
findings on the effect hypnosis has on per- 
sonality after the induction of conflict (e.g., 
Reyher, 1961) can be equally well explained 
by the demand characteristics of the experi- 
mental situation as by appeal to the effects 
of hypnotic treatment. This is not to say that 


hypnosis failed to produce genuine effects, but 
the results of previous studies must be re- 
garded as equivocal without adequate control 
for S expectations. 

The extent to which test behavior of neu- 
rotically anxious Ss can be simulated draws 
attention to the need to develop clinical tests 
which are not amenable to conscious control. 
Present findings support the conclusion of 
Meltzoff (1965) that Ss can easily manipu- 
late their responses on standard personality 
tests. The sentence completion test, for ex- 
ample, largely provides only information that 
S is “willing to give.” This questions the situ- 
ational validity of such tests. The problem is 
not so much one of estimating the extent to 
which such tests will behave in a predictable 
fashion following induced changes in the 
testee, but the extent to which S’s test per- 
formance reflects the personality change and 
is not an artifact of the experimental situa- 
tion. 

There was some evidence that Condition IIT 
discriminated hypnotic and simulating Ss. In 
this condition the source incident was left 
intact but amnesia for it was removed. The 
semantic differential change scores for hyp- 
notic Ss from Condition I to III bore no re- 
lationship to those of simulating Ss, and simu- 
lators reported fewer complex related stories 
to the TAT. Hypnotic Ss expressed greater 
subjective involvement, projecting feelings of 
personalized inadequacy into their TAT sto- 
ries while simulators took refuge in the rela- 
tively “safe” category of exhaustion. These 
findings indicate that the demand character- 
istics were less obvious for this condition 
than for Condition II. Simulators apparently 
thought some difference in behavior was ex- 
pected of them from one condition to the next. 
The evidence, in support of data from the 
postexperimental inquiry, indicates that some 
simulating Ss believed that the conflict should 
be alleviated when awareness of the source 
incident was present. The differences between 
the two groups in Condition III argue for 
the validity of the behavioral effects observed 
for hypnotic Ss in that condition. 

Hypnotic Ss’ behavior in Condition IIT in- 
dicated that repression was not a necessary 
condition for the occurrence of painful emo- 
tional states. Condition II and III were be- 
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haviorally indistinguishable and both demon- 
strated the presence of conflict. The removal 
of amnesia did not itself result in relief of the 
emotional experience. Neither did it result in 
greater conflict value as might be suggested 
by the theorizing of Freud, which states that 
repression protects S from painful emotional 
states, its removal meaning that the patient 
must “face pain." It is possible, however, as 
Bobbitt points out that the development of 
full awareness of conflict may accompany 
changes in disturbance so that maximum dis- 
turbance occurs when awareness is partial, but 
less than complete. 

Hypnotic Ss characteristically expressed 
considerable subjective involvement under 
induced conflict, identifying with the ambigu- 
ously defined figure more than simulating Ss. 
The subjective reality of their feelings argues 
for changes and distortions in their behavior 
which accord with individual differences in 
the personality dynamics of Ss. In support of 
this, the evidence showed that hypnosis had 
no single effect on Ss’ personalities even 
though the same paramnesia was suggested to 
all Ss. Subjective reactions ranged from ex- 
pression of somatic symptoms such as head- 
aches to symptoms of unspecified distress, 
paranoid ideas, and aggressive urges. Hypnotic 
Ss all reported that they experienced amnesia 
for the conflict material yet showed varying 
signs of breakdown of ego defenses. The most 
extreme reaction came from a hypnotic S who 
was agitated enough to hurl the TAT card 
at the E and say that he felt as if the police 
were watching him through a one-way screen 
which was placed in the room. 

In conclusion, it should be stressed that, 
within the present methodology, null findings 
between groups can be unambiguously in- 
terpreted as indicating that S expectations 
could be a sufficient explanation for the re- 
sults. Differences between groups are more 
difficult to interpret for the reason that simu- 
lators and hypnotic Ss are not equally sus- 
ceptible to trance, and such differences could, 
presumably, be correlated with other personal- 
ity and motivational traits which may com- 
plicate interpretation of performance. Inter- 
pretation of differences is also complicated by 
E's instructions to Ss to simulate. Simulating 
instructions can be expected to yield specific 
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treatment effects which require independent 
study, Alternative methodologies such as those 
employed by Reyher (1961) and Moore 
(1964) which use only susceptible Ss in both 
the hypnotic and pretense conditions over- 
come the problem of personality differences 
among Ss, but fail to control for the demand 
characteristic effects resulting from using Ss 
as their own controls. 

The limitations of the present design do not 
negate the main finding of the experiment: 
many of the signs of so-called maladjustment 
and emotional disturbance as inferred from 
Ss’ behavior after the induction of conflict 
can be equally well explained by the expecta- 
tions experimental Ss bring to their task, as 
by theories of hypnosis. Where differences do 
exist between simulating and hypnotic per- 
formance the evidence suggests that the anxi- 
ety-toned conflict has a repressive influence 
which reduces the spontaneity of the conflicted 
Ss. Results also show that repression is not a 
necessary condition for the occurrence of pain- 
ful emotional states, 
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OBJECTIVE MEASURES OF SUSPICIOUSNESS * 


NOBLE A, ENDICOTT, SIDNEY JORTNER, anp EILEEN ABRAMOFF 
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The purpose of this study was to gather reliability and validity data on several 
measures of suspiciousness derived from psychological tests. Clinical ratings of 
suspiciousness were made on the basis of interviews with two groups of psychi- 
atric patients: 90 hospitalized patients and 40 outpatients. Different “degrees” 
of suspiciousness were found to be characterized by qualitatively different types 
of Holtzman Inkblot Technique (HIT) content. Significant correlations were 
obtained in both groups of patients between the clinical ratings, the MMPI 
suspiciousness scale, and two content scales from the HIT. A multiple correlation 
combining the three test measures correlated .61 and .69 with rated suspicious- 
ness in the hospitalized and outpatient groups, respectively. 


The purpose of this study was to gather 
reliability and validity data on a number of 
measures of suspiciousness derived from psy- 
chological tests. It is hoped that studies of 
this type will eventually lead to the develop- 
ment of objective measures of the trait of 
suspiciousness which can be used in evaluat- 
ing various forms of psychiatric treatment and 
in testing psychodynamic hypotheses. 

Suspiciousness was defined in the present 
study as a psychological trait expressed by a 
distrust of others’ motives in interpersonal re- 
lationships. This is closely related to, but not 
synonymous with, the diagnostic concept of 
paranoia. The present writers chose to study 
*suspiciousness" rather than “paranoia” be- 
cause they thought psychological traits of this 
type would be more amenable to operational 
definition than the more global traditional 
diagnostic concepts. 

A review of the literature failed to reveal 
any studies relating scores derived from psy- 
chological tests to suspiciousness, per se. How- 
ever, there have been a number of previous 
investigations of the relationship between 
test scores and paranoia or paranoid tenden- 
cies (Bower, Testin, & Roberts, 1960; Du- 
Brin, 1962; Friedman, 1957; Harris, 1949; 
Kleinmuntz, 1960; Murati, 1955; Nagge, 
1951; Reznikoff & Nicholas, 1958; Ribler, 
1957; Valentine & Robin, 1950; Wertheimer, 
1953; Zeichner, 1956). 

"There is a general consensus among clinical 
writers that Rorschach content such as eyes, 

1This study was supported in part by United 


States Public Health Service, National Institute of 
Mental Health Grant MH-05518. 


masks, concealed or obscured figures, etc., 
frequently reflects paranoia or paranoid 
tendencies (Hertz, 1948; Lindner, 1947; 
Phillips & Smith, 1953; Schafer, 1954). The 
studies by Harris (1949) and DuBrin (1962) 
of Rorschach content generally supported 
these contentions, while Wertheimer’s (1953) 
investigation of “eye” content failed to do so. 
In addition to the Rorschach, the TAT 


- (Friedman, 1957; Nagge, 1951; Valentine & 
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Robin, 1950; Zeichner, 1956), MMPI (Klein- 
muntz, 1960), Figure Drawings (Reznikoff & 
Nicholas, 1958; Ribler, 1957) and the Aspira- 
tion Level Test (Murati, 1955) have been 
used to compare paranoids with nonparanoids. 

The studies mentioned above suffer from a 
number of methodological limitations if one's 
goal is the development of an objective mea- 
sure of suspiciousness. Most of these investiga- 
tions compared the test scores of a paranoid 
group with those of one or more relatively 
nonparanoid groups. With this type of ex- 
perimental design, it is not possible to deter- 
mine whether the differences between the test 
scores of the groups are the consequence of 
differences between the groups in the “degree” 
of suspiciousness, or whether they are due to 
other personality characteristics in which the 
groups also differ. In addition, the majority of 
these investigations have used tests with many 
psychometric limitations (e.g., the Rorschach) 
or have used scores derived from only one 
psychological test. 

The present study attempted to cope with 
these problems by relating each of the pro- 
posed test measures of suspiciousness to a 
more highly graduated criterion measure of 
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suspiciousness (i.e., ratings of suspiciousness 
on a 5-point scale). In addition, each of the 
measures used in this study was correlated 
with ratings of 41 other symptomatic or per- 
sonality variables. This permitted an analysis 
of the relative contribution made by each of 
these psychological states or traits to the pro- 
duction of the proposed test measures of sus- 
piciousness. 

In an effort to deal with some of the other 
limitations of previous research in this area, 
the Holtman Inkblot Technique (HIT; Holtz- 
man, Thorpe, Swartz, & Herron, 1961) was 
used instead of the Rorschach, and a relatively 
extensive battery of tests was administered to 
each patient. 


METHOD 
Subjects 


Three groups of patients were involved in the 
present study: a hospitalized group (n= 90), an 
outpatient group (n=40), and a mixed group of 
hospitalized patients and outpatients (n= 84) used 
in a previous study (Endicott & Endicott, 1963). The 
hospitalized group represents all of the patients ad- 
mitted to the Psychiatric Treatment Research Cen- 
ter, State University of New York, Brooklyn, be- 
tween May 1963 and October 1964, who had either a 
verbal or performance Wechsler Adult Intelligence 
Scale (WAIS) IQ score of 80 or above and who 
remained in the Center long enough to be tested and 
interviewed. Of these patients, 63 were male and 27 
were female. The patients were primarily of lower- 
or lower-middle-class backgrounds and had an 
average of 11.3 yr. of formal education (range 6-18 
Yr). The mean full scale WAIS IQ was 101.9 (range 
75-131), and their mean age was 33.7 yr. (range 17- 
58). The diagnoses for this group were: alcohol 
addiction, 41; schizophrenic reaction, 32; personality 
disorder (other than alcoholism), 9; psychotic de- 
pressive reaction, 1; and manic-depressive reaction, 

The outpatient group (n= 40) was taken from the 
senior author’s private practice. Of these patients, 18 
Were male and 22 were female. They had a mean age 
of 32 yr. (range 16-59). The group was predomi- 
nately middle class in background and had a mean 
of 13 yr. of formal education (range 8-20). The 
diagnoses for this group were: alcohol addiction, 2; 
schizophrenic reaction, 9; personality disorder, 12; 
and psychoneurotic reaction, 17. 

The third group of 84 patients were involved in 
only one aspect of the study. They provided the 
data for the initial derivation of the new MMPI 
Suspiciousness scale used in this study. A description 
of this group of patients is presented in a previous 
Publication (Endicott & Endicott, 1963). 


Procedure for the Hospitalized Patients 


Shortly after the patients were admitted to the 
hospital they were interviewed jointly by the first 
two authors. Following the interview, both made 
independent ratings of the patient’s suspiciousness 
and each of the other psychological variables being 
investigated. All ratings were made without knowl- 
edge of the test results. The product-moment reli- 
ability coefficient for the suspiciousness ratings of 
these two raters was .92. Following the independent 
rating, the authors arrived at a consensus rating in 
conference. The consensus rating was used as the 
criterion measure in this study. 

A battery of psychological tests consisting of 
Series A of the HIT, the WAIS, the MMPI, the 
Draw-a-Person Test, the Embedded Figures Test 
(EFT), and a portion of Cattell’s O-A battery was 
administered after the initial interview. A more de- 
tailed account of the procedure followed can be found 
in a previous publication (Endicott & Jortner, 1966). 


Procedure for the Outpatients 


A similar procedure was followed with these pa- 
tients except they were rated only by the senior 
author after five sessions and were given a less ex- 
tensive battery of psychological tests. 


Suspiciousness Rating Scale 


A 5-point scale ranging from (1) “minimal” to 
(5) “extreme” suspiciousness was used in making the 
ratings. Each point was briefly defined in terms of 
the apparent pervasiveness and "intensity" of the 
trait, as follows. 


1. Minimal suspiciousness.—The S would be de- 
scribed by observers (friends, associates, inter- 
viewers, etc.) as “trustful” or “not particularly 
cautious” in his dealings with other people. This 
should not be taken to mean that an individual 
rated in this category never distrusts anyone. 
Rather that his tendency to distrust others is 
primarily determined by the “realities” of the 
situation and he does not chronically expect to be 
taken advantage of or disappointed in his dealings 
with other people. 

2. Mild suspiciousness.—The Ss classified here 
would frequently be described by others as 
“cautious” or “somewhat over-cautious” in their 
dealings with other people. Most observers, associ- 
ates, etc., would not be likely to spontaneously 
label individuals classified here as “distrustful.” 
The feelings of distrust would be relatively in- 
frequent and/or of relatively low “intensity.” 
Suspiciousness of this degree would not be con- 
sidered pathological or particularly striking by 
most people in our culture. 

3. Moderate suspiciousness.—With suspiciousness 
of this level there would be a tendency on the 
part of many of the S's friends and associates (or 
interviewers) to spontaneously observe and report 
distrust of others as one of his characteristics. 
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ized group. The MMPI Pa scale T scores 
correlated .29; the S18 scale, .44, the SCSS I, 
.58; and the SCSS II, —.23, with rated suspi- 
ciousness. When the HIT SP responses were 
added to the SCSS I and II, their correlations 
with rated suspiciousness were .52 and —.46, 
respectively. These data are presented in 
Table 1. 

A multiple correlation of .69 was obtained 
in the cross-validation group between rated 
suspiciousness and the scores of the three best 
measures developed on the validation group: 
the MMPI S18 scale, the HIT SCSS I plus 
SP responses, and the HIT SCSS II plus SP 
responses. 


Discussion 


The results of the study were encouraging. 
Using three of the measures developed in this 
investigation, a multiple correlation of .61 
was found with rated suspiciousness in the 
developmental sample. Applying these same 
measures to a markedly different cross- 
validation sample, a multiple correlation of 
.69 was obtained. While correlations of this 
magnitude leave considerable room for im- 
provement, they are sufficiently high to war- 
rant use of these measures in evaluating 
suspiciousness in groups of patients. 

The finding of two relatively distinct types 
of “suspicious” HIT responses (SCSS I and 
SCSS II) was interpreted as indicating that 
different “degrees” of suspiciousness are 
characterized by qualitatively different types 
of responses to the HIT. This is similar to 
an earlier finding which related qualitative 
differences in HIT content to different “inten- 
sities” of depression (Endicott & Jortner, 
1966). 

The HIT “suspicious popular” (SP) re- 
sponses were added to both the HIT SCSS I 
and II on the assumption that two of the 
more significant processes involved in the 
production of the SP responses had opposing 
directional effects and tended to cancel one 
another insofar as the correlations with rated 
suspiciousness were concerned. On the one 
hand, as the severity of suspiciousness in- 
creased there is a tendency to perceive a 
larger number of faces, masks, eyes, etc., on 
inkblot tests. This tendency is reflected most 
strikingly in responses to relatively unstruc- 


tured inkblots. As the inkblots become more 
structured and their form more closely comes 
to resemble a face or mask, etc., another 
process, or group of processes, assumes greater 
significance. It has been demonstrated that 
the severely emotionally disturbed tend to 
view the world idiosyncratically and, as a 
consequence, produce fewer popular responses 
to inkblot tests than do “normals” (Holtz- 
man, Thorpe, Swartz, & Herron, 1961). 
Because of the inverse relationship between 
suspiciousness and “mental health,” the ink- 
blot percepts of the mildly suspicious tend to 
resemble those of the general population 
(i.e., populars) more closely than do those of 
the markedly suspicious group. By relying on 
the “suspicious” content of the remainder of 
the HIT protocol the authors hoped to 
delineate which of these processes was most 
relevant in each patient's case. 

The method of assigning the HIT SP re- 
sponses to the HIT SCSS I and II was clearly 
more effective in the case of the SCSS II 
(Table 1). In the outpatient group the addi- 
tion of the SP responses actually lowered the 
correlation between rated suspiciousness and 
the SCSS I scores from .58 to .52. Because of 
the greater number of SCSS I responses in 
the outpatients than in the hospitalized group, 
the cutoff points developed in the latter group 
were too low for the outpatients. As a conse- 
quence, a larger number of SP responses was 
added to the SCSS I scores of the outpatient 
moderately suspicious group than to those of 
the markedly suspicious and the correlation 
was lowered. The present contextual scoring 
system is regarded as preliminary and further 


TABLE 1 


Pnopucr-MOMENT CORRELATIONS BETWEEN 
RATED SusPICIOUSNESS AND TEST SCORES 


Holtzman Inkblot Technique MMPI* 

Group 
SCsS 1] sess |SCSS 1 SGH $28 | Pa 
Hospitalized Ex. 
outpatients | See | TS" | S | a de | 29 


Note—Abbreviations: SCSS = Suspiciousness Content 
Scoring System; SP = suspicious popular responses. 

* The hospitalized patients’ MMPI correlations are based 0? 
84 tests. 

bn =90. 

cn = 40. 

*p < 01. 
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TABLE 2 
MEANaTEST SCORES AND RATINGS OF “DEGREE” OF SUSPICIOUSNESS 
HIT HIT HIT HIT MMPI MMPI 
Clinical ratings of SCSS I SCSS II SCSSI+SP | SCSS II+SP S18 Pa 
suspiciousness 
M SD M | SD M SD | M SD M | sp | Mm | sD 

Hospitalized group (7 90)* 
Extreme (n = 4) 8.75 0.50 9.25 0.50 3.3 57.0 
Marked (n = 25) 5.52 | 4.91 | 0.35 | 0.49 | 5.88 | 5.30 | 0.35 | 049 | 6.7 3.4 | 67.8 | 14.0 
Moderate (n = 23) 4.04 | 3.39 | 0.48 | 0.66 | 4.35 | 3.73 | 0.52 | 0.66 | 43 3.3 | 748 94 
Mild (n = 38) 1.89 | 1.89 | 1.11 | 0.92 | 2.00 | 2.22 | 132 | 110 | 29 2.4 | 59.7 | 10.7 
Total group (n = 90) | 3.76 | 4.07 | 0.71 | 0.81 | 4.03 | 4.44 | 0.82 | 0.92 | 4.3 3.4 | 63.8 | 122 

Outpatient group (v 40) 
Marked (n = 10) 9.70 | 4.24 | 0.20 | 0.42 | 10.40 | 4.84 | 0.20 | 0.42 | 5.1 $.5 | 66.6 | 11.7 
Moderate (n — 22) 6.05 | 3.42 | 0.64 | 0.73 | 6.95 | 4.37 | 0.68 | 0.84 | 2.4 24 | 639 | 15.6 
Mild (” = 8) 2.50 | 1.69 | 1.00 | 1.07 | 3.00 | 2.45 | 1.75 | 175 | 18 14 | 545 6.4 
Total group (n = 40) | 6.25 | 411 | 0.60 | 0.78 | 7.02 | 4.80 | 0.78 | 1.12 | 2.9 2.7 | 62.7 | 13.7 


Note.—Abbreviations: HIT SCSS = Holtzman Inkblot Technique, Suspiciousness Content Scoring System; SP = suspicious 


popular responses. 


* For the MMPI scores the number of patients is: extreme, 4; marked, 24; moderate, 21; and mild, 35. 


research is necessary in order to develop more 
sophisticated scoring systems of this type. 

The finding of a higher correlation between 

rated suspiciousness and the HIT SCSS I 
Scores in the cross-validation group (the out- 
patients) than in the hospitalized group is 
probably due in large part to the greater 
number of SCSS I responses in the former 
group (Table 2). The reasons for this dif- 
ference between the two groups is unknown. 
The SCSS I scores were not significantly cor- 
related with WAIS IQ, education, or field 
dependence-independence as measured by the 
EFT (Witkin, Dyk, Faterson, Goodenough, & 
Karp, 1962). Whatever the causes of this 
difference, it is not a general finding with 
respect to all HIT content measures of 
‘psychopathology.” For example, the two 
groups did not differ significantly in HIT 
depressive” content, but the hospitalized 
group had significantly more HIT content 
associated with somatic preoccupation than 
did the outpatient group (Endicott & Jortner, 
1966, 1967). 

Another factor which may have contributed 
to the difference between the correlations of 
the ratings and the test scores for the two 
groups was a lower accuracy of the ratings 
in the hospitalized group. Suspiciousness is 


generally considered a socially undesirable 
trait and is frequently denied. In the hospital- 
ized group the ratings were made after one 
interview of 60-90 min. duration. On several 
occasions it was later discovered, through 
conversations with the patient’s psychothera- 
pist, nurses, or other patients, that the authors 
had underestimated the degree of the patient’s 
suspiciousness. This occurred most frequently 
among the alcoholic patients. 

In the outpatient group the ratings were 
usually made after five 1-hr. interviews, and 
as a consequence the rater had considerably 
more information about them than about the 
hospitalized group. In addition, the outpa- 
tients came to the interviews under conditions 
more conducive to self-revelation than did the 
hospitalized patients. 

The findings of the present study suggest 
a number of procedures whereby more valid 
objective measures of suspiciousness might be 
developed. For example, some of the scoring 
categories in the SCSS I were consistently 
more discriminating than others. If the same 
differences in discriminatory power are found 
in other groups, a weighting system could be 
developed to use this differential. 

The finding that different “degrees” of 
suspiciousness are characterized by qualita- 
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tively different types of HIT responses sug- 
gests another method of developing a more 
valid method of evaluation. For example, 
using the combined patient group of 130, 
there were 12 patients who had the pattern 
SCSS I = 0 and SCSS II = 0-3. Ten of these 
patients were rated mildly suspicious and 
two were rated moderately suspicious. Several 
other HIT score patterns had a similar dis- 
criminatory power. By using such highly dis- 
criminatory HIT patterns, and other mea- 
sures derived from tests, in a method based 
on the decision tree model (Kleinmuntz, 
1963), it should be possible to develop objec- 
tive evaluative techniques which are much 
more reliable and valid than the relatively 
simple linear methods discussed in this report. 
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The Revealed Differences Technique was used to study dominance and conflict 
patterns in the interaction of neurotic and normal children of both sexes. Two 
measures of relative dominance indicated that neurotic boys have dominant 
mothers and neurotic girls have dominant fathers. Two measures of conflict indi- 
cated a more hostile interaction between the parents of neurotic children. In 
the neurotic group, there was a relationship between the areas of husband- 
wife conflict and independent clinic ratings of corresponding problem areas in 
the behavior of the child. The results were interpreted in terms of an identifi- 


cation learning conflict. 


Tn recent years, there has been a consider- 
able focusing of interest on the family situa- 
tion as a major determinant of a wide variety 
of neurotic, delinquent, and psychotic pat- 
terns of pathological behavior (Ackerman, 
1958; Lidz, Fleck, & Cornelison, 1965; Spie- 
gel & Bell, 1959). It has been suggested that 
the development of pathological behavior in 
the child is related not only to emotional dis- 
turbances in the individual parents but also to 
disturbances in the relationship between par- 
ents which may affect the mode of functioning 
of the entire family. Along with this, a major 
movement to develop and apply psychothera- 
peutic techniques to family groups has 
emerged (e.g, Boszormenyi-Nagy & Framo, 
1965; Satir, 1964). Unfortunately the em- 
pirical base for these theoretical and thera- 
peutic developments is exceedingly weak. 
Most of the work has consisted of clinical ob- 
servations, and the more systematic clinical 
research has been characterized by a lack of 
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comparison groups, the use of inadequate 
measuring procedures, and a failure to control 
for variables such as age, sex, social class, and 
type of pathology (Fontana, 1966; Frank, 
1965). 

One of the major problems in this area is 
how to reconstruct the family situation as it 
may have been during the presumably critical 
early years of the childhood of the disturbed 
individual. Retrospective accounts by either 
the individual or his parents are subject to a 
number of distortions and measurement diffi- 
culties. Therefore, attention has been turned 
toward a direct study of the current person- 
alities in the family and their interaction with 
one another. Fontana (1966) has cogently 
argued for the utility of this method in the 
absence of longitudinal studies, while at the 
same time pointing out its limitations. 

The direct study of individual members of 
a family with a pathological child suggests 
that the parents are maladjusted in various 
ways (Fisher, Boyd, Walker, & Sheen, 1959; 
Peterson, Becker, Hellmer, Shoemaker, & 
Quay, 1959; Winder, 1960). However, it has 
been difficult to specify these parental char- 
acteristics, particularly in relation to different 
types of pathology. Many investigators now 
believe that the interaction between family 
members is a more significant factor than indi- 
vidual personalities (Handel, 1965). The most 
widely used measure of parental interaction 
is the Revealed Differences Technique (RDT; 
Strodtbeck, 1951), an experimental procedure 
in which parents answer standardized ques- 
tions individually and are then brought to- 
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gether to reconcile their views. Unfortunately, 
nearly all of the studies using the RDT have 
been concerned with schizophrenia (e.g., Fa- 
rina, 1960). 

The purpose of the present study is to ex- 
amine, with the aid of the RDT and with 
better controlled conditions, the interaction of 
parents of neurotic children in comparison 
with the interaction of parents of normal chil- 
dren. The interest in neurosis was determined 
by the relative neglect of this level of psycho- 
pathology in the family research literature as 
compared with the relative importance of neu- 
rotic phenomena in the formulation of classic 
and current theories of personality and psy- 
chopathology. 

In addition, the use of a child guidance 
center population of neurotic children has 
several advantages in the general study of 
family psychopathology. First, the study of 
families is more feasible in this situation 
than many others. Second, the use of this 
young outpatient population with relatively 
acute problems minimizes the influence of 
chronicity and hospitalization that compli- 
cates the interpretation of many studies using 
adult psychiatric patients. Third, the close 
temporal proximity of the child’s pathology 
and the family patterns studied, while not 
eliminating the problem of interpreting causal 
effect, makes such interpretations a little more 
credible than when one must extrapolate back 
20 yr. or more. 

The first variable studied was parental 
dominance. This has been considered a criti- 
cal variable in psychopathology since the 
early work of Levy (1945) but the supporting 
evidence is rather weak (Fontana, 1966; 
Frank, 1965). There are actually several ver- 
sions of the dominance hypothesis: first, that 
excessive maternal dominance (or alterna- 
tively paternal dominance) is a critical factor 
in pathological behavior regardless of the sex 
of the child; second, that an extreme disparity 
in the dominance of the parents, regardless of 
direction, produces a pathological situation for 
all children; third, there is a variation em- 
phasizing the interaction between the sex of 
the dominant parent and the sex of the dis- 
turbed child. All of these versions assume that 
abnormal dominance patterns interfere with 
the sex-role-identification learning of the 


child. These three variations of the dominance 
hypothesis were tested with measures from 
the RDT using parents of normal and neurotic 
children. 

The next variable studied was conflict be- 
tween the parents. It has been suggested that 
neurotic behavior patterns could result from 
a child being drawn into a conflict between 
the parents (Vogel & Bell, 1960). However, 
Fisher et al. (1959) found little conflict with 
parents of adult neurotics as compared to 
parents of schizophrenics. Parental conflict 
has been found rather consistently with 
schizophrenia (Fontana, 1966). In the present 
study, the hypothesis that parents of neurotic 
children show greater conflict than parents of 
normal children was tested with measures 
from the RDT. 

The conflict theory, of course, supposes not 
only that the parents be in conflict but that 
the child’s behavior is affected by this parental 
conflict. The objective evidence for this is 
extremely limited. Peterson et al. (1959) and 
Becker, Peterson, Hellmer, Shoemaker, and 
Quay (1959) did find some relationship be- 
tween parental characteristics and the prob- 
lems presented by emotionally disturbed chil- 
dren at a child guidance clinic. Unfortunately, 
the child’s problem was assessed from the 
same parental interview on which the parent's 
personality was judged. Therefore, the present 
study incorporated a method of assessing three 
areas of conflict between parents—achieve- 
ment, affiliation, and aggression—which could 
be related to independent ratings of three 
areas of the neurotic child’s pathology—un- 
derachieving, social withdrawal, and aggres- 
sive behavior. It was hypothesized that there 
would be a positive relationship between the 
areas of parental conflict and the neurotic 
child's areas of difficulty. 

If a relationship were found between areas 
of parental conflict and the child's problem, 
there still would be no way of identifying the 
antecedent variable. The child's problem 
could have grown out of the parental problem 
or the parental problem might simply repre- 
sent a reaction to the child's problem. Al- 
though the problem of causal direction will 
not be solved for a long time in any definitive 
way, it was thought that some relevant infor- 
mation might be obtained. Therefore, thé 
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| parents were asked two types of questions— 


half about parent-child situations and half 
about husband-wife situations having nothing 
to do with children. It was assumed that if 
the parental conflicts merely reflected a re- 
sponse to the child’s problem, the relationship 
between area of child difficulty and area of 
parental conflict should be greater with the 
parent-child situations. On the other hand, if 
the child’s behavior was in response to a 
parental problem, the relationship should be 
as strong or stronger with husband-wife situ- 
ations. 


METHOD 
Subjects 


The Ss consisted of the parents of 30 neurotic 
children, 18 male and 12 female, at the local child 
guidance clinic and the parents of 30 normal children, 
16 male and 14 female, obtained through a local 
school system. Parents of children with organic, 
psychotic, or legally delinquent problems were ex- 
cluded from the neurotic group. Other than this, 
successive referrals were seen as part of the diag- 
nostic procedure with no refusals. The children were 
typical of neurotic-level child guidance cases with 
school problems, anxiety symptoms, and social mal- 
adjustment. Parents of children with any psycho- 
pathology known to the school psychologist were 
excluded from the normal group. Other than this the 
normal families were selected at random by com- 
puter and then asked to participate in a study of 
normal families. About 45% of the normal families 
contacted agreed to take part in the study. Since 
Zuckerman, Barrett, and Bragiel (1960) found that 
the parents of the most maladjusted school children 
Were the most reluctant to participate in this sort of 
study, the voluntary participants in the present study 
Presumably represent a well-adjusted sample. 

The families in the two groups were not signifi- 
cantly different with respect to the ages of the 
father, mother, or child. The ages of the children in 
both groups ranged from 6 to 16, with a mean of 
about 11. The social class of the families was mea- 
Sured by the Warner (1960) scales for father’s occu- 
pation and income with good reliability between two 
independent judges (r= .84, p< .01). Both groups 
consisted predominately of lower-middle-class fami- 
lies and were not significantly different on this 
variable. 


Procedure 


The parents in the neurotic group were seen at 
the child guidance clinic and the parents in the 
Normal group at the university by the same female 
Braduate student. All parents were interviewed first 
individually and then together. In the individual 
interview each parent was asked what he or she 
Would do in 12 hypothetical situations. In the joint 


Fil interview the same situations were read and the 


parents were asked to come to an agreement on how 
they would react. The instructions closely followed 
those used by Farina (1960). 

The 12 situations were designed to touch upon 
three motivational systems—4 were about achieve- 
ment, 4 were about affiliation, and 4 were about 
aggression. Four graduate students in clinical psy- 
chology each correctly identified the motive involved 
in well over 90% of the situations, In addition, half 
of the situations, two from each motivational group, 
dealt with a problem between a husband and wife 
and the other half with a problem between a parent 
and a child. In half of the husband-wife situations, 
the husband was the central character, and the wife 
in the other half. The child was the central character 
in the parent-child situations. The situations were 
presented, for all Ss, in a fixed order that was ran- 
domly determined except that successive situations 
did not touch upon the same motive. While these 
hypothetical situations were newly created, they are 
of the same genre as those used by Farina. An ex- 
ample is: “A wife very much wants to live in a 
better neighborhood. The only way the family could 
afford this is if she takes a part-time job. If you were 
the [wife, husband], what would you think she 
should do?” This is a husband-wife situation, with 
the wife as the main character, dealing with achieve- 
ment motivation. 

In order to minimize E bias, E was provided with 
a fixed program of appropriate responses (eg, 
“um-hmm,” “o.k.”) for use after each answer of 
the S. In addition, E carefully controlled facial ex- 
pressions and glances. As a further check on the main 
E, who saw 20 sets of parents in each group, seven 
graduate students in clinical psychology, who were 
unaware of the specific purpose of the study or the 
nature of the measures to be used, interviewed the 
other 10 sets of parents in each group. Later, these 
were compared with 10 randomly selected sets of 
parents from each group seen by the main E and no 
significant, or even suggestive, differences were found 
on any of the measures to be described below. 


Measures 


The analysis was based on the taped interview 
with the parents together. The measures, taken di- 
rectly from Farina (1960), concerned dominance and 
conflict. 

The measures for dominance were: (a) Speaks 
First and Last (number of times the father spoke 
first or last during the interview minus the number 
of times the mother did the same) ; (b) Passive Ac- 
ceptance (the number of the 12 situations in which 
the mother simply agreed with the other's solution 
minus the number of times the father behaved simi- 
larly); (c) Total Time Spoken (difference in time 
in seconds during which father and mother spoke). 
Positive scores indicated paternal dominance and 
negative scores maternal dominance. Dominance 
measures could be expressed either as relative domi- 
nance of one parent over the other or as absolute 
disparity in dominance. The measures could also be 
related to sex of child. 
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The measures for Conflict were (a) Disagree- 
ments and Aggressions (the number during the inter- 
view by father plus mother); (b) Failure to Agree 
(the number of the 12 situations with no mutually 
acceptable solution) ; (c) Simultaneous Speech (length 
of time in seconds during which both parents spoke 
concurrently); (d) Interruptions (the number of 
times either parent interrupted the other); (e) Verbal 
Activity (total amount of time spoken by father and 
mother). 

The tapes were scored blind by the main E ap- 
proximately 6 mo. after the data were gathered. As 
a reliability check, two graduate students in clinical 
psychology, unaware of the purpose of the study, 
scored 10 randomly selected tapes from each of the 
groups. The scores were quite similar and rank-order 
correlations between the main E and the naive 
judges ranged from .79 to .99 (p<.01 or better) 
for the various measures. 


Ratings of Neurotic Children 


Preliminary discussions with the clinic staff sug- 
gested that the neurotic children could not be easily 
grouped into those with only achievement, affiliation, 
or aggression problems since these behavioral fea- 
tures frequently overlapped. Therefore, each child 
was rated on all three variables. The staff member 
of the child guidance clinic most familiar with the 
child was asked to make the ratings at the end of 
the intensive 3-hr. diagnostic procedure. Three 7- 
point scales ranging from "very little problem" to 
"great difficulty” were used to rate the problem 
areas of underachieving, social withdrawal, and ag- 
gressive behavior. The nature of the diagnostic pro- 
cedure precluded an adequate assessment of the reli- 
ability of these scales. 


RESULTS 
Parental Dominance Hypotheses 


The first version of the dominance hypothe- 
sis tested was that the parents in the neurotic 
group as a whole would show a pattern of 
maternal dominance, or alternatively a pat- 
tern of paternal dominance, as contrasted 
with the parents in the normal group. Table 1 
shows the mean relative dominance scores for 


TABLE 1 


MEAN RELATIVE DOMINANCE MEASURES FOR PARENTS 
or'NEuROTIC AND NORMAL CHILDREN 


Parents | Parents 
Measure ol of 
neurotics | normals 


Speaks First and Last 


(No. father minus no. mother) 1.33 —.60 .953 
Passive Acceptance ID —.53 814 
(No. mother minus no. father) 
Total Time Spoken 
(Sec. father!minus sec. mother) | 71.13 1.954 


18.27 
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TABLE 2 


MEAN ABSOLUTE DOMINANCE MEASURES FOR PARENTS 
or NEUROTIC AND NORMAL CHILDREN 


Parents | Parents 
Measure of of 
neurotics | normals 


Speaks First and Last 

(Father-mother difference) 6.33 | 4.60 | 1.870 
Passive Acceptance 

(Father-mother difference) |] 1.03 | 1.00 | .009 
Total Time Spoken 


(Father-mother difference) | 128.60 | 86.73 | 2.286 


the three dominance measures for the parents 
in each group. If the neurotic parents as a 
group showed maternal dominance the scores 
should be relatively more negative than those 
of the neurotic group. If anything, the neu- 
rotic parents are slightly more paternal in 
dominance patterns. Actually, none of these 
differences is statistically significant. The 
data in this section were evaluated statisti- 
cally by a computer program for multiple 
analysis of variance appropriate for groups of 
equal or unequal size developed by Clyde, 
Cramer, and Sherin (1966). Thus, the par- 
ents of the neurotic group as a whole do not 
appear to show excessive maternal or paternal 
dominance. 

Next, the idea that parents of neurotic chil- 
dren show an extreme disparity in the domi- 
nance of the parents, regardless of direction, 
was tested by examining the absolute differ- 
ences in the three dominance measures. The 
results are shown in Table 2. Although there 
is a trend for parents of neurotic children to 
show a greater dominance discrepancy than 
parents of normal children, none of these 
differences is statistically significant. There- 
fore, dominance discrepancy between the par- 
ents does not appear to be a strong variable 
in the family backgrounds of these neurotic 
children. 

Finally, the version of the dominance hy- 
pothesis emphasizing the interaction between 
the sex of the dominant parent and the sex of 
the neurotic child was examined. The mean 
relative dominance measures for parents of 
boys and girls in the neurotic and normal 
groups are shown in Table 3. The Speaks First 
and Last and the Passive Acceptance mea- 
sures show a significant interaction between 


group and sex of the child. Both measures sug- ^ 


TABLE 3 


— Mean RELATIVE DOMINANCE MEASURES FOR PARENTS 
or NEUROTIC AND NORMAL Boys AND GIRLS 


Parents of Parents of 
neurotic normal 
Measure zio kissed Pages Many al 
Boys | Girls | Boys | Girls 
Speaks First and Last | —2.22| 6.17 .00 | —1.29 | 7.256* 
—— Passive Acceptance —.67 | .58| —.06 | —1.07 | 7.547* 
Total Time Spoken 68.83 | 74.58| 2.88 | 35.86 | .127 


*p <.01. 


gest that neurotic boys come from maternally 

"dominated families and the girls from pa- 
ternally dominated families. The measures for 
the parents of normal boys do not show much 
deviation in either direction while the parents 
of normal girls are slightly in the maternal 
direction. The Total Time Spoken measure 
does not show a statistically significant inter- 
action between group and sex of child, possi- 
bly because of the tendency of fathers in all 
groups to talk more. 

Analyzing the data in another way, there is 
a significant difference between the parents of 
meurotic boys and girls on Speaks First and 
Last (F = 8.721, p < .01) and Passive Ac- 
ceptance (F = 4.571, p < .05). The parents 
of normal boys and girls do not differ signifi- 
cantly on these measures. So, too, the parents 
of neurotic girls and the parents of normal 
girls are significantly different on Speaks First 
and Last (F = 8.540, p < .01) and Passive 
Acceptance (F= 5.318, p < .05). The cor- 
Tesponding differences for parents of neurotic 
and normal boys do not reach statistical sig- 

nificance, although they are in the same direc- 
tion. Thus, the results are most striking for 
the parents of neurotic girls. 

Do the averaged data reflect a few extreme 
Cases or a more prevalent association between 
parental dominance and sex of neurotic child? 
Using the Speaks First and Last measure, 
each family was simply classified as showing 
paternal or maternal dominance on the basis 
of the sign of the score with ties eliminated. 
These frequency data are shown in Table 4 

_ where it can be seen that a substantial pro- 
portion of the families of neurotic boys were 
- Classified as maternal and an even larger 
Proportion of the families of neurotic girls 
were classified as paternal (corrected x? = 5.1, 
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TABLE 4 


FREQUENCY OF PATERNAL OR MATERNAL DoMINANCE 
IN PARENTS OF NEUROTIC AND NORMAL 
Boys AND GRIS 


Parents of Parents of 
Dominance based on neurotic normal 
speaks first or 
last measure 
Boys | Girls Boys | Girls 
Paternal dominance 4 9 7 5 


Maternal dominance 


df = 1, p < .05). The normal boys and girls 
appear to come about equally from paternal 
and maternal homes (corrected x? < 1, df = 
1, ns). The Passive Acceptance measure shows 
a similar trend but, because of a larger num- 
ber of ties, the results are not as striking. 

An examination of the individual data in 
the four subgroups suggests that the groups 
are roughly comparable in the size of discrep- 
ancies between the dominance of father and 
mother. While the two or three most extreme 
discrepancies occurred in the neurotic groups, 
there are no significant differences between 
the subgroups using the three absolute domi- 
nance measures nor did the absolute domi- 
nance measures show significant interactions 
between group and sex of child. It would be 
more accurate to say that neurotic children 
more frequently came from homes showing 
cross-sex parental dominance than that neu- 
rotic children came from homes showing a 
greater amount of cross-sex parental domi- 
nance. 


Parental Conflict Hypothesis 


The results for the five measures of conflict 
are presented in Table 5. The data were eval- 
uated statistically by the computer program 
for multiple analysis of variance developed by 


TABLE 5 


MEAN CONFLICT MEASURES FOR PARENTS OF NEUROTIC 
AND NORMAL CHILDREN 


Parents | Parents 


Measu of neu- | of nor- 
= rotics | mals 
Disagreements and Aggressions (no.) 4.77 63 | 6.975* 
Failure to Agree (no.) 183 AT 6.947" 
Simultaneous Speech (sec.) 6.00 | 6.30 E 
Interruptions (no.) 12.83 16.07 | 1.21: 
Verbal Activity (sec.) 488.07 | 378.33 | 2.137 


*p <02. 
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Clyde et al. (1966). There are significantly 
more instances of Disagreements and Aggres- 
sions between the parents of neurotic children 
than between the parents of normal children. 
The parents in the neurotic group failed to 
reach a mutually acceptable solution in sig- 
nificantly more situations than did the parents 
in the normal group. The other three measures 
did not differentiate the two groups at a sta- 
tistically significant level. The pattern of re- 
sults was similar for parents of boys and 
girls, but statistically stronger for the boys. 
There was no significant interaction between 
group and sex of child on any of the conflict 
measures. 

Qualitatively, the groups appear to be dif- 
ferentiated on the basis of overt parental hos- 
tility. The impression gained from listening to 
the tapes was that the first two measures re- 
flected a hostile interaction. It is interesting to 
note that these are the same two measures 
judged most valid by Farina and Dunham 
(1963). The other three measures, however, 
seemed to represent a positive give-and-take 
discussion as often as a hostile interaction. 
Farina and Dunham, as well as Hetherington 
and Frankie (1967), combined several of 
these conflict indicators to get an overall con- 
flict measure, but this did not appear to the 
present authors to be legitimate because the 
indicators are not all independent of one an- 
other. 

The hostile interaction in the neurotic 
group was about evenly initiated by the father 
and the mother. The parents were very close 
on the measures of Disagreements and Ag- 
gressions and Interruptions. There was a non- 
significant trend for the father to talk more. 
The Failure to Agree and Simultaneous Speech 
measures could not be broken down in this 
way. Furthermore, the data did not suggest 
an interaction between sex of parent and sex 
of child on any of the three relevant measures 
of conflict. 


Relationship between Parental Dominance 
and Conflict 


The results so far show that while cross- 
sex parental dominance is more frequent in 
the neurotic group it also occurs in a substan- 
tial proportion of the normal families. The 
groups are also differentiated on the basis of 


interparental conflict. The question arises as 
to whether the cross-sex parental dominance 
in the neurotic group is accompanied by in- 
terparental conflict while in the normal group 
cross-sex parental dominance occurs without 
appreciable interparental conflict. In order to 
answer this question, both neurotic and nor- 
mal families were classified as showing cross- 
sex or same-sex parental dominance on the 
basis of the Speaks First and Last measure as 
was done for Table 4. They were also classi- 
fied as high conflict if they had two or more 
Disagreements and Aggressions or as low con- 
flict if they had one or less. 

The results of this analysis are shown in 
Table 6 where it can be seen that for neu- 
rotic families most of the cross-sex parental 
dominance cases also show high conflict but 
in the normal families cross-sex parental domi- 
nance occurs without high conflict (corrected 
x:—92, df — 1, p< 01). When the boys 
with dominant mothers are considered sepa- 
rately, the same relationship is found. Be- 
cause of small expected frequencies this rela- 
tionship was tested with the Fisher exact 
probability test (Latscha, 1953; Siegel, 1956) 
and found to be significant (p < .01). The 
results for girls with dominant fathers were 
in the same direction but failed to reach sig- 
nificance (Fisher test). 

The data for children with the same-sex 
parent dominant are shown in the right half 
of Table 6 for comparison. The neurotic and 
normal groups are not significantly different 
(Fisher test). In comparing the neurotic 
children with same-sex and cross-sex parental 
dominance, there is a trend for the former to 
show more frequent low-conflict cases as Op- 
posed to the high-conflict cases in the latter, 
but this reversal just failed to reach statisti- 


TABLE 6 
PARENTAL DOMINANCE AND PARENTAL CONFLICT 


Cross-sex parent | Same-sex parent 


dominant dominant 
Parents of 2 
High | Low | High | Low 
conflict | conflict | conflict | conflict 
Neurotic children | 14 5 2 4 
Normal children 1 10 1 12 


. Note—Frequency of high and low interparental conflict 
in families with cross-sex and same-sex parental dominance for 
neurotic and normal children. 


dias 


- 
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cal significance (Fisher test). A comparison 
of the normal families in the same-sex and 
cross-sex parental dominance groups shows no 
significant difference in frequency of high- and 
low-conflict cases (Fisher test). 


Relationship between Parental Conflict and 
Problems of Neurotic Child 


As anticipated, most of the neurotic chil- 
dren did have a variety of combinations of 
problems of underachievement, social with- 
drawal, and aggressive behavior according to 
the ratings of the clinic staff. These problems 
were about equally represented in the group 
of neurotic boys but there were fewer neu- 
rotic girls with a problem of aggressive be- 
havior. In general, the procedure of rating 
degree of difficulty in several areas provided 
a much more meaningful picture of the 
neurotic child’s problems than a simple group- 
ing, according to the most salient problem, 


- would have. 


It was predicted that there would be a 
relationship among the measures of parental 
conflict in the three areas and the degree of 
disturbance of the neurotic child in these 
same areas. Two technical problems arose in 
testing this relationship. First, when the 
various measures of parental interaction were 
broken down by content area, the values 
became quite small, with a preponderance 
of zeros. Second, in making ratings of the 
problem areas of the neurotic children, the 
Staff members of the child guidance center 
tended to use quite different ranges of the 
rating scales making direct comparisons 
specious. These technical problems precluded 
the more obvious methods of analysis. 

The relationship between parental inter- 
action and child behavior was tested in some- 
what an unusual fashion. First, one of the 
parental measures of conflict, Verbal Activity 
(total time spoken by father and mother), 
was not grossly distorted by the breakdown 
and showed an adequate variation among 
content areas. Thus, it was possible to rank, 
for each pair of parents, the amount of time 
Spent in discussing the situations in the areas 
of affiliation, achievement, and aggression. 
Furthermore, although the center staff used 
different ranges of the scale, it was possible 
to rank the ratings of degree of difficulty in 


the areas of withdrawal, underachieving, and 
aggressive behavior for each neurotic child. 
Thus, it was feasible to compute a rank-order 
correlation for each family between the Verbal 
Activity of the parents in three areas and the 
degree of difficulty of the child in the cor- 
responding three areas. 

In order to express the relationship between 
parental interaction and child behavior for 
the neurotic group as a whole, note should be 
taken of the fact that with an N of 3 only 
four values of rho can be obtained: +1.00, 
+.50, —.50, and —1.00. The number of rhos 
that would be expected by chance alone can 
be obtained from the normal probability 
table. The expected frequencies for 28 fam- 
ilies (2 families were eliminated because of 
ties in the staff ratings) are shown in the top 
row of Table 7. The frequencies of rhos 
obtained are shown in the second row of 
Table 7. It can be seen that there are many 
more positive rank-order correlations obtained 
than would be expected by chance alone 
(x2 = 9.2, df = 3, p < .05). Therefore, there 
does seem to be a positive relationship be- 
tween the relative length of time the parents 
discuss situations in the three areas and the 
relative degree of difficulty the child is seen 
as having in the corresponding areas. 

The degree to which the problems of the 
child were related to parent-child versus 
husband-wife conflict was examined in the 
following way. Separate rank-order correla- 
tions were computed between the rankings of 
the child's difficulty in the three areas when 
only the two situations in each area dealing 
with husband-wife problems were included 
and when only the two situations in each area 
dealing with parent-child problems were in- 


TABLE 7 


PARENTAL CONFLICT AND PROBLEMS 
or NEvunoric CHILD 


Area (Eat eS eS 
+1.00 | +.50 | —.50 | —1.00 
Chance expectation 43 9.5 9.5 4.3 


Obtained 12 situations | 9 12 6 1 


Note.—Expected and obtained distributions of rank-order 
correlations between the length of time the parents discussed 
Situations in the areas of affiliation, achievement, and aggres: 
sion and the degree of difficulty the neurotic child was judged 
to have in these areas. 
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TABLE 8 
HussAND-WirE VERSUS PARENT-CHILD CONFLICTS 


rho 
Area 


+1.00| +.50 | —.50 | —1.00 

Chance expectation 43 9.5 | 9.5 | 43 

Obtained for six husband- | 6 14 8 0 
wife situations 

Obtained for six parent- 5 12 5 6 


child situations 


Note.—Rank-order correlations between the length of time 
the parents discussed situations in the areas of affiliation, 
achievement, and aggression and the degree of difficulty the 
child was judged to have in these areas—distributions for 
chance expectation, obtained for husband-wife situations, and 
obtained for parent-child situations, 


cluded. The results are shown in Table 8. The 
distribution of rhos for the husband-wife 
situations deviates in positive direction and 
is significantly different from the chance dis- 
tribution (x? = 7.89, df = 3, p< .05). The 
distribution for the parent-child situations is 
only slightly in the positive direction and 
does not deviate significantly from chance 
(x2 = 3.59, ns). Thus, the relationship be- 
tween parental interaction and child behavior 
holds when the parents are discussing hus- 
band-wife situations rather than when they 
are discussing parent-child situations. 


Discussion 


Before interpreting the results, several 
limitations of the study should be noted. 
First, the results should not be generalized 
beyond a population of lower-middle-class 
parents of school-age children with neurotic- 
level problems who are being seen at a child 
guidance center. Second, the parents in one 
group were seen as part of an intake pro- 
cedure at a child guidance center while those 
in the other group were told that they were 
part of a study of normal families. While it 
is unlikely that these differing circumstances 
account for all of the results, their influence 
on the behavior of the parents could not be 
evaluated in this study. Third, while the 
critical measures from the Revealed Differ- 
ences Technique (RDT) seem reasonably 
well validated in other studies, the frankly 
experimental ratings of the problems of the 
neurotic child are of unknown reliability and 
validity. 

The results from the dominance measures 
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clearly support the hypothesis that neurotic 
behavior is related to the sex of the dominant 
parent and the sex of the disturbed child. In 
the present sample, neurotic behavior is not 
associated simply with dominant mothers nor 
simply with dominant fathers without respect 
to the sex of the child. So, too, neurotic 
behavior is not associated with a discrepancy 
in the dominance of the parents regardless of 
direction, Rather, neurotic boys tend to come 
from maternally dominated homes and neu- 
rotic girls from paternally dominated homes. 
Presumably, these situations produce difficul- 
ties in identification learning. 

Nevertheless, an explanation of neurotic 
behavior on the basis of cross-sex parental 
dominance is inadequate. Recall that about 
half of the normal children come from homes 
showing about the same degree of cross-sex 
parental dominance. Obviously some other 
variable must be involved. The results from 
the conflict measures suggest that this addi- 
tional variable is an overtly hostile conflict 
between the parents. One might be tempted to 
say that interparental conflict is a sufficient 
explanation of fhe neurotic patterns in this 
sample but this would ignore the intriguing 
results on the distribution of  cross-sex 
parental dominance between the groups as 
well as the association of cross-sex parental 
dominance and interparental conflict in the 
neurotic group. 

One possible interpretation of the results 
is that parental conflict and cross-sex parental 
dominance interact to produce a serious prob- 
lem for the child, Several studies have shown 
that normal children tend to imitate the domi- 
nant parent regardless of sex (Bandura, Ross, 
& Ross, 1963; Hetherington, 1965). In à 
recent study using the RDT, Hetherington 
and Frankie (1967) found that parental con- 
flict tended to increase imitation of the domi- 
nant parent. A strong identification with a 
parent of the opposite sex may lead to a 
variety of social and school adjustment 
difficulties (Kagan, 1964). 

Nevertheless, the results relating the areas 
of parental conflict and the problems of the 
child suggest a more subtle process than 
simply the development of inappropriate seX- 
role behavior. The actual problem behavior 
of the child appears to be determined by the 
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nature of the conflict between the parents. 
Let us assume, along with Vogel and Bell, 
that the child is drawn into the husband-wife 
conflict. Now, if the child is the same sex as 
the dominant parent, he very likely follows 
the lead of that parent in resolving the con- 
flict with no consequent sex-role problem, 
although with some alienation from the non- 
dominant parent. Depending on the charac- 
teristics of the dominant parent, such a child 
may show nonconforming behavior but is not 
likely to show up in a neurotic child guidance 
clinic population. On the other hand, if the 
child is of the opposite sex of the dominant 
parent he cannot follow the lead of that 
parent without entering into a sex-role con- 
flict. Thus, a boy with a proachievement 
dominant mother may try to do well in school 
but fail because he does not want to be a 
sissy. At the same time he does not become 
a truant because this involves identification 
with the antiachievement nondominant father 
who is disparaged by the mother. While this 
explanation is speculative, it does account for 
the three major findings of the study—the 
cross-sex dominance pattern, the presence of 
hostile conflict, and the relation between 
parental conflict and child’s problem. 
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SYMPTOM TREATMENT AND SYMPTOM 
SUBSTITUTION IN ENURESIS * 


BRUCE L. BAKER? 
Yale University 


This study explored the hypotheses that (a) Aspects of the therapist-patient 


relationship are responsible for successful behavior therapy; and (b) substitute 
problems will arise following remission of symptoms. Thirty enuretic children 
were treated either by a conditioning method or by methods devised to duplicate 
its motivational aspects. Adjustment measures on the enuretic Ss and control 
children were obtained independent of treatment. Conditioning was superior to 
other methods, suggesting that successful treatment was not based solely on 
the therapist-patient relationship. Subsequent to treatment, test measures did 
not indicate a decline in adjustment; on several measures, significant im- 


provement was found. 


Two questions are frequently posed to the 
behavior therapist. First, what are the total 
effects of treatment on the individual? Tradi- 
tional psychiatry (e.g., Freud, 1959) views 
maladjustive behavior as a sign that there is 
an underlying disorder, and warns that even 
if a problem can be removed by a nondynamic 
therapy a relapse or another new problem 
will follow, since the basic causes have not 
been treated. This “symptom substitution” 
hypothesis is probably the principal theoreti- 
cal reason why many clinicians hesitate to 
use behavior therapy. Yet while the vast 
majority of behavior-therapy follow-up re- 
ports indicate that no new problems have 
arisen following successful treatment (e.g., 
Baker, Kahn, & Weiss, 1968; Lang & Lazovik, 
1963; Lazarus, 1961, 1963; Wolpe, 1961), 
there has been no well-controlled study of this 
hypothesis. 

Second, why does behavior therapy work 
when it does? Although behavior therapists 
stress the role of the learning paradigm in 
treatment, the learning theory upon which 
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some of these methods are based has been 
criticized as outmoded (Breger & McGaugh, 
1965). It seems possible that behavior ther- 
apy's success might be attributed to other 
aspects of the treatment, such as the thera- 
pist's enthusiasm, the kind of therapist-patient 
relationship which behavior-therapy methods 
establish, and the demands implicit in a 
“scientific” treatment. 

The present study was concerned with both 
of the above questions. Behavior-therapy 
methods were employed to treat enuresis in 
children, and special attention was directed 
toward the efficiency of conditioning relative 
to other methods, and the changes in adjust- 
ment immediately following cure. 

Bed-wetting seems a particularly appropri- 
ate disorder for research into the questions 
posed. Improvement can be accurately as- 
sessed, and the point of complete symptomatic 
relief is attainable and well defined. Also, 
traditional psychiatric conceptions of enuresis 
as a symptom of emotional disturbance (sum- 
marized in Mowrer, 1950) are quite different 
from the behavior therapist's view of enuresis 
as an isolated habit deficiency in the great 
majority of cases (eg. Lovibond, 1964). 
Although the conditioning treatment of enu- 
resis has been reported to be quite effective 
by many investigators (summarized in Lovi- 
bond, 1964), it is still not widely used, in 
part because of the symptom-substitution 
concern, For example, Sperling (1965) has 
recently written: “The removal of the symp- 
tom of enuresis, without providing other 
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outlets for the child, leads to a replacement 
by other symptoms . . . [p. 30]." The few 
studies which have investigated this question 
of adjustment following cure (Baller & 
Schalock, 1956; Behrle, Elkin, & Lay- 
bourne, 1956; Biering & Jespersen, 1959; 
Lovibond, 1964) are inconclusive, however, 
because they were poorly (or not at all) con- 
trolled, for the most part lacked quantified 
measures, and did not assess change inde- 
pendent of the therapy. 

The present study separated the treatment 
of enuresis and measurements of adjustment, 
so as to avoid the subtle forms of experi- 
menter bias (Rosenthal, 1966) and demand 
characteristics (Orne, 1962) which continue 
to haunt practically all therapy-outcome stud- 
ies. Measures of adjustment were taken by an- 
other E in the enuretic child's school, and 
these measures were taken under the guise of 
an entirely different research project. 


METHOD 
Subjects 


The Ss were 90 elementary school children—30 
enuretics and 60 controls. Two control Ss of the same 
sex as the enuretic S were selected randomly from 
each enuretic S’s classroom. 

The enuretic Ss were obtained primarily in re- 
sponse to a newspaper article describing the project 
and inviting participation. A preliminary phone inter- 
view obtained the child’s name, address, age, school 
and grade, and information about the enuresis. Also, 
the parent’s report that in the family physician's 
opinion there was no organic problem was a pre- 
requisite for being included. The enuretic sample 
consisted of 10 girls and 20 boys, with a median age 
of 8 yr. and a range from 6 to 12 yr. All but 4 of 
the children had been wetting since birth, and more 
than half were wet 7 nights a week. 


Apparatus 


The conditioning units used two foil pads, with 
holes in the top pad, separated by an absorbent 
Sheet, and placed under S's lower bed sheet. The 
pads were connected to a white plastic box which 
contained two 6-v. batteries, a sensitive relay, and 
a buzzer. Within seconds after the child began to 
Wet, a circuit was completed and the buzzer sounded. 
The buzzer continued to sound until S got out of 
bed and shut it off. 

The “gadgetry” device, used in the subsequent 
treatment of four waiting-list Ss, consisted of a 
bulletin board for a star chart, a container for stars, 
and a wind-up alarm clock, all mounted on a white 
Wooden stand. 


Procedure 


The 30 enuretic Ss were arranged in triads accord- 
ing to the data obtained in the phone interviews, 
and 1 S was randomly assigned to each of three 
experimental conditions. Ten children were placed 
immediately in behavior therapy with a conditioning 
device (Group C). Ten children were concurrently 
given a wake-up treatment designed to duplicate 
all features of the conditioning treatment except 
the conditioning procedure (Group WU). Ten chil- 
dren were placed on a waiting list to begin treatment 
in the near future (Group WL). The Group WL 
families were scheduled for a brief home interview, 
in which E obtained further information about the 
child's enuresis. 


Treatment 


All treatment was carried out in the child's home. 
In the first visit, E told S's parents that he was 
investigating both the process of cure and the 
changes which follow a child's becoming dry. Parents 
were asked to be particularly attentive to any 
changes in their child—especially new problems— 
both during and after treatment. 

Group C. The E explained the operation of the 
conditioning device and encouraged the child to 
practice setting it up himself. The S was kept on 
the device until he had 14 consecutive days dry; 
the device was then taken off the bed but left in the 
house for an additional 14 days, after which S was 
designated as dry. If a child was not dry (or much 
improved) after 50 reinforcements (buzzer sounds), 
he was designated a failure. If a child later relapsed, 
he was begun again on the device when possible. 

Group WU. The wake-up treatment was similar to 
the routine prescribed by many pediatricians, but 
with emphasis on regularity and thorough awakening 
(adapted from Smith, 1948). A fixed time was 
chosen before S usually wet, and the parents were 
told to awaken S every night at this time; the 
best time evolved during treatment. It was stressed 
that the child must be wide awake before going to 
the bathroom. When S had been dry for a week, 
he was not awakened for several nights during the 
next week, If he was dry on these nights, he was 
put on a new schedule whereby he would only be 
awakened the 2 nights following a wet night. 

Otherwise, treatment was the same for condition- 
ing and wake-up children. All parents aided the 
child in getting up at night and kept daily records 
of progress; all children kept progress records with 
star charts. Every child was visited at home once 
a week and phoned once a week by the author to 
review progress and offer suggestions. Parents in both 
groups were generally very cooperative in keeping 
records and appeared to be adequately following 
the treatment routine. 

Group WL. Approximately 10 wk. after the 
original interview, E phoned the 10 waiting-list 
families. One S in Group WL had become dry and 
another S had not been examined medically after 
repeated requests; consequently, these 2 Ss were not 
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begun in treatment. To explore some peripheral 
hypotheses, the remaining 8 waiting-list Ss were now 
treated in the following way. 

It seemed possible that the conditioning method 
might be effective because E brings a “gadget” into 
the home, thus in some way increasing the child’s 
motivation. To obtain pilot data on this question, 
four Ss were placed in wake-up treatment, but 
were also brought the gadget described above, with 
instructions to place it beside the bed and to set the 
alarm clock at the time designated for awakening. 
The other four Ss received the same conditioning 
treatment as the original Group C. 

Also, as an exploratory means of assessing the 
importance of the therapist’s presence, there were 
two levels of contact with E, full and partial. Two 
Ss in each group received the same amount of con- 
tact with E as Group C and Group WU Ss had 
received (full contact: one visit and one phone call 
each week). The remaining Ss received only half as 
much contact with E (partial contact: a visit one 
week, a call the next week, etc.). 


Measures of Adjustment 


Parent measures. Parents were periodically asked 
whether they had observed any changes in their 
child, and they completed two rating scales, both 
before treatment began and several weeks after 
treatment was terminated. The Adjective Check List 
(ACL) was adapted from a scale used by Sarason, 
Davidson, Lighthall, Waite, and Ruebush (1960) and 
included 20 items designed to assess personality at- 
tributes such as confidence, anxiety, and responsibil- 
ity. The scale included the 12 Sarason items which 
could readily be scored for adjustment and 8 new 
items. Items were presented in the form (Sad 321 
123 Cheerful), and parents were asked to select 
the word which better typified S and to indicate the 
extent (a little, definitely, very much) by circling 
one number. The Behavior Problem Record (BPR) 
was a checklist of 26 specific childhood problems; 
the first 15 items were a scale devised by Lovibond 
(1964) in his work with enuresis, and the last 
11 were added by the author. Many of the prob- 
lems in the BPR have been associated with enu- 
resis in the psychiatric literature or have been 
suggested as possible new symptoms if enuresis is 
removed without treating an underlying cause (e.g., 
fire-setting, thumbsucking, temper outbursts), 

Teacher ratings. The enuretic Ss in the study were 
distributed throughout 23 public schools and 2 
parochial schools; since no two enuretic Ss happened 
to be in the same classroom, 30 teachers were in- 
volved in the ratings. From each enuretic S's class 
list, two other children of the same sex were 
randomly selected as controls. 

Although £ initially contacted the superintendents 
and principals, all contacts with the teachers and 
all testing of the children were performed by another 
E (Es). The research was presented to the teachers 
as a study of creativity. No mention ever was made 
of bed-wetting. Teachers were told that their class and 
these children had been selected at random. The Es 
explained that many of the hypotheses in the project 


would be biased if the teachers were made aware of 
them, but that after the data were all gathered she 
would welcome the chance to meet with the teachers 
and explain more fully the project and how their 
work fit into it. Thus, only when the study was com- 
pleted did teachers learn of the enuresis treatment and 
their relation to it. 

The teachers were given a 67-item rating form 
for each of the three children. Each item was to be 
rated on a scale of either 1-5 or 1-7. Most of the 
items were taken from the Devereaux Elementary 
School Rating Scale (Spivack & Swift, 1966) and 
the Devereux Child Behavior Rating Scales (Spi- 
vack & Levine, 1964). Both scales had been factor 
analyzed, and in devising the teachers’ rating form 
items were drawn from each of the factors. These 
factors represented either aspects of school behavior 
(eg., academic anxiety, dependence on the teacher 
for learning) or personal maladjustment (e.g. social 
aggression, fears, inability to delay). Since most of 
the items were stated negatively and measured 
“problems,” new items were added exploring such 
things as creativity, happiness, confidence, responsi- 
bility, maturity, and capacity for play. Items were 
stated in terms of overt behavior, and teachers were 
asked to compare S's current behavior with that of 
other children his age. 

Teachers were asked to complete these question- 
naires three times throughout the year. Test I was 
just before treatment began for the enuretic S, Test 
II was about 10 wk. later, and Test III about 12 wk. 
after Test II. 

Child measures. At these same times, the enuretic 
child and one of the two control children in each 
class were tested by E», who was not aware of the 
specific hypotheses being investigated and did not 
know which child was enuretic. It was announced to 
the class that E; wanted to find out some things 
about elementary school children and that two names 
had been picked “out of a hat.” The test battery 
began with the Draw-A-Person and Draw-Your- 
Family Tests and included four TAT cards (not to 
be reported here) and a self-report questionnaire in 
two parts: (a) the Self-Image Questionnaire, a 16- 
item scale designed by the author to measure feel- 
ings which might arise from being enuretic and in- 
cluding positively and negatively stated items such 
as, “My parents are very happy with me,” and “I 
often do things I wish I had never done”; and (b) 
the Neurotic Inventory, a 20-item scale measuring 
neurotic problems devised and employed with enu- 
retics by Lovibond (1964) and including items such 
as, “I worry quite a bit about things which might 
happen,” and “Most of the time I feel down in the 
dumps.” Items in both scales were presented verbally 
to S as “things which boys [girls] I know have said,” 
and S was asked if he was like that child. (Eg. 
“Bob said: I get a lot of headaches. Are you exactly 
like Bob? Somewhat? Not at all?”)® 


8 After the first testing, it seemed that some Ss 
might be responding defensively to the questionnaire 
scales, and the need for a defensiveness measure was 
realized. Hence, the D scale employed by Sarason et 
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Neither the parents of tested Ss nor Ss themselves 
had any knowledge of the relationship between this 
school testing and treatment; if parents inquired at 
school about the testing, they too were told that 
their child was in a creativity study of randomly 
selected children. 

After the final testing it was necessary to determine 
which of the control Ss happened to be enuretic. 
The E phoned the parents of the control Ss request- 
ing that they complete a questionnaire about their 
child. Parents were sent the same questionnaire 
which the parents of enuretic Ss had filled out, except 
that two additional items were added to the BPR: 
bed-wetting and daytime wetting. 


RESULTS 
Reliability and Intercorrelation of the Scales 


Table 1 shows the Spearman-Brown cor- 
rected split-half reliabilities of the question- 
naire measures for Test I and the test-retest 
coefficients, These reliabilities are satisfactory 
for questionnaires of this type. Drawing varia- 
bles requiring subjective judgment were scored 
independently by the author and another 
judge, with Test I interjudge reliabilities rang- 
ing from .43 to .95. For analyses of drawing 
variables, an average of the two judges’ rat- 
ings was used. 


Comparison of Enuretics and Controls on 
Pretreatment Measures 


Enuretics and controls did not differ signifi- 
cantly on any item in the self-report scales, 
any factor in the teachers’ ratings, or either 
total score. None of the specific drawing 
variables in either the Draw-A-Person or 
Draw-Your-Family Tests differentiated enu- 
retics from controls. Also, two clinical psy- 
chologists making blind judgments of enuretic- 
control pairs of drawings were unable to 
identify the enuretics’ drawings any better 
than chance expectation. 


Treatment 


Figure 1 summarizes the results of the first 
10 wk. of treatment.. There was a sizable ini- 
tial drop in wetting frequency from the pre- 
level to the first 2 wk. of treatment for both 
treatment groups. Following this, the condi- 
tioning group frequency decreased over the 
10-wk. period, while the frequency of wetting 
in wake-up Ss remained essentially unchanged. 


al. (1960) was administered to most Ss in Test II 
and to all Ss in Test III. 


TABLE 1 


SPEARMAN-BRowNn CORRECTED SPLIT-HALF RELIABILI- 
TIES OF THE QUESTIONNAIRES FOR Test I 
AND TEsT-RETEST RELIABILITIES 


Cor. 


sq | Tests | Tests 
Sar rected | jr. | I-I 
Children's Neurotic Inventory 94 80 70 
Children's Self-Image Questionnaire 
Negative items T9 74 EJ 
Teachers' rating form 85 86 83 
Adjective Check List 
Mother 89 87 ir 
Father .83 92 _ 
Behavior Problem Record 
Mother 89 .62 US 
Father 93 Er = 


a For the child and teacher measures, there was an avi 
of 10 wk. between Tests I and II and an average of 22 wk. 
between Tests I and III. For the parent measures, the ques- 
tionnaire completed just before treatment was correlated with 
the measure following treatment, yielding only one coefficient. 

b The positive self-image items were unreliable and did not 
correlate with the negative items or other measures; conse- 
quently, only the negative items were considered. 


This Conditions x Weeks interaction was sig- 
nificant at p < .05 (F = 2.54, df = 4/72) by 
a Lindquist (1956) Type I analysis of vari- 
ance of the treatment data (excluding the 
prelevel). Comparing the number of wet 
nights in Weeks 1-2 and Weeks 9-10, a sec- 
ond-order £ test found the improvement in 
Group C to have been significantly greater 
than in Group WU (¢ = 2.89, dí = 18, p< 
.01). These results indicate that the two 
conditions produced differential effects over 
time. By Weeks 9-10, the conditioning group 
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Fic. 1. Mean number of wet nights a week over 
the first 10 wk. of treatment for Groups C, WU, and 
WL. 
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was wetting significantly less than the wake- 
up group (¢ = 1.70, df = 18, p = .05).* 

An interview with waiting-list families an 
average of 10.6 wk. after E's first visit found 
the reported mean frequency of wet nights 
unchanged. During this period, one Group 
WL S had become dry, but his previous fre- 
quency had been only 2 wet nights a week, 
the lowest in the study, and his decrease was 
offset by two Ss whose frequency increased. 
Contrasting the number of wet nights in the 
previous 3 wk. with the comparable 3 wk. for 
Group C and Group WU (Weeks 8-10 of 
treatment), the mean number of wet nights 
for Group WL (X — 17.7) was significantly 
greater than either the mean for Group C (X 
— 5.5; £— 4.30, df —18, p< .001) or the 
mean for Group WU (X = 9.3; t = 3.21, df 
= 18, p< 01). 

The improvement during the first 8 wk. of 
subsequent conditioning or *gadgetry" treat- 
ment for the waiting-list children was ana- 
lyzed by an analysis of variance. While the 
conditioning treatment was more effective 
than the gadgetry control, this difference was 
not statistically significant. (The Ss with the 
gadget did no better than previous wake-up 
Ss, but Ss on conditioning did not do as well 
as previous conditioning Ss.) Children who 
had been seen less frequently (partial con- 
tact) showed more improvement than Ss who 
received full contact, though this difference 
was not statistically significant. 

Treatment summary. In all, 14 Ss were be- 
gun on the conditioning device; of these, 11 
attained initial arrest. Fourteen Ss were begun 
in wake-up conditions; of these, 2 attained 
initial arrest. Ten Ss who had shown little 
improvement were switched to conditioning, 
and 7 subsequently became dry. 

Of the 30 enuretic Ss, then, 1 became dry 
without treatment, 1 was not begun in treat- 
ment, and 1 was terminated before treatment 


4 In evaluating treatment differences, it is important 
to note that Group WU Ss had the aid of being 
awakened during the night until they had progressed 
sufficiently to be left on their own, whereas Group 
C Ss did not. The implication is that Group WU Ss 
appear to be more improved than was actually the 
case. For example, six Group WU Ss had improved 
sufficiently to be left on their own some nights. The 
mean percentage of wetting was 22.5 on nights when 
they were awakened and 38.5 on nights when they 
were not awakened (t= 2.51, df — 5, p< 03). 
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could be completed. Of the remaining 27 Ss, 
74% were “cured” (dry for at least a period 
of 1 mo.), and an additional 15% were “very 
much improved” (dry for at least a 2-wk. 
period and wetting less than 1 night a week), 
During the follow up, which averaged 6 mo., 
4 cured Ss relapsed, although 2 of these who 
were retreated became dry again. 


Changes Following Treatment 


Parents’ interviews and questionnaires. It is 
important to note in assessing the results of 
parents’ measures that changes similar to those 
reported for enuretic Ss might also have been 
reported for control Ss if such comparable 
measures were available. In addition, par- 
ents’ measures may reflect a desire to please 
E. It was for these reasons, of course, that 
independent school measures were taken on 
both enuretic and control Ss. 

The most frequently reported observation 
of parents was the child’s happiness at be- 
coming dry. Many children were able for the 
first time to sleep overnight with friends or 
relatives or to go to summer camp; three boys 
immediately joined the Boy Scouts. Children 
were reported to be venturing into new ac- 
tivities, taking more responsibility, and be- 
coming more autonomous. For instance, par- 
ents made such statements as, "She seems 
able to do things for herself”; “He started a 
paper route"; *Now she's cooking, sewing, 
reading." 

While most parents reported that they had 
observed no new problems, several parents 
did report a new problem at some time. One 
boy defecated in his pants several times im- 
mediately following treatment. Another began 
to depend more on his mother to make de- 
cisions for him. A third boy developed an 
“eyeblink.” None of these proved to be last- 
ing, and each seemed to arise from specific 
new stresses, although it is not possible to say 
if they had any relation to the treatment. 

Both mothers and fathers of cured Ss indi- 
cated improvement in adjustment on the ACL 
and the BPR. Combined parents’ scores On 
the ACL showed a mean improvement of 3.5, 
significant at p= .002. Combined parents’ 
scores on the BPR showed a mean improve- 
ment (decrease in reported problems) of 
— 2.8, significant at p = .05. 

School measures. 'Test II afforded a measure 
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of changes which occurred either during treat- 
ment or immediately thereafter. There was an 
average of 10 wk. from Test I to Test II, at 
which time 17 Ss were judged improved (wet- 
ting less than half their original frequency). 
There was an average of 22 wk. from Test I to 
Test III. By Test III, 13 Ss were cured, and 
another 8 Ss were very much improved (wet- 
ting once a week or less). Children designated 
as cured had not been wetting for an average 
of almost 3 mo. 

Teacher ratings. Change scores were deter- 
mined for total teachers’ ratings from Test I 
to Test II; enuretics were divided into “im- 
proved” and “unimproved” and compared with 
each other and with their controls. For Test 
I to Test III comparisons, enuretics were 
classified as cured, very much improved, or 
slightly improved and compared with their 
respective controls. None of the mean change 
scores, nor the differences between them, ap- 
proached statistical significance; teachers 
showed a high degree of consistency in their 
ratings, and mean change scores were small. 
Change scores on only 3 of the 24 factors sig- 
nificantly differentiated cured enuretics from 
their controls, with cured enuretics increasing 
significantly more (p< .05) in academic 
anxiety, drive for academic success, and un- 
ethical behavior.^ 

Children’s self-report questionnaires. There 
was a general downward trend (fewer reported 
problems) on the questionnaires from Test I 
to Test II. The direction of changes for enu- 
retics and controls is shown in Table 2. While 
treatment was in progress or only recently 
terminated, enuretic Ss who had shown im- 
provement in bed-wetting reported fewer prob- 
lems than previously. This improvement com- 
pared favorably with changes shown in ques- 
tionnaire scores for both unimproved enuretic 
Ss and control Ss. 

In analyzing changes from Test I to Test 
III, special attention was given to Ss desig- 
nated as cured,? as these Ss would be most 


5 Actually, cured enuretics only increased somewhat 
in drive for academic success and unethical behavior, 
but controls decreased in score on these variables, 
resulting in the significant difference. However, there 
was a large Test I difference for these Ss on these 
factors, and even after the above changes the control 
Ss still scored higher than enuretics. 

6 One cured S and his control could not be tested 
a third time due to lack of cooperation from the 


TABLE 2 


DrRECTION OF CHANGE IN QUESTIONNAIRE SCORES 
FROM Test I ro Test II 


"PN. Improved) Urim- 
Questionnaire ahd Ieroveda Controls 

Neurotic Inventory 

Increase 0 3 10 

Same 0 3 1 

Decrease 17 7 19 
Self-Image Questionnaire? 

Increase 2 4 6 

Same 1 1 1 

Decrease 14 8 23 


Note.—Decrease indicates improvement (fewer reported 
problems). 

ax? = 15.19, p < .01. 

by? = 2.14, p > .10. 


likely to evidence adverse changes according 
to the symptom-substitution hypothesis. On 
the Neurotic Inventory, no cured S reported 
more problems, and on the Self-Image Ques- 
tionnaire only one S increased in score. The 
mean improvement for cured enuretics was 
greater than the mean improvement for their 
controls on both the Neurotic Inventory (t 
= 1.49, df = 20, p = .08) and the Self-Image 
Questionnaire (¢= 1.79, df = 20, p < .05). 
Combining the questionnaires, the mean im- 
provement from Test I to Test III for cured 
enuretics (X= —15.6) was significantly 
greater than for their controls (X = —6.4) at 
p= 03 (t = 193, df = 20) 

There were no questionnaire items on which 
cured and very much improved enuretics 
showed a significant change for the worse or 
their controls improved more than the enuretic 
Ss. Cured and very much improved enuretics 
improved significantly more than did the con- 
trols (p < .05) on the following items: “I’m 
always being scolded or punished by some- 
one”; “I’ve often been punished for nothing”; 
*T often feel sick in the stomach”; “Most of 
the time I feel down in the dumps.” 

Drawings. Scoring of specific variables on 
the Draw-A-Person and Draw-Your-Family 


school. The control of another cured S had moved 
away, so this pair was not included. In all, then, 
there were 11 cured-control pairs. 3 

7 One explanation for this decrease would be in- 
creased defensiveness. Since no D-scale scores were 
available for Test I, the only change which could 
be analyzed was from Test II to Test III. The 
defensiveness scores were fairly stable, and the 
changes did not differentiate cured enuretics and 
controls. 
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Tests failed to reveal significant changes. A 
total score was derived for each S on the 
former, based on conceptions in the drawing 
literature (e.g, Machover, 1951; Mazen, 
1963) of what constitutes an improvement in 
adjustment. To weight each variable equally, 
only the sign of change was considered. Signs 
of a shift toward better adjustment were an 
increase in number of body parts, size of the 
person, number of colors used, appropriate- 
ness of colors, amount of clothing, smile and 
movement, and a decrease in erasures, pres- 
sure, distortion, bizarreness, paper chopping, 
transparency, stick figures, drawing side views, 
drawing the opposite sex, and negative affect. 
Cured enuretics showed a mean improvement 
(X = 1.36, p = .06) and differed significantly 
from their controls who showed a slight 
worsening (X = —1.45; £, second-order dif- 
ference = 2.29, df = 20, p= .02). 

Also, two clinical psychologists with the 
diplomate and 15 and 30 yr. of experience in 
therapy and diagnosis were asked to evaluate 
the Draw-A-Person Test drawings for expres- 
sions of maladjustment. Each of the 30 enu- 
retic and 30 control Ss was randomly assigned 
a number between 1 and 60, and each S’s 
three drawings (Tests I, II, and III) were 
randomly ordered in one of the six possible 
permutations. Judges were told the age, race, 
and sex of each child and were to rank order 
the three drawings with respect to overall 
adjustment. In assessing the drawings, then, a 
judge knew neither which drawings had been 
done by enuretic and nonenuretic Ss nor the 
order in which they had been drawn. 

In this type of analysis, differences between 
groups are reflected in the slope of the curves 
connecting mean ranks across testings. As 
seen in Figure 2, for each judge the mean 
ranks for the cured enuretics decrease across 
the three tests, whereas the control slope is 
essentially flat. The improvement in adjust- 
ment rank from Test I to Test III for cured 
enuretics was significant for both Judge 1 
(t= 3.81, df = 10, p= .002) and Judge 2 
(t = 2.36, df = 10, p = .02). When this Test 
I-Test III change score for cured enuretics 
was compared with a similar score for their 
controls, the second-order ¢ test showed that 
improvement in adjustment for cured enu- 
retics was significantly greater than for con- 
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Fic. 2. Judges’ ranks of overall adjustment in 
Draw-A-Person Test drawings; mean ranks for each 
judge are plotted over the three testings for cured 
enuretic Ss and their control Ss. (Lower ranks indi- 
cate better adjustment.) 


trols—for Judge 1, p = .007 (¢ = 2.72, df= 
20) and for Judge 2, p = .004 (t = 2.88, df 
= 20). 


Discussion 


The present study suggests, then, that it is 
extremely doubtful that the successful condi- 
tioning treatment of bed-wetting can be ex- 
plained as some kind of “transference cure,” 
based on only the therapist-patient relation- 
ship and the motivation which it engenders. 
The wake-up procedure was not as effective 
as the total behavior-therapy treatment, de- 
spite an equal amount of therapist contact. 
These findings are similar to those of Lang 
and Lazovik (1963), Lazarus (1961), Paul 
(1966), and Davison (1968) who contrasted 
systematic desensitization to other methods 
with equal therapist contact. 

Another approach to this question of the 
relative importance of the therapist and his 
technique is to withdraw the therapist to some 
extent and to see if treatment becomes less 
effective. The exploratory findings indicated 
that treatment is just as effective with less 
therapist contact, a result consonant with the 
fact that many parents buy conditioning de- 
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vices and cure their child's enuresis with no 
professional help at all. 

The measures of adjustment did not show a 
worsening in adjustment subsequent to treat- 
ment; rather, other improvements were found. 
Children were reported to be happier, less 
anxious, and more grown-up, assuming re- 
sponsibility and venturing into new activities. 
The changes on childrens’ self-report items 


about punishment suggest that the cure may 


have had beneficial effects on the parent-child 
relationship or at least the child's perception 
of it. Finally, the drawings showed a dramatic 
improvement in what might be termed “self- 
image." 

Whereas enuresis is considered to be a 
symptom of emotional disturbance by many 
clinicians, traditional predictions about treat- 
ment and subsequent adjustment for such 
symptoms were not upheld. This might sug- 
gest that the symptom-substitution hypothe- 
sis is untenable and that neurotic symptoms, 
in the traditional dynamic sense, can be re- 
moved with equanimity despite the psycho- 
logical gains they afford. Perhaps in those 
Cases where treatment succeeds, the benefits 
derived from being dry outweigh such “pri- 
mary gains." On the other hand, it is possible 
that new symptoms did not arise because bed- 
wetting is simply a habit deficiency rather 
than an expression of, and outlet for, internal 
conflict. In any case, the dangers of a direct 
treatment of enuresis seem to have been over- 
stated, and similar research on other classical 
disorders might be of considerable value in 
further understanding the symptom-substitu- 
tion issue and the more basic question of 
symptom formation. 
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ORAL IMAGERY AND ALCOHOLISM 


SHARON BERTRAND ann JOSEPH MASLING 
State University of New York at Buffalo 


Twenty Veterans Administration patients with histories of heavy daily alcoholic 
consumption were matched on age, education, IQ, and diagnosis with 20 
patients with no history of reliance on alcohol, and their scores on a Rorschach 
measure of orality were compared. As predicted, the alcoholics gave more oral 
dependent responses than the controls (p= 01). Although total scores on oral 
sadism did not discriminate between groups, the alcoholic Ss gave more re- 
sponses on two of the oral sadistic subcategories, burdens and overwhelming 
figures. The striking similarity of subcategory performance of this group to an 
earlier group of obese Israelis was discussed. 


At one time or another almost every theory 
of personality, ranging from psychoanalysis to 
learning theory, and almost every method of 
data collection, ranging from anthropological 
field studies to factor analysis of MMPI items, 
has been used to try to account for the eti- 
ology of alcoholism. This conglomeration has 
produced a mixture of clinical observations, 
unrelated data, and free-floating speculation. 
Perhaps the most frequently suggested pre- 
disposing factor is a state of unsatisfied oral 
dependent needs. Fenichel (1945), for exam- 
ple, described alcoholics as being “character- 
ized by their oral and narcissistic premorbid 
personalities, usually as a result of difficult 
family constellations in which specific oral 
frustrations are created [p. 379].” A standard 
psychiatric text states that “evidence of re- 
sidual oral traits which are often seen in 
adults include such things as alcoholism . . . 
[English & Finch, 1954, pp. 17-18].” 

The importance of oral dependent needs in 
the etiology of alcoholism has received partial 
support from anthropological and experi- 
mental studies. Bacon, Barry, and Child 
(1965) in reviewing 110 preliterate societies 
concluded that heavy consumption of alcohol 
results in part from unsatisfied dependency 
needs. In the United States a longitudinal 
study of alcoholics noted that the effects of 
alcohol and the act of drinking help to resolve 
conflict between dependency needs and the 
pressure from society to be independent (Mc- 
Cord & McCord, 1960). The greater use of 
field-dependent solutions by alcoholics has 
been repeatedly demonstrated (Goldstein & 
Chotlos, 1965; Karp & Konstadt, 1965; Karp, 
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Poster, & Goodman, 1963; Karp, Witkin, & 
Goodenough, 1965; Witkin, Karp, & Good- 
enough, 1959). Wiener (1956) compared posi- 
tive, neutral, and negative oral responses on 
the Rorschach test of alcoholics and neurotic 
depressives and found no differences between 
groups in the total number of oral percepts, 
although the alcoholics gave significantly more 
positive oral responses, and the depressives 
gave more negative oral responses. 

The oral stage of development is typically 
seen as progressing through two periods—an 
oral dependent phase, characterized by suck- 
ing, dependence, passivity, and immaturity, 
and an oral sadistic phase, characterized by 
biting chewing, and ambivalence (Blum, 
1953). Psychoanalytic hypotheses about alco- 
holism have emphasized only the oral de- 
pendent phase, not the oral sadistic phase. 
Measures derived from the Rorschach test of 
the two phases of the oral need have demon- 
strated validity in discriminating between an 
obese and nonobese population (Masling, 
Rabie, & Blondheim, 1967) and between stu- 
dents who yielded and those who did not in 
an Asch-type conformity experiment (Mas- 
ling, Weiss, & Rothschild, 1968). 

The hypothesis of this study is that alco- 
holic Ss will give more oral dependent re- 
sponses on the Rorschach test than a control 
group of nonalcoholic Ss. No differences are 
hypothesized for oral sadistic responses. 


METHOD 
Subjects 


The Ss were 40 Caucasian male veterans receiving 
treatment in the psychiatric unit of a Veterans Ad- 
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ORAL IMAGERY 
TABLE 1 
DESCRIPTIONS OF EXPERIMENTAL AND 
CONTROL SUBJECTS 
Variable meal | Control 
oap group 
Age 
Range 24-60 26-58 
39.25 40.30 
Li 8.20 8.06 
IQ 
Range 87-135 | 90-127 
106.75 | 107.15 
o 13.79 10.12 
Education (no. yr.) 
Range 5-17 8-16 
11.75 11.25 
c 3.51 2.19 
Diagnosis 
Schizophrenic 
Total 11 11 
Chronic undifferentiated 6 4 
Paranoid 4 5 
Schizoaffective 1 1 
Catatonic 0 1 
Personality disorders 
"Total 4 4 
Schizoid personality 1 2 
Passive-aggressive 
Personality 3 2 
Psychoneurotic 
"Total 5 5 
Anxiety reaction i 2 
Conversion reaction 0 1 
Depressive reaction 4 2 


ministration hospital.1 The experimental group con- 
sisted of 20 patients whose therapists had classified 
them as being strongly dependent upon alcohol as a 
means of coping with their life situations; while 
none had a primary diagnosis as an alcoholic, all had 
a history of heavy daily alcoholic consumption 
throughout their adult lives. Their drinking was un- 
controllable, continual, and interfered seriously with 
their vocational and social adjustment. The control 
Ss were selected from the same psychiatric unit and 
were either total abstainers from the use of alcohol 
or were light social drinkers. Control and alcoholic 
Ss were matched for age, number of years of educa- 
tion, scores on the WAIS, and diagnosis. 


Testing Procedure 


All Ss were given individual psychological tests 
within the first 3 wk. of hospitalization. Typically a 
Patient who entered the hospital showing signs of 
Prolonged drinking would be allowed several days to 
“dry out” before taking the tests. The 40 Ss were 
tested by 22 different examiners. Each S was given 


+The authors are indebted to Martin Staiman, 
chief psychologist at the Buffalo Veterans Adminis- 
tration Hospital, for permission to use patient test 
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an individual Rorschach, but only the free-associa- 
tion responses were used in this study. 


Scoring for Oral Imagery 


The definitions of oral dependence and oral sadism 
were adapted from Schafer (1954). In this simple 
scoring method the subcategories of oral dependence 
and oral sadism are defined and examples given. The 
following subcategories were used: oral dependence— 
foods, food sources, food objects, food providers, 
passive food receiver, food organs, supplicants, 
nurturers, gifts and gift givers, good luck, oral ac- 
tivity, needing help or being passive, pregnancy and 
reproductive anatomy; oral sadism—destruction, ag- 
gression, or maiming, overwhelming figures, figures 
that deprive, deprivation, faulty oral capacity, oral 
assault, burdens. 

To prevent contamination of the criterion by al- 
coholic responses on the Rorschach, a separate cate- 
gory was established for all alcohol, drinking, and 
liquid percepts. 


RESULTS 
Reliability 


The Rorschach tests, with all identification 
removed, were scored independently by two 
judges.? There were 1,047 responses on the 40 
Rorschach protocols; of this number there 
were 43 disagreements on the presence or ab- 
sence of an oral percept, producing 96% agree- 
ment on this dimension. The disagreements 
were resolved by the two judges, again with- 
out knowledge of the identity of Ss. Once 
agreement was obtained on the presence of an 
oral percept, the judges agreed on 96% of the 
responses assigned to the oral dependent cate- 
gory and on 93% of the responses assigned to 
the oral sadistic category. 


Total Number of Responses and Oral Percepts 


The correlation between the number of 
responses each S produced and the number of 
oral responses he gave was .62 (p= .005); 
the correlation between the number of oral 
dependent responses and oral sadistic re- 
sponses was .32 (p > .05). 


Comparison of Alcoholic and Control Subjects 


The mean number of Rorschach responses 
was 31.05 for the alcoholic Ss and 21.30 for 
the controls, a difference significant at the .02 
level (£ = 2.52). In order to control for the 
effect of this difference in productivity, the 


? The raters were the first author and Lillie Weiss. 
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oral percepts for each S were divided by the 
total number of responses he gave, so that the 
basic score for each S was expressed as a pro- 
portion. Using proportions, and excluding al- 
cohol and drinking responses, it was found 
that the alcoholic Ss gave more oral dependent 
responses than the controls (x? = 6.40, p= 
01, one-tailed test). The two groups did not 
differ in the overall use of oral sadistic re- 
sponses. 

Of the 13 oral dependent subcategories, 
there were significant differences between 
groups on 2; compared with the responses of 
the control Ss, a greater percentage of alco- 
holic responses was in the food subcategory 
(p = .05) and food organ subcategory (P = 
.05). While there were no differences between 
groups on the overall oral sadistic category, 
the alcoholics gave a greater percentage of re- 
sponses than the controls in 2 of the 7 
subcategories—burdens (p= .02) and over- 
whelming figures (p = .05). Table 2 presents 
the distribution of responses for the alcoholic 
and control Ss on those subcategories in which 
significant differences were found. (For pur- 
poses of comparison, Table 2 also lists sub- 
category differences found in an earlier study 
of Israeli obese—Masling, Rabie, & Blond- 
heim, 1967.) 

Inspection of the individual test records 
uncovered a surprising relationship: the sig- 
nificant differences discussed above were due 
primarily to the response patterns of 10 of 
the alcoholics. Every one of these 10 Ss gave 
a burden response (e.g., a camel, two men 
trying to lift a heavy object) as well as an 
overwhelming figure response (e.g., an alli- 
gator with his mouth open, a witch); the 
pairing of these two responses was found 
neither in the remainder of the alcoholic Ss 
nor in the control Ss. In addition, this sub- 
sample of 10 alcoholic Ss gave more oral de- 
pendent responses (M = 3.6) than either the 
other alcoholic Ss (M = 2.0) or control Ss (M 
= 1.0). When the background factors of these 
10 Ss were examined, it was found that they 
tended to be older, brighter, and better edu- 
cated than the other Ss, although none of the 
differences was significant. 


8 There were four such responses given, all by 
alcoholic Ss. 


TABLE 2 


Comparison OF ALCOHOLIC, OBESE, AND CONTROL 
SUBJECTS ON SELECTED SUBCATEGORIES OF 
RORSCHACH RESPONSES 


Aohau 
s with Obesity study: 
zecords with | Distribution of sub- 
‘subcategory | Category responses? 
Response subcategory | Tesponses* 
Alco- | Con- 
holic | trol | Obese Sso | Control 
Sg | See 
Oral dependence 
Food 9t 3 [Not scored|Not scored 
Food organs ge 2 ge 0 
Nurturers 1 0 ™ 1 
Supplicants 0 1 6* 1 
Oral sadism 
Overwhelming figures | 10** 4 146 1 
Burdens 10% 3 8* 2 


Probabilities determined by chi-square. 

b Probabilities determined by Fisher exact test. Study by 
Masling, ae and Blondheim (1967). 

oN = 20. 


aN = 16, 


Discussion 


These results support clinical observations 
of the oral characteristics of alcoholics and 
also partially confirm the earlier results of 
Weiner (1956) who found that alcoholics give 
more positive oral responses (essentially the 
oral dependent category of the present study) 
than depressive Ss. While the hypothesis was 
confirmed, this type of correlational investiga- 
tion can obviously make no statement regard- 
ing cause and effect; from the evidence re- 
ported here it is as reasonable to conclude 
that oral dependence results from an alcoholic 
condition as it does to conclude that it 
produces it. 

Two findings were especially impressive, 
because both were unexpected. Within the 
alcoholic group, the tendency to give oral 
responses varied considerably, with a sub- 
group of 10 reporting many more oral per- 
cepts than the others. Not only did these 10 
report more oral responses, but they also 
paired two types of images—overwhelming 
figures and burdens—which on the surface 
seem to have few common elements. There 
are many kinds of alcoholic personality pat- 
terns (Jones, 1963; Pollner, 1965), and this 


? 


study may have inadvertently isolated one 
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such group which represents oral dependent 


needs in a relatively strong manner. The pair- 


ing of overwhelming figures and burden 
responses in these 10 Ss suggests that these 
percepts may share some symbolic meaning 
for those with an oral pathology. 

The second unexpected result was the close 
parallel in the subcategory patterns in this 
American male alcoholic population and in a 
predominantly female population of obese 
Israeli (Masling, Rabie, & Blondheim, 1967). 
In both groups the clinical populations gave 
significantly more food organ responses, over- 
whelming figure responses, and burden re- 
sponses than their respective control groups. 
When groups of different sex, different reli- 
gious and national origin, and different pa- 
thology show similar test patterns, it is likely 
that a meaningful phenomenon is involved. 
Logically this resemblance could arise if the 
obese were alcoholic or if the alcoholics were 
obese. Unfortunately, quantitative evidence is 
not available on this point, but experienced 
Observers all report the almost total absence 
of alcoholism in either the Jewish or Moslem 
residents of Israel (Jones, 1963).* Of course, 
the American alcoholics could be obese, but 
the staff members of this particular Veterans 
Administration hospital have reported that 
many Ss used in this study were so severely 
dependent on alcohol that they ate irregularly 
and poorly. 

The similarity between alcoholic and obese 
Rorschach performance may be related to 
early deprivation of oral gratification in in- 
fancy and subsequent fixation at the oral 
level, as suggested by psychoanalysis. Perhaps 
it is related to the learning in infancy of oral 
behavior as a means of reducing anxiety, as a 
learning theory would suggest. In any event, 
the present study on alcoholics in conjunction 
with the earlier study on obesity has estab- 

* A friend of the second author, a physician with 
20 yr. of experience in Israel, has reported that in all 


those years he has seen only one alcoholic—an 
American Marine. 


lished a parallel in responding to inkblots 
of two groups who have only one character- 
istic in common—an overuse of food or drink. 
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HABITUATION OF THE ORIENTING RESPONSE AS A 
FUNCTION OF INDIVIDUAL DIFFERENCES IN 
ANXIETY AND AUTONOMIC LABILITY * 


EDWARD S. KATKIN aw» ROBERT J. McCUBBIN 
State University of New York at Buffalo 


This study evaluated the role of individual differences in anxiety and auto- 
nomic lability on the habituation of GSR responses to auditory stimuli of 
moderate and very low intensity. Sixty Ss were used. They were divided into 
high- and low-anxious groups on the basis of Taylor MA scale scores and 
high and low autonomic lability groups on the basis of spontaneous GSR 
activity during a rest period. No relationship between Taylor score and GSR 
habituation was obtained for either stimulus intensity. Autonomically labile Ss 
showed no habituation to the moderate tone, but did habituate to the low 
intensity tone. Autonomically stabile Ss habituated to both tones. It was 
concluded that individual differences in autonomic lability may predict whether 
an S will respond to moderate intensities with orienting or defensive reactions. 


The purpose of this study was to evaluate 
the role of anxiety in the elicitation and 
habituation of orienting and defensive re- 
sponses, two systems described by Sokolov 
(1960, 1963a, 1963b). The orienting response 
(OR), as Sokolov has conceptualized it, is 
a generalized response system possessing 
autonomic components, which is presumed to 
serve as a feedback mechanism facilitating 
peripheral receptor sensitivity. The defense 
reflex (DR), a second general response 
system, also possesses autonomic components, 
but it is distinguishable from the OR. Ac- 
cording to Sokolov, ORs are elicited by low 
intensity “novel” stimuli, and DRs are elicited 
by high intensity, or noxious, varieties of 
stimulation. Furthermore, ORs habituate 
rapidly to repeated stimulation, whereas DRs 
do not and may even be intensified by 
repeated stimulation. 

Although much research has been directed 
toward clarifying the effects of external stimu- 
lus properties on orienting behavior (Geer, 
1966; Uno & Grings, 1965; Zimny & 
Schwabe, 1965), less attention has been given 


1The authors are indebted to S. Geisser for his 
aid in the statistical analyses. This research was 
supported in part by Grant No. MH-11989 from the 
National Institute of Mental Health, United States 
Public Health Service, and in part by a grant-in-aid 
from the Society of the Sigma Xi. Portions of this 
paper were presented by the second author at the 
meeting of the Eastern Psychological Association, 
Boston, April 1967. 


54 


to the effects of individual differences on the 
elicitation and habituation of ORs and DRs 
(Maltzman & Raskin, 1965). It is clear that 
an E can administer stimuli so intense that 
most Ss will judge them to be noxious; 
similarly, very weak stimuli can be admin- 
istered so that few, if any, Ss will respond 
to them as if they were noxious. When moder- 
ately intense stimuli are presented, the influ- 
ence of individual differences on response 
characteristics becomes crucial; some Ss are 
likely to respond as if the stimuli were quite 
intense, and other Ss are likely to respond as 
if they were not at all intense. Among the 
variety of individual difference variables of 
importance in contemporary personality re- 
search and theory, anxiety stands out as one 
which is related frequently to autonomic 
responsivity (see Martin, 1961). With respect 
to the issue at hand, it might be expected 
that anxious Ss would interpret moderately 
intense stimuli as noxious more frequently 
than nonanxious ones. Thus, anxious Ss would 
be expected to show autonomic responses 
characteristic of the DR to moderately in- 
tense stimuli, that is, responses which resist 
habituation; nonanxious Ss would be expected 
to show autonomic responses characteristic of 
the OR, that is, those which habituate rap- 
idly. Therefore, the present experiment en- 
deavored to investigate autonomic responses 
to moderately intense levels of auditory 
stimulation for Ss who differ in anxiety level. 


HABITUATION OF THE ORIENTING RESPONSE 


As a comparison, stimuli of very low intensity 
were also administered, since it would be ex- 
pected that such stimuli would elicit ORs 
from all Ss, irrespective of anxiety level. 

Although anxiety is a construct which is 
presumed to have autonomic response com- 
ponents, much research reveals no relationship 
between clinically or psychometrically evalu- 
ated anxiety and autonomic response patterns 
(see Martin, 1961; Sarason, 1960). The 
clinical measures and psychometric scales 
used in most of this research are considered 
to be relatively valid indexes of anxiety, and 
the physiological measures with which they 
are compared are considered to reflect accu- 
rately autonomic nervous system (ANS) 
activity. Thus, the failure to obtain relation- 
ships between these indexes cannot be attrib- 
uted easily to the invalidity of one or another 
of the measures employed. Rather, it may be 
that “anxiety” is not a unitary construct, but 
a complex one, composed at least of an 
autonomic component, reflected in ANS activ- 
ity, and a “perceptual” or “cognitive” com- 
ponent, reflected in an individual’s ability to 
recognize and report his ANS responses. Thus, 
one explanation for the noticeable lack of a 
relationship between self-report measures of 
anxiety and ANS activity may be simply 
that Ss who are aroused equally possess dif- 
ferential abilities to recognize and/or report 
the internal phenomena (Mandler, Mandler, 
& Uviller, 1958). The present experiment 
attempted to use both psychometric and 
physiological indexes as independent measures 
of anxiety. 

Many  psychometric instruments have 
earned their labels as "anxiety scales," but 
few physiological indexes have been so 
blessed. In the present investigation a mea- 
sure of autonomic lability was used as an indi- 
vidual difference measure and employed as 
if it were an index of anxiety. The physio- 
logical measure employed, nonspecific (NS) 
electrodermal activity, is an index of spon- 
taneous fluctuations in skin resistance which 
occur in the absence of external stimulation. 
NS activity has been shown to be an effective 
Monotonic indicator of central nervous system 
activation (Greiner & Burch, 1955) and 
autonomic arousal (Silverman, Cohen, & 
Shmavonian, 1959), as well as a measure of 
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transitory response to stress (Katkin, 1965, 
1966). Furthermore, Rappaport and Katkin 
(1967) have found that NS activity during 
moderately stressful situations is related to 
scores on the Taylor Manifest Anxiety (MA) 
scale (Taylor, 1953). 

The paper-and-pencil measure employed in 
this study was the MA scale. This test is 
widely used and has been demonstrated to 
be related to clinical judgments of anxiety 
(Buss, 1955; Buss, Wiener, Durkee, & Baer, 
1955; Eriksen & Davids, 1955; Hoyt & 
Magoon, 1954), as well as to other scalar 
measures of anxiety and general neuroticism 
(Bendig, 1957; Cattell & Scheier, 1958; 
Lauterbach, 1958). Furthermore, the MA 
scale has been described by its developers as 
an index of emotional responsivity, a hypo- 
thetical component of general drive (J. T. 
Spence & K. W. Spence, 1966; K. W. Spence, 
1958). As such it has been compared directly 
with such concepts as “activation” and 
“arousal” (see Malmo, 1959). 

The galvanic skin response (GSR) was 
chosen as the index of orienting and defensive 
behavior, and the following hypotheses were 
formulated: (a) Moderately intense stimuli 
will elicit larger GSRs than low intensity 
stimuli for all Ss, irrespective of anxiety 
level; (b) for moderately intense stimula- 
tion, nonanxious Ss will show rapid habitua- 
tion of the GSR, typical of the OR, 
while anxious Ss will not show habituation, 
typical of the DR; and (c) for very low 
intensity stimuli, there will be no difference 
in rate of habituation of GSR for the two 
groups. 

METHOD 
Subjects 

The MA scale and the MMPI L scale were admin- 
istered to all male undergraduates in the introductory 
psychology course as part of a larger battery of 
personality tests, Subsequently, male Ss who scored 
below 7 on the L scale were recruited for the experi- 
ment. They were contacted by phone and asked to 
participate in an experiment concerning “Dsycho- 
physiology.” In this manner, 60 volunteers were 
recruited; they received one credit toward fulfilling 
a course requirement. 


Apparatus 


All Ss were run in a soundproof chamber equipped 
with a comfortable reclining lounge chair. A one-way 
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mirror behind the chair allowed E to monitor Ss 
visually from outside the chamber in order to observe 
possible artifact-producing movements. Skin resist- 
ance was detected by a pair of Beckman silver- 
silver chloride electrodes, employing Beckman NaCl 
electrode paste as a contact medium. The active 
electrode was placed on the left palm and the refer- 
ence electrode on the left forearm, The electrodes 
were secured with stomaseal collars and masking tape 
around the hand for additional support. 

The amplification unit and polygraph were located 
outside the chamber. Skin resistance was amplified 
by a two-channel GSR amplifier, Model 201, manu- 
factured by Bio-physical Research instruments. This 
amplifier, which with the 1 sq.cm. Beckman elec- 
trodes produced an effective current density of 20 
microamp./sq.cm. for exosomatic recording, provided 
both high-gain and low-gain amplication of the 
skin-resistance signal. The high-gain amplification 
allowed for the recording of small amplitude NS 
activity, and the low-gain amplification allowed for 
the recording of larger amplitude electrodermal 
responses to the stimuli, and for the assessment of 
basal skin resistance levels. Both signals were 
recorded on a Grass polygraph, Model 5D. 

Auditory stimulation was delivered to Ss through 
a headset by an Eico Audio Frequency Generator, 
Model 377. Stimulus duration and interstimulus 
intervals were controlled by a series of Hunter 
timers. 


Procedure 


Ss were initially assigned randomly to either a 
moderate intensity (M Int) or a low intensity (L Int) 
condition, but toward the end of the experiment ran- 
domness was abandoned in order to assure the 
assignment of 30 Ss to each group. 

At the beginning of the experiment all Ss were 
told: 


We are interested in how the autonomic nervous 
system reacts to stimulation. I would like you to 
sit here and relax while we measure your responses 
to some tones. You will hear them through these 
earphones, but first there will be a rest period 
during which you should try to relax. Be sure 
that you don’t fall asleep, however. 


The E then attached the electrodes to Ss, adjusted 
the headset for maximum comíort, and urged the 
Ss not to move their bodies, particularly their arms. 

After a 10-min. adaptation period in which Ss 
acclimated to the situation, and resting levels of NS 
activity were recorded, stimulation began. All Ss 
received 15 presentations of a 1,000 cps sine-wave 
tone which lasted 2 sec. and was presented at variable 
intervals ranging from 30 to 90 sec., with a mean 
of 60 sec. For Ss in the M Int group, the volume 
level of the generator was set at 80; for Ss in the 
L Int group, the volume level was set at 20. These 
settings were chosen because previous experience sug- 
gested that they were easily distinguishable; the L 
Int tone was faint but audible, whereas the M Int 
tone was quite noticeable, but not noxious. 
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After 15 presentations of the stimuli, Ss were dis- 
missed from the experiment. Data obtained from 1 
S in the L Int condition were found to be unscorable, 
leaving 59 Ss in the final sample. 


RESULTS 


The GSR index of the OR was expressed as 
the change in low conductance, which was de- 
termined for each of the stimuli in the follow- 
ing manner: First, the prestimulus skin-re- 
sistance level was determined and designated 
R3; then the skin-resistance level at the point 
of maximum deflection after stimulation was 
determined and designated Rz. Response mag- 
nitude was then calculated by the formula 
log 1/Rə — log 1/Rı. The 15 scores deter- 
mined in this manner were then reduced to 
five trial blocks of three trials each. 

The relationships among stimulus intensity, 
habituation across trial blocks, and scores on 
the MA scale will be discussed first. Then the 
relationships among stimulus intensity, habit- 
uation, and NS activity will be evaluated. 
Finally the relationship between MA scale 
scores and NS activity will be presented. 


MA Scale Scores, Stimulus Intensity, and 
Habituation of GSR 


Scores on the MA scale ranged from 4 to 
35, with a median of 14. Those Ss who scored 
between 4 and 13 were designated low anx- 
ious (Lo A), and those who scored between 
15 and 35 were designated high anxious 
(Hi A). Four Ss who scored at the median 
were eliminated. Thus, four experimental 
groups were constituted: M Int-Hi A (N = 
14), M Int-Lo A (N —15), L Int-Hi A 
(N = 14), and L Int-Lo A (N = 12). To 
provide equal cell frequencies for analysis of 
variance, 2 Ss were eliminated randomly 
from the M Int-Hi A group, 3 from the 
M Int-Lo A group, and 2 from the L Int- 
Hi A group. Thus the final data pool was 
based upon 48 Ss, divided equally among the 
four conditions. 

The mean GSR amplitudes for Ss in the 
four experimental groups across five trial 
blocks are presented in Figure 1, in which 
it is evident that responses were greater in 
all trial blocks for Ss who received the moder- 
ate stimulation and that they diminished 
across trial blocks for all Ss, irrespective of 
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MEAN RESPONSE AMPLITUDE IN LOG 
CONDUCTANCE CHANGE 


BLOCKS OF THREE TRIALS 


Fic. 1. Mean amplitude of response (change in log 
conductance) for Hi A and Lo A Ss in the M Int 
and L Int groups across five trial blocks. 


anxiety level. These observations were con- 
firmed by an analysis of variance (Type III; 
Lindquist, 1953) which yielded significant 
main effects for stimulus intensity (F = 20.19, 
df= 1/44, p< .001) and for trial blocks 
(F = 18.00, df = 4/176, p < .001). No main 
effect for MA scale scores and no interactions 
of MA scale scores with intensity or trial 
blocks were found. 


NS Activity, Stimulus Intensity, and 
Habituation of the GSR 


NS activity level was defined as the number 
of spontaneous fluctuations in skin resistance 
of at least 100 ohms magnitude occurring in 
the 2-min. period just prior to presentation 
of the first stimulus. This allowed Ss 8 min. 
in which to adjust to the experimental situa- 
tion, thereby eliminating possible individual 
differences in NS activity which might have 
arisen from the newness of the situation. NS 
scores ranged from 0 to 34, with a median of 
9. Twenty-five Ss whose NS responses ranged 
from 0 to 8 were designated the low NS 
(LNS) group, and 25 Ss whose responses 
ranged from 11 to 34 were designated the high 
NS (HNS) group. Five Ss whose NS levels 
were at the median and 4 Ss who had 10 


responses were eliminated from the analysis. 
In this manner four groups of Ss were again 
constituted: M Int-HNS (N = 15), M Int- 
LNS (N — 11), L Int-HNS (N = 10), and 
L Int-LNS (N — 14). To provide equal cell 
frequencies for analysis of variance, 5 Ss 
were eliminated randomly from the M Int- 
HNS group, 1 from the M Int-LNS group, 
and 4 from the L Int-LNS group. Thus, 
the final analysis of these data was based upon 
a sample of 40 Ss, equally divided among the 
four experimental groups. 

Mean change in log conductance for Ss in 
the four experimental groups across the five 
trial blocks is presented in Figure 2, which 
shows that the response amplitudes for Ss in 
the two M Int groups appear to be greater 
than comparable responses for Ss in the two 
L Int groups in all trial blocks. Figure 2 also 
indicates that the habituation rate for the M 
Int-LNS group appears steeper than that for 
the M Int-HNS group, and that the habitu- 
ation rates for Ss in the two L Int groups are 
similar. Analysis of variance of these data 
(Type III) confirmed these observations, 
yielding a significant NS Activity X Stimulus 
Intensity X Trial Blocks interaction (F = 
3.80, df— 4/144, p< 01), a significant 
Stimulus Intensity X Trial Blocks interaction 
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MEAN RESPONSE AMPLITUDE IN LOG 
CONDUCTANCE CHANGE 


BLOCKS OF THREE TRIALS 
Fic. 2. Mean amplitude of response (change in 
log conductance) for HNS and LNS Ss in the M Int 
and L Int groups across five trial blocks. 


(F = 3.45, df = 4/144, p < .025), and a sig- 
nificant NS Activity X Trial Blocks interac- 
tion (F = 3.72, df = 4/144, p < .01), as well 
as main effects for stimulus intensity (F = 
11.57, df = 1/36, p < .005) and trial blocks 
(F = 15.70, df = 4/144, p< .001). It may 
, also be noted that there were initial differ- 
ences in response strength among the groups. 
Analysis of the first trial block indicated a 
| significant difference between the two M Int 
groups and the two L Int groups (F — 13.31, 
df = 1/36, p < .001), but no differences be- 
tween the HNS and LNS groups (F — 3.31, 
df = 1/36), and no interaction among the 
four groups (F — 1.24, df — 1/36). Thus, 
the obtained effects of NS activity were not 
attributable to initial differences in response 
| magnitude for the HNS and LNS groups. 
Subsequent analyses of the data confirmed 
that for Ss in the L Int conditions there was 
no differential effect of NS activity on habitu- 
| ation (F — .58, df — 4/72); GSRs habituated 
at the same rate irrespective of NS level (F 
= 8.68, df = 4/72, p < .01). However, for Ss 
| in the M Int groups only, an analysis of vari- 
ance confirmed the observation that habitua- 
tion was steeper for the LNS group than for 
the HNS group (F = 541, df = 4/72, p< 
.001), and an additional analysis indicated 
that this interaction was determined by the 
fact that the M Int-HNS group showed no 
change across trial blocks (F = .47, df = 4/ 
72), while the M Int-LNS group showed sig- 
nificant change across trial blocks (F — 6.80, 
df = 4/72, p < .001)2 


Relationship between MA Scale Scores and 
NS Activity 


To determine the relationship between the 
two measures of anxiety employed in this ex- 


?In all of the reported analyses concerning main 
effects of trial blocks or any interactions of trial 
blocks with other variables there is a possibility 
inherent in repeated-measures designs of spuriously 
high F ratios resulting from inequalities and asym- 
metries of covariance matrices. In order to rule out 
this possibility, all degrees of freedom in tests in- 
volving trial blocks were adjusted according to the 
modification suggested by Greenhouse and Geisser 
(1959). Multiplying all df values by .57 (the epsilon 
value obtained from the covariance matrix) did not 
alter any of the conclusions derived from the analysis 
of variance, although in some cases alpha levels 
should be placed at the .03 level rather than at .01. 
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periment, a Pearson product-moment correla- 
tion was computed between NS activity levels 
and scores on the MA scale for all 59 Ss. The 
correlation obtained was low, negative, and. not 
significantly different from a chance relation- 
ship (r — —.17). 


Discussion 


In this experiment it was hypothesized that 
moderately intense stimuli would elicit larger 
autonomic responses than stimuli of very low 
intensity for all Ss regardless of anxiety level. 
It was also hypothesized that stimuli of very 
low intensity would elicit ORs for all Ss, but 
that moderately intense stimuli would elicit 
DRs for anxious Ss and ORs for nonanxious 
Ss. Anxiety was defined operationally by MA 
scale scores and NS activity. 

The first hypothesis was confirmed, and the 
subsequent hypotheses were confirmed only 
for the autonomic measure of anxiety. Thus, 
when an index of autonomic lability was in- 
terpreted as a measure of anxiety, expecta- 
tions about the relationship of anxiety to ori- 
enting behavior were sustained, but when a 
self-report index was used to define anxiety, 
predictions were not confirmed. Furthermore 


the self-report index and the measure of auto- ~ } 
` nomic lability were not related. 


The finding that scores on the MA scale 
were unrelated to magnitude and habituation 
of response, while disappointing, is consistent 
with previous findings which indicate that the 
MA scale and autonomic responses are inde- 
pendent (Katkin, 1965; Sarason, 1960). Thus, 
whereas the construct underlying MA scale 
scores has been related to clinical judgments 
of anxiety (Buss, 1955) and to condition- 
ability (J. T. Spence & K. W. Spence, 1966), 
it has not been possible to demonstrate any 
consistent relationship between it and ANS 
measures. 

The finding that autonomic responses typi- 
cal of the DR were elicited from autonomi- 
cally labile Ss and that responses character- 
istic of the OR were elicited from stabile Ss 
confirmed the main hypotheses and enhanced 
the validity of NS activity level as a reliable 
index of individual differences in “anxiety.” 
Support for the notion that anxiety level may 
contribute to the differential elicitation of ORs 
and DRs may be inferred also from Israel’s 
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(1966) recent demonstration that schizo- 
phrenic patients show marked resistance to 
habituation of GSR, while normal controls 
show rapid habituation. Although Israel did 
not comment directly upon the anxiety levels 
of her Ss, it is commonly accepted that 
schizophrenics are more anxious than nor- 
mals. Obviously, it would be of interest to 
investigate differences in levels of NS activity 
among psychiatric and nonpsychiatric pa- 
tients. Further support for the present find- 
ings comes also from a recent report by 
Koepke and Pribram (1966), who found that 
in response to moderately intense tones GSRs 
of autonomically stabile Ss habituated and 
those of autonomically labile Ss did not. In 
their experiment, however, autonomic lability 
was measured by 1,000-ohm rather than 100- 
ohm deflections in skin resistance.* 
Although the present findings suggest that 
NS activity may be a valid predictor of indi- 
vidual differences in autonomic responsivity, 
the generality of these results is limited. 
Clearly, the present contention that “anxiety” 
is being measured by this autonomic index 
requires further validation. To this end, fu- 
ture studies must be directed toward elucidat- 
ing the relationships among NS activity and 
commonly accepted criteria for anxiety, such 
as clinical assessment, task performance, re- 
sponse to stress, and self-report (see Krause, 
1961). Furthermore, although the present 
findings substantiate the hypotheses when 
this measure is conceptualized as an anxiety 


_index, it is clear that the definition is prob- 


lematic. It is obvious that it lacks certain 
traditional characteristics of anxiety. Most 
definitions of anxiety imply that an S’s recog- 
nition of his unpleasant state is essential 
(Freud, 1936); yet Ss are notoriously unable 
to identify or describe their autonomic activ- 
ity (Mandler et al., 1958; Rappaport & Kat- 
kin, 1967). It is possible, of course, that al- 
though Ss are unable to identify their NS 
activity they may be able to report reliably 


8There has been little research comparing the 
frequency of very small amplitude NS activity (100 
oms or less) with larger amplitude NS activity (500 
ohms or more), but there is some evidence that the 
small amplitude responses are independent of basal 
conductance levels (Katkin, 1964), and therefore 
probably not affected as much by resting levels. 


upon aspects of themselves which are related 
to this activity. In that case, it would seem 
fruitful to attempt to delineate those charac- 
teristics of autonomically labile and stabile 
individuals which can be self-observed and 
self-reported. 

To summarize, the present study suggested 
that individual differences in anxiety state 
will determine whether an S shows orienting 
or defensive response patterns to moderately 
intense stimuli. Finally, it was shown that an 
index of autonomic lability can be profitably 
used as a measure of individual differences in 
anxiety state and can predict Ss’ character- 
istic mode of autonomic response, whereas a 
self-report index of anxiety cannot. 
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previous research in this area has indicated 
high reliability of judgments, but little valid- 
ity. However, Kramer (1963) noted that: 
The validity criteria, such as the Bernreuter Per- 
sonality Inventory, which have been most fre- 
quently used in voice and personality studies, are 
often highly imperfect measures of those traits 
that they are used to validate [p. 248]. 


A recent study has supported the validity 
of the relationship between voice quality and 
at least one type of personality trait (Markel, 
Meisels, & Houck, 1964). In this study, rather 
than using a psychological test or inventory, 
speakers were actually either schizophrenic 
or nonschizophrenic. All speakers were hos- 
pital patients, and those classified as “schizo- 
phrenic” were patients on the psychiatric 
ward of the hospital and had been diagnosed 
schizophrenic by two psychiatrists. All speak- 
ers were recorded reading exactly the same 
paragraph, and the semantic differential was 
used to obtain ratings of the speakers’ voices. 
College students using these ratings differ- 
entiated the voices of the schizophrenic and 
nonschizophrenic patients. The positive re- 
sults of this study indicated that voice qual- 
ity is, actually, a significant carrier of a “mes- 
sage” regarding personality and provided the 
impetus to develop a structural method to 
quantify voice qualities (Markel, 1965). The 
advantage of analyzing voice qualities by a 
structural method (e.g., pitch, loudness, 
tempo) rather than by a functional method 
(e.g., dominance, introversion, pleasant) is 
that a structural method can be applied 
meaningfully in any study in which voice 
quality is the behavioral variable. The pres- 
ent investigation employs this method to ob- 
tain voice-quality “profiles” and examines 
the personality traits of groups of speakers 
with different voice-quality profiles. This 
study is seen as a step in the direction of 
relating specific structural aspects of voice 
quality to specific “permanent” personality 
traits. 

METHOD 
Subjects 

The Ss were 78 patients in the Veterans Ad- 
ministration Neuropsychiatric Hospital in Canadai- 
gua, New York. All were male, Caucasian, and 


spoke essentially the same dialect of English. Their 
age range was 20-58, with a mean age of 40.2. 
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The 78 Ss were selected from a total group of 
90 patients who were admitted to the hospital 
from the time this study began. Twelve patients 
were not used as Ss in this study because of (a) 
their inability or refusal to provide an adequate 
speech sample, (b) a foreign accent, or (c) a dialect 
distinctly different from the other Ss. The 12 Ss 
were eliminated before voice qualities were analyzed 
and without any knowledge of the results of their 
psychological tests. 


Personality Measure 


The MMPI, administered to S as part of a bat- 
tery of tests upon admission to the hospital, was 
given in booklet form. All 550 items were scored 
following the standard procedure described in the 
test manual. The Ss’ T scores (K corrected) were 
obtained for the 10 clinical scales, and these T 
scores are the measures of personality structure 
used in this investigation. 


Speech Samples 


The speech samples were obtained as part of 
the admission procedure and at the same time the pa- 
tients were being given other psychological tests. A 
microphone was put on lavaliere fashion, and each 
patient was asked to read the sentence, “Many 
of the reports received last week showed a large 
increase in earnings,” which was typed in capital 
letters. The same tape recorder (a Wollensak No. 


TABLE 1 
RELIABILITY OF RATING VOICE QUALITIES 


Item Pitch Loudness | Tempo 
Interrater reliability* 
Group 1 
Session 1 19 91 91 
Session 2b 82 92 90 
Group 2 
Session 1 12 94 .88 
Session 2b te .93 87 
"Test-retest reliability® 
Group 1 
Sessions 1 and 24 wie .87* 92* 
Group 2 
Sessions 1 and 24 .79* .95* .92* 
Group 1 and Group 2* 48* -78* .91* 


Note.—N = 3 for all groups and sessions under ''Interrater 
reliability.” 

à Ebel (1951), 

» For Group 1, Rating Sessions 1 and 2 were separated by 
4, 1, and 2 days for pitch, loudness, and tempo, respectively: 
For Group 2, Rating Session 2 followed immediately after 
Session 1. 

Pearson r. 

4The means of the ratings for Ss in each session were the 
measures correlated. 

_ © The means of the ratings for Ss by each group in both ses- 
spe eue measures correlated. 
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TABLE 2 


Mean T scores on PrrcH, Loupness, AND TEMPO 
FOR THREE VorcE-QuALITY GROUPS 


Group N | Pitch | Loud | Tempo 
Peak-Pitch 29 56 43 46 
Peak-Loudness 27 47 59 48 
Peak-Tempo 22 45 47 58 


1515) was used for each patient, the volume con- 
trol was set at the same level for each patient, 
and during the period that these patients were 
being recorded the tape recorder was not moved 
or used for any other purpose. Following the col- 
lection of data, the speech samples of the 78 pa- 
tients were dubbed in random order on a “test 
tape.” The logic of this study requires the assump- 
tion that a single speech sample obtained from each 
patient was a typical one for that patient in a 
reading situation. 


Rating Voice Qualities 


In order to assure reliability, there were two 
groups of raters with three raters in each group. 
Group 1 consisted of three undergraduate students, 
and Group 2 consisted of three graduate students 
in the clinical psychology program at the University 
of Florida. None of the six raters had any previous 
training in rating voice qualities. They were trained 
to rate the qualities of pitch, loudness, and tempo 
following procedures which have been previously 
described (Markel, 1965). The rating scales for 
pitch, loudness, and tempo, respectively, were Low- 
High, Soft-Loud, and Slow-Fast. The scales were 
anchored by having a verbal label appear at each 
of the seven scale positions. For example, the scale 
for pitch was extremely Low, quite Low, slightly 
Low, average, slightly High, quite High, extremely 
High. The scale positions were also numbered from 
1 to 7, and the raters used these numbers to indi- 
cate their ratings. 

The speech samples on the training and practice 
tapes had been selected from previous studies so 
that no voice rated during the training procedures 
appeared on the test tapes of the 78 voices used 
in this study. None of the raters had any informa- 
tion as to the nature of the MMPI scores of Ss 
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Fic. 1. Mean T scores on pitch, loudness, and 
tempo for the three voice-quality groups: P = Peak- 
Pitch; L=Peak-Loudness; T = Peak-Tempo. 


whose voices they were rating. Both groups rated 
each voice quality twice. For Group 1, Rating 
Sessions 1 and 2 were separated by 4, 1, and 2 
days for pitch, loudness, and tempo, respectively. 
For Group 2, Rating Session 2 followed immedi- 
ately after Rating Session 1 for each voice quality. 
Interrater reliability for each group at each rating 
session was estimated by the method developed 
by Ebel (1951) and specifically followed the pro- 
cedures described by Guilford (1954). These re- 
liabilities are listed in Table 1. Test-retest reliability 
was determined by the correlation between the 
mean ratings of the 78 Ss. For the between-rating 
session correlation for each group of raters, the 
scores for each S were means of the three ratings 
by that group in each rating session. For the be- 
tween-groups correlation the scores of each S were 
the means of six ratings by the three judges in a 
group in the two rating sessions. These correlations 
are also listed in Table 1. 


Voice-Quality Profiles 


The mean of the 12 ratings given a particular S 
on a particular voice quality was taken as that S’s 
score on that voice quality. In order to compare the 
scores for pitch, loudness, and tempo, the scores 
for the 78 Ss on each quality were transformed to 
T scores (X =50, SD=10). T scores on the three 
parameters were then compared, and each S was 
assigned to one of three groups according to which 


TABLE 3 
MEAN T scores oN MMPI SCALES FOR THE THREE VOICE-QUALITY GROUPS 


Validity scale Clinical scale 
Group N sd 
Lm x | as | D | Hy | Pa | Mf | Pe | Pc | Sc | Me St 
Peak-Pitch 29 | so | 64 | 51 | 65 | 79 | 65 | 69 | 59 | 62 | 72 71 | 60 | 61 | 66 
Peak-Loudness | 27 | 51 | 63 | 53 | 72 | 74 | 72 | 72 | 60 | 65 | 72 73 | 59 | 55 | 67 
Peak-Tempo 22 | 5t | 57 53 | 56 | 68 | 63 70 | 60 | 61 | 62 62 | 61 | 51 | 61 
x 65 | 74 | 67 | 71 | 60 | 62 | 69 | 69 60 | 56 
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one of the three voice qualities had attained the 
highest T score. The groups were labeled Peak- 
Pitch, Peak-Loudness, and Peak-Tempo. The mean 
voice-quality T scores and the N in each of the 
three profile groups are listed in Table 2 and 
plotted in Figure 1. One-way analysis of variance 
was used to evaluate the significance of the differ- 
ence between the groups for each voice quality, and 
for each of the three voice qualities the difference 
between groups was significant beyond the .01 
level, F (2/75) —10.62, 37.89, 14.76, for pitch, 
loudness, and tempo, respectively. The Newman- 
Keuls method (Winer, 1962, p. 309) was used to 
examine the significance of the difference between 
the means of the three groups for each quality. The 
results indicate that for each voice quality the 
group whose peak was on that quality was signifi- 
cantly different from the other two groups, but 
the other two groups did not differ significantly 
from one another on that voice quality. 

The mean ages for the three groups were 38, 43, 
and 40 for Peak-Pitch, Peak-Loudness, and Peak- 
Tempo, respectively. Analysis of variance indicated 
the differences in age among the groups were non- 
significant. 


RESULTS * 


The significance of the difference between 
the 10 MMPI clinical scales for the three 
voice-quality profile groups was evaluated by 
a 3 X 10 analysis of variance for repeated 
measures (Winer, 1962, pp. 374-377). The 
mean T scores for the three voice-quality 
groups for each of the 10 clinical scales are 
listed in Table 3 and plotted in Figure 2. 


1The author is grateful to Jennie L. Grossman, 
associate in research at the University of Florida 
Computing Center, for her assistance in analyzing 
the data. 
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Fic. 2. Mean T scores on the MMPI scales for the 
three voice-quality groups. (Note: Analysis of the 
difference between the groups is based only on the 
10 clinical scales.) 


TABLE 4 
ANALYSIS OF VARIANCE SUMMARY 


Source of Variation SS df MS F 
Ee Eos 77 
oii ity groups 
(A) 4,798 | 2) 2,399} 229 
Subjects (W) 78,652 | 75 | 1,049 
Within Ss 702 
MMPI scales (B) 23,119 | 9 | 2,569 | 21.35* 
AXB 5,351 | 18| 297) 2.47* 
BXW 81,212 | 675 | 120 
*p <01 


(However, note that Table 3 and Figure 
2 include the group means on the validity 
scales L, F, and K, but these scales were not 
included in the analysis of variance.) The 
results of the analysis of variance were as 
follows: Factor A (voice-quality groups), 
ns; Factor B (MMPI scales), F = 21.35, 
df =9/675, p < 01; AX B, F= 2.41, df 
= 18/675, p < .01. The analysis of variance 


TABLE 5 
ANALYSIS OF VARIANCE SUMMARIES 
Scale SS df MS F 
Hs 
Between Ss 2,840 2 1,420 | 6.44** 
5j Within Ss 16,542 75 221 
Between Ss 1,399 2 699 | 1.98 
a Within Ss 26,263 75 350 
» 
Between Ss 1,124 2 562 | 3.85* 
ni Ss 10,956 75 146 
Between Ss 139 2 69 | 0.45 
Within Ss 11,667 75 156 
Mf v 
Between Ss 19 2 9 | 0.07 
4 Within Ss 9,590 75 128 
a 
Between Ss 230 2 115 | 0.67 
AES Ss 12,816 75 171 
Between Ss 1,474 2 737 | 2.55 
5 Within Ss 21,692 75 289 
c 
Between Ss | 1,559 2 779 | 229 
Within Ss 25,510 75 340 
a 
Between Ss 51 2 26 | 014 
EAS Ss 13,936 75 186 
t 
Between Ss 1,313 2 657 | 4.52* 
Within Ss 10,892 75 145 
* 
2$ S01. 
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terms are summarized in Table 4. Inspection 
of Figure 2 indicates how the different 
“shapes” of the MMPI profiles for the three 
voice-quality groups produced the significant 
interaction effect. 

Simple analysis of variance was employed 
to examine the difference between the groups 
on each of the 10 clinical scales. The analy- 
sis of variance terms are summarized in Table 
5. This table indicates significant differences 
between the groups on three scales: Hs (p < 
01), Hy (p < .05), and Si (p < .05). 


Discussion 


The most important finding of this study 
is that the mean MMPI profiles for the three 
voice-quality groups are significantly differ- 
ent. Following the coding rules developed by 
Marks and Seeman (1963), a “best fit” of 
code type for the mean profiles of the Peak- 
Pitch and Peak-Loudness groups is 2-8 and 
3-2-1, respectively. The Marks and Seeman 
actuarial information for code types 2-8 and 
3-2-1 indicates a possible direction of the 
relationship personality traits and voice- 
quality profiles. In actuarial terms, Marks 
and Seeman (1963) stated an individual with 
a 2-8 profile is likely to be diagnosed as 
psychotic (p. 138), have a poor prognosis (p. 
142), and be described as, “Keeps people at a 
distance; avoids close interpersonal relation- 
ships" (p. 138). On the other hand, an indi- 
vidual with a 3-2-1 profile is likely to be 
diagnosed as psychoneurotic (p. 132), have 
a good prognosis (p. 156), and be described 
as, “exhibits depression (manifest sad 
mood)" (p. 152). The profile of the Peak- 
Tempo group, with only Pd elevated, is un- 
classifiable as to code type. 

Another indication of the relationship be- 
tween personality traits and voice-quality 
profiles is seen in the significant statistical 
differences between the voice-quality profile 
groups of Scales Hs, Hy, and Si. Marks and 
Seeman (1963, pp. 307-310) provided a 
chart of actuarial statements regarding vari- 
ous levels of elevation on single scales? The 


?Marks and Seeman (1963, p. 307) attributed 
this chart to Wendell M. Swenson. No specific 
statement is provided in this chart for the normal 
level of elevation on Scale Hy. 


following actuarial statements are taken from 
this chart and are based on the mean T 
scores for the voice-quality profile groups on 
Scales Hs, Hy, and Si. 

Peak-Pitch: (Hs) Slightly more than aver- 
age number of physical complaints. Some 
concern about bodily functions and physical 
health. (Hy) Normal level of hysterical 
symptoms. (Si) Probably reserved in unfa- 
miliar social situations. 

Peak-Loudness: (Hs) Considerable num- 
ber of physical complaints. Prominent con- 
cern with bodily functions. (Hy) Probably 
somewhat immature, egocentric, suggestible, 
and demanding, (Si) Has capacity to main- 
tain adequate social relationships. 

Peak-Tempo: (Hs) Number of physical 
symptoms and concern with bodily functions 
fairly typical for clinic patients. (Hy) Nor- 
mal level of hysterical symptoms. (Si) Has 
capacity to maintain adequate social rela- 
tionships. 

Since the actuarial statements regarding 
both profile differences and individual scale 
differences are based on mean scale scores, 
these statements must be taken as “first ap- 
proximations” of the specific personality 
traits related to specific voice-quality profiles. 
However, the statistical results clearly sup- 
port the hypothesis that the three voice-qual- 
ity profile groups represent three different 
personality types, and these results are a 
strong indication that the voice-quality pro- 
file is a significant behavioral variable. 
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The relationship between body-boundary perception and body site of disease 
was examined. In Experiment I, a body-interior group (10 asthmatics plus 10 
cardiacs) was compared with a body-exterior group (10 arthritics plus 10 der- 
matologic patients). In Experiment II Ss were more carefully screened in terms 
of their medical histories. Measures of psychopathology from the Holtzman 
Inkblot Test and the MMPI were used as covariants in both experiments. The 
results failed to replicate previously reported empirical findings and challenged 
the construct validity of the Fisher and Cleveland model. 


Numerous studies (for a recent summary, 
see Fisher, 1963), based upon the model origi- 
nally developed by Fisher and Cleveland 
(1958), have attempted to demonstrate a re- 
lationship between the degree of definiteness 
ascribed to one’s body boundary and diseases 
with various psychosomatic symptoms. This 
assumed relationship is based upon the as- 
sumption that body image is influential in 
determining the body site at which symptoms 
occur whenever stress causes psychosomatic 
illness. The additional assumption is made 
that psychosomatic illness represents an ex- 
treme manifestation of underlying patterns of 
physiological reactivity. The Fisher and 
Cleveland model is based upon the hypothesis 
that individuals with definite body-image 
boundaries are characterized by relatively 
low reactivity in the body interior (heart, 
lungs). 

Studies which have examined this relation- 
ship have inferred body-image boundaries 
from responses to Rorschach inkblots. Those 
Rorschach responses which were protective, 
covering, or containing (e.g. man in armor, 
turtle with a shell, mummy wrapped up) were 
Scored as Barrier responses. The greater the 
Barrier score, the more definite was the in- 
ferred body-boundary perception. Relatively 
high Barrier scores were reported more fre- 
quently among individuals with body-exterior 
reactions (e.g., neurodermatitis or functional 
paralysis of musculature). 


14 modified version of this paper was presented 
at the meeting of the American Psychological Associ- 
ation, Washington, D. C., September 1967. 

?Now at Washington University. 
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Penetration Rorschach responses are those 
in which there is a violation, disruption, or 
passage through a boundary region (eg. 
crushed bug, bullet entering flesh, worn-out 
flower, vagina). The greater the Penetration 
score, the more indefinite the body-boundary 
perception was inferred to be. A relatively 
larger number of Penetration responses were 
reported to occur in individuals with body- 
interior reactions (e.g., stomach ulcers, bron- 
chial asthma, cardiac disease). Because of the 
following limitations, repeated in practically 
all the work of these authors (Mednick, 
1959), it is extremely difficult to assess the 
adequacy of this body-image theory: 


1. Failure to control for variables which 
are important covariants of Penetration and 
Barrier scores. Barrier scores, for instance, 
appear clearly related to F+% and prob- 
ably to other Rorschach indicators of “ego 
strength" or degree of psychopathology. 

2. The arbitrary development of a scoring 
system for the Rorschach without adequate 
supporting data relevant to its reliability or 
validity. 

3. The consistent comparison of only one 
body-interior disease group (e.g, stomach 
ulcers) against one body-exterior disease 
group (e.g., rheumatoid arthritis). If this con- 
struct is valid, then differences in body- 
boundary perception should manifest them- 
selves across multiple groups of body-interior 
and body-exterior diseases; differences be- 
tween two specific groups might be due to 
other factors (e.g. differences in psychopa- 
thology, chronicity of the disease, or degree 
of disability). 
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The purpose of this study is to consider 
these difficulties and to provide a more 
rigorous test of the hypothesis that individu- 
als with body-exterior reactions perceive their 
body boundary as more definite than those 
individuals with body-interior reactions. 


METHOD 


All Ss were tested with the Holtzman Inkblot Test 
(Holtzman, Thorpe, Swartz, & Herron, 1961) and 
the MMPI. While providing psychometric improve- 
ments over the Rorschach procedure, the Holtzman 
inkblots, at least with respect to Penetration and 
Barrier scores, are almost identical, qualitatively, to 
their Rorschach counterparts. 

The Holtzman inkblots were administered and 
scored by an examiner who was unaware of the 
purpose of the study. Protocols were scored for 
Penetration, Barrier, and Integration. This last vari- 
able is the single most sensitive Holtzman index of 
psychopathology and was examined as one of the 
covariants in the analysis of the data. 

The study was divided into two parts. In Experi- 
ment I, Ss were selected exclusively on the basis 
of their current hospital diagnosis. In the literature 
reported there is usually insufficient information pro- 
vided on S selection. It is assumed here that previous 
studies assigned Ss to the body-interior and body- 
exterior groups on some basis similar to current 
admitting diagnosis. The body-interior group con- 
sisted of two subgroups: 10 Ss with cardiac pathol- 
ogy and 10 Ss with bronchial asthma. The body- 
exterior group also consisted of two subgroups: 10 
dermatologic patients and 10 Ss with theumatoid 
arthritis. 

The data analysis consisted of examining the com- 
bined body-interior groups (asthmatics, cardiacs) 
and combined body-exterior groups (dermatologic 
patients, arthritics) for differences in Penetration and 
Barrier responses. Additional analysis consisted of 
comparing Ss for Penetration and Barrier responses 
while examining the role of psychopathology as a 
covariant. In addition to the Holtzman variable of 
Integration, a psychopathology index from the 
MMPI was used. On the basis of Goldberg’s (1965) 
recent evaluation, the simple linear composite of 
scales L+Pa+Sc—Hy—Pt provides the single 
most valid MMPI index of psychoticism versus 
neuroticism. This index appears to be a continuum, 
ranging from “normal” to neurotic to psychotic. It 
seemed desirable to examine an index of psycho- 
pathology which did not overlap with the Holtzman 
variables. 

Experiment II was designed to determine whether 
the previously reported relationships between body 
boundary and site of body reaction would reappear 
if S selection was more stringent. Accordingly, the 
charts of numerous hospitalized patients were exam- 
ined by a physician experienced in both psychiatry 
and internal medicine. Three groups of eight pa- 
tients each were thus selected. One group was a 
“pure” body-interior disease group in which all of 


the present and past recorded medical problems of 
each S were consistent with a body-interior site. The 
second group was a “pure” body-exterior one. The 
third group of patients was mixed, composed of 
patients whose medical histories indicated that they 
had in the past reacted, or were currently reacting, 
with both body-interior and body-exterior sites. 
This assignment of patients to various diagnostic 
groups was also done independently and without 
knowledge of Ss’ Holtzman protocols. 

In Experiment II the data analysis was essentially 
the same as that for Experiment I. An additional 
refinement was introduced through the use of two 
groups which were “pure” and one “mixed” with 
respect to body site of disease. It was thus assumed 
that differences in body-boundary perception would 
form a continuum which could be inferred from 
three rather than two points. 


RESULTS 


The mean scores on Penetration, Barrier, 
Integration, and MMPI index of psychopa- 
thology for each of the four groups of Ss are 
presented in Table 1. It can be seen from this 
table that Ss with a body-interior site of 
disease differ from Ss with a body-exterior 
site of disease on the Penetration variable and 
on the MMPI index of psychopathology. The 
direction of the relationship between site of 
disease and Penetration responses was incon- 
sistent with Fisher and Cleveland’s (1958) 
theoretical model, however. 

Specifically, comparison of the groups with 
body-interior diseases (asthma, cardiac) with 
the two groups with body-exterior diseases 
(dermatology, arthritis) yielded chance differ- 
ences for both Penetration and Barrier re- 
sponses, Use of Integration as a covariant 
yielded results which approached statistical 
significance, but in a direction opposite to 


TABLE 1 


MEAN SCORES ON PENETRATION, BARRIER, INTEGRA- 
TION, AND MMPI INDEX or PsvcHo- 
PATHOLOGY: EXPERIMENT I 


Subjects 


Variable Body interior Body exterior 


Heart» Asthma»| Total| Arthri-| Dermi- Total 


tiw | titise 
Penetration | 33| 32 | 32| 21| 87 | 54 
Barrier _ 50| 45 | 48| 30| 7.2 | 51 
Integration | 124| 43 | 84| 8.1] 110 9.6 
MMPI index | 36.9 | 363 | 36.6| 46.8 | 48.3 | 47.5 


sN =10. 


>< 
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 Awatysis OF VARIANCE AND COVARIANCE FOR PENE- 


TRATION, INTEGRATION, AND MMPI INDEX: 
EXPERIMENT I 
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TABLE 4 


ANALYSIS OF VARIANCE AND COVARIANCE FOR PENE- 
TRATION, INTEGRATION, AND MMPI INDEX: 
EXPERIMENT IT 


Source SS df MS P Source SS df MS F 
Total 826.78 39 Total 263.33 23 
Error 780.55 38 20.54 Error 217.25 21 10.34 
Treatments 46.22 1 46.22 2.25 Treatments 46.08 2 23.04 | 223 
Adjusted Adjusted 

tegration Integration 

Total 788.22 38 Total 248.76 22 

Error 750.50 37 Error 171.41 20 8.57 

"Treatments 38.02 1 38.02 | 1.87 Treatments 71.85 2 38.67 | 4.51* 
Adjusted Adjusted 

MMPI index MMPl index 

"Total 805.37 38 "Total 674.94 22 

Error 714.88 37 20.94 Error 655.81 20 32.79 

Treatments 30.49 1 30.49 1.46 Treatments 19.13 2 9.57 

*p «.05. 


that predicted by the theory. That is, the 
group with the “interior” body site of disease 
gave fewer Penetration and more Barrier re- 
sponses. Use of the MMPI index of psycho- 
pathology as the covariant also failed to 
differentiate the groups. These analyses are 
summarized in Tables 2 and 3. 

In Experment II the same data analysis 
was carried out with three groups of very 
carefully selected Ss. Two of these groups 
were very homogeneous with respect to having 
éither an interior or exterior body site of 
disease; the third group included Ss with both 
types of diseases. These analyses are sum- 
marized in Tables 4 and 5. It can be seen 
from these tables that even these carefully 


TABLE 3 


ANALYSIS OF VARIANCE AND COVARIANCE FOR BAR- 
RIER, INTEGRATION, AND MMPI INDEX: 
EXPERIMENT I 


selected Ss did not differ significantly in the 
frequency of their Barrier and Penetration 
responses. Penetration-response differences ap- 
proached statistical significance, but in the 
direction opposite to that predicted by the 
theory. When Integration scores were used 
as a covariant, these differences were sta- 
tistically significant (5 < .05). 

In order to better understand the role of 
psychopathology as a covariant of Penetra- 
tion and Barrier responses, the product- 
moment correlations between these two Holtz- 
man inkblot variables and the MMPI index 
of psychopathology were computed. For the 
group with the interior body site of disease, 


TABLE 5 


ANALYSIS OF VARIANCE AND COVARIANCE FOR Bar- 
RIER, INTEGRATION, AND MMPI INDEX: 
EXPERIMENT II 


Source SS df MS F Source SS df MS F 
Total 594.78 | 39 Total 386.50 | 23 
Error 583.55 38 15.36 Error 385.50 21 18.36 
Treatments 1.22 1 1.22 Treatments 1.00 2 .50 
pia Adjusted 
ntegration Integration 

Total 465.64 38 Total 305.42 | 22 

Error 465.63 37 12.58 Error 287.43 20 14.37 

Treatments 005 | 1 .005 Treatments 17.99 | 2 9.00 
Adjusted Adjusted 

MMPI index MMPlIindex 

Total 549.87 38 Total 390.70 | 22 

Error 538.24 37 14.55 Error 385.70 20 19.29 

Treatments | 11.63 1 | 1163 1.25 Treatments 5.00 2 2,50 
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Penetration correlated —.626 with the MMPI 
index and Barrier correlated —.451. Both of 
these correlations are statistically significant 
and of “respectable” magnitude. The corre- 
lations between the MMPI index and Pene- 
tration and Barrier scores for the group with 
the exterior body site of disease were .315 
and —.159, respectively. Neither is statisti- 
cally significant. These findings suggest that 
the relationships between Barrier and Pene- 
tration responses and the index of psycho- 
pathology used are of a chance nature among 
Ss with an exterior body site of disease, 
though they are significant for patients with 
interior-disease body sites. Experiment IT 
yielded similar findings; the correlations be- 
tween the MMPI index of psychopathology 
and Barrier and Penetration responses were 
—.475 and —.639, respectively, for the group 
with the interior-disease body site, and —.475 
and .162 for the group with the exterior body 
site of disease. 

The inverse relationship between the MMPI 
index of psychopathology and Penetration 
responses is surprising. The MMPI index 
achieved greatest accuracy in classifying pa- 
tients when scores of 49 or higher were classi- 
fied as psychotic and scores below 40 were 
classified as neurotic. As Penetration responses 
appear qualitatively pathological, a positive 
relationship between these two variables 
would have appeared more understandable. 


DISCUSSION 


This study has examined the theoretical 
construct postulated by Fisher and Cleve- 
land (1958) which relates perception of body 
boundary with site of disease. The empirical 
data previously reported have provided only 
tangential support for such a link. Aside from 
methodological limitations, results have been 
reported only from the comparison of two 
groups at one time. A more critical test of the 
construct validity of the psychosomatic rela- 
tionship postulated would necessitate a com- 
parison between several groups with body- 
interior disease sites and several groups with 
body-exterior disease sites. In the present 
study, where four such groups were compared, 
there were no differences between the groups 
in body-boundary perception. 

Considering alternate explanations for the 


positive results previously reported, two pos- 
sibilities suggest themselves; one is S selec- 
tion, and the second is uncontrolled sources of 
variance (i.e., other than body-boundary per- 
ception). 

As to S selection, the present authors were 
impressed by the difficulty in finding Ss whose 
medical histories clearly reflected that their 
body disease sites had always been internal or 
external. Many Ss hospitalized with a diag- 
nosis of a disease having a given body site 
were also found to have histories of diseases 
with the opposite body site. After examining 
many such records, § samples with diseases 
whose body sites were “pure,” in the sense 
that they were all either external or internal, 
were found. The body-boundary perception of 
these groups either was not significantly dif- 
ferent or differed in a direction opposite to 
that predicted by the theory. 

Such a finding suggests the need to examine 
sources of variance other than body-boundary 
perception. In this study, one such source, 
psychopathology, was examined. When an 
index of psychopathology from the Holtzman 
inkblots was used as the covariant, the 
groups did show significant differences in 
body-boundary perception, but in a direction 
opposite to that postulated by Fisher and 
Cleveland. The present study thus raises the 
possibility that the differences previously re- 
ported between single groups could have been 
due to differences in psychopathology, rather 
than to body-boundary perception. 

Neither the construct validity nor the pre- 
dictive validity of the Fisher-Cleveland model 
appears to be confirmed. 
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AN INVESTIGATION OF METHODS OF PRESENTING 
INCOMPLETE SENTENCE STIMULI * 


FRANK A. WOOD? 


College of Education, Ohio State University 


One hundred and forty-four coeds completed two halves of a modified Rotter 
Incomplete Sentences Blank (ISB) with the Marlowe-Crowne Social Desirability 
(SD) scale interpolated. Modified ISB halves were constructed using personal 
pronouns (P), proper names (I), or neither (S). The instructions systemati- 
cally varied between conventional and verbal speed test type. No significant 
differences were found using ISB difference scores or SD scale raw scores. 
Using ISB raw scores, differences appeared due to forms but not order or in- 
structions. Form means were significantly different in ascending order S, P, I. 
These findings were cross-validated and interpreted as showing greater projec- 
tion with disguised formats but not instructions. Intercorrelations were dis- 
cussed, including negative correlations of the SD scale with certain ISB forms. 


The complex interaction of stimulus ma- 
terial and its presentation with the motiva- 
tions of the S constitute an inadequately un- 
derstood phenomenon at best. The three 
variables—stimulus, personality, and environ- 
ment—are usually found represented in prac- 
tice as the test, S, and the instructions to S, 
respectively. 

In the monograph by Getzels and Walsh 
(1958) on the PDPQ technique, or the 
method of Paired Direct and Projective Ques- 
tionnaires, students were presented first with 
a sentence-completion test under the guise of 
a verbal speed test. The sentence stems had 
been altered with boys’ and girls’ proper 
names replacing personal pronouns (“I some- 
times think my father is. . . .” became “Ruth 
sometimes thought her father was. .. ."), 
and the order of sentences was changed. Two 
weeks later the students completed the un- 
altered sentence-completion test according to 
the usual test instructions. Responses on both 
forms were scored reliably as positive, neutral, 
or negative. The verbal speed test form was 
found to elicit a significantly higher score 
(more negatives) consistently across grade 


„ This paper is based upon the author's disserta- 
tion, submitted in partial fulfillment of the require- 
ments for the PhD degree at Purdue University. 
The author wishes to thank James D. Linden, com- 
mittee chairman, for his assistance. Special thanks is 
due Rhoda Wood, the author’s wife, for her invalu- 
able assistance in scoring the tests and computing 
results. 
2 Now at St. Lawrence University. 
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levels. It was felt that this indicated less 
guarding and more projection with the im- 
personal form than with the personal form of 
the PDPQ. 

This PDPQ study left many questions un- 
answered. How much of the greater projec- 
tion of the verbal speed test form is due to the 
press of speed and how much to the imper- 
sonal nature of the sentence stems? How ro- 
bust is the effect? Would reversing the order 
clue the students so that they would guard on 
the impersonal form as well? And what would 
happen in the case of nonpersonal stems (such 
as, “Sometimes. . . .” or “The only trouble. 
. . 7)? Little research is available which has 
attempted to answer these questions directly 
(cf. Cromwell & Lundy, 1954; Ellis, 1947; 
Meltzoff, 1951; Sacks, 1949; Stein, 1947). 

Another consideration basic to the current 
research was the sensitivity of the particular 
type of sentence-completion test developed by 
Getzels and Walsh (1958). It was felt that 
sentence stems which were not as specific (not 
as long) offered greater projective possibili- 
ties (Rotter & Rafferty, 1950). A scoring sys- 
tem more rigorous than the positive-neutral- 
negative trichotomy of Getzels and Walsh 
also was suggested as desirable. Further, it 
was speculated that empirical scoring criteria 
would be more sensitive than criteria derived 
rationally. Thus, it was decided that the Rot- 
ter Incomplete Sentences Blank, College Form 
(ISB) would be used as the basic instrument 
in this research. The ISB was convenient in 


( 
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that, with virtually no change of sentence 
stems, it easily was divided into two 20-sen- 
tence halves—one-half with only personal pro- 
noun (P) stems and one-half with only 
standard, nonpersonal (S) stems. A third, im- 
personal (I) form was constructed by replac- 
ing the personal pronouns in the P sentence 
stems with random proper female names. 

A third consideration basic to this research 
was that Getzels and Walsh also discrimi- 
nated between a personal hypothesis (set) 
and the reaction finally expressed by the indi- 
vidual. They believed that the PDPQ tech- 
nique provided a measure of this distinction. 
The Direct Questionnaire, being in self-re- 
port form, elicited the censored, expressed 
reaction, while the Projective Questionnaire, 
“containing exactly the same items phrased in 
the third person,” elicited the projectively re- 
vealed, personal hypothesis. It was reasonable 
to conjecture that any such distinction be- 
tween the personal hypothesis and the ex- 
pressed reaction might have evolved through 
the social motives of the individual, that is, 
by social guarding or the tendency of the indi- 
vidual to respond in a socially desirable man- 
ner. Accordingly, it was felt appropriate to 
obtain for each S in this current study a 
measure of this social desirability tendency. 
The Marlowe-Crowne Social Desirability 
(SD) scale was chosen as the instrument to 
ascertain how much, relatively, each S was 
motivated to respond in a socially desirable 
manner. 

The experimental design, as established, 
included as independent variables the form of 
the stimuli (P, S, or I), the mode of stimuli 
presentation (timed, “verbal speed test" in- 
structions, T; or untimed, “personality test” 
instructions, U), and the orders of the presen- 
tation of the form and modes to Ss. Accord- 
ingly, the following three hypotheses were 
tested, and the significant findings were cross- 
validated: 


1. There will be no significant difference in 
scores due to the different forms of stimuli 
presented. 

2. There will be no significant difference in 
scores due to the different modes of present- 
ing the stimuli. 

3. There will be no significant difference in 


scores due to the order of presenting the forms 
and/or modes. 


METHOD 


The three forms (P, S, and I) were each mimeo- 
graphed with two separate sets of instructions: T 
and U. Thus, there were six possible combinations of 
different tests and instructions: PT, PU, ST, SU, IT, 
and IU. 

The Ss used in the first phase of the experiment 
were 144 female undergraduate students at Purdue 
University. The Ss used in the second phase were 
32 female undergraduate volunteers at Ohio State 
University. Demographic information descriptive of 
each S group corresponded closely. 

Each S in the first phase of the experiment com- 
pleted one pair of sentence blanks from among the 
36 pairs of sentence blanks possible. For practical 
purposes only one set of instructions (T or U) at a 
given time could be administered to a given group of 
Ss. The instructions could differ between the first 
and the second blank administered to each group, 
however. The form of the test (P, S, or I) was 
randomly assigned to S within each administration 
of each blank. Between the administration of the 
first and the second blank, each S completed the SD 
scale, (Neither the instructions nor the form of the 
SD scale ever varied.) 

Each S in the second phase of the experiment 
completed all three sentence blanks, given in the 
order I, S, P. The blanks were administered to as 
many as 10 Ss at any one time under U instructions. 


RESULTS AND DISCUSSION 


The reliability coefficients obtained were 
Form P, .84; Form S, .74; Form I, .73. All 
three null hypotheses were accepted when 
either ISB difference-score data or Marlowe- 
Crowne raw-score data were used (see Table 
1). However, using the ISB raw-score data, 
the first null hypothesis, that there would be 
no difference in score due to the form of the 
test, was rejected. The other null hypothesis 
concerning the possible effects of instructions 


TABLE 1 


F VALUES FOR ÁNALYSES OF VARIANCE 
Ustnc DIFFERENT Input DATA 


Source Analysis I | Analysis II | Analysis III 
Form (F) 241 0.09 58.90* 
Order (0) 0.00 0.00 0.12 
Instructions (I) 0.60 0.09 2.00 

FXO 2.68 0.46 2.00 
FXI 0.05 0.80 0.71 
OXI 0.58 3.23 0.61 
FXOXI 0.20 1.16 0.75 


Pn, 


stems, 
form— perso: 
. Fandom proper female names. 
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or order of presentation was accepted. (The 
level of significance held critical for all tests 
made for all data was defined as .05.) 

The ISB thus was altered in certain ways 
without effecting a significant difference in the 
scores obtained. Likewise, the SD scale showed 
itself to be unaffected by the form, instruc- 
tions, or order of presentation of the ISB. 
However, differences in the form of the ISB 
did produce a significant difference in ISB 
raw scores. It can be surmised that the differ- 
ences in test content, as opposed to the order 
of presentation or the instructions, were 
largely responsible for the score differences 
observed previously with the PDPQ. The 
necessity noted by Getzels and Walsh (1958) 
of administering the impersonal (proper 
name) form first so as not to give Ss a clue 
does not appear to be a real consideration 
here. 

In view of the significant difference ob- 
tained due to form, a Newman-Keuls test for 
ordered mean differences (Winer, 1962) was 
performed to ascertain what specific form- 
score differences were obtained on the ISB. It 
was found that each form was significantly 
different from every other form in the as- 
cending order S, P, I (see Table 2). Form I 
(which was constructed as a proper-name 
variant of Form P) tended to elicit responses 
that were scored higher, that is, more malad- 
justed, according to the Rotter criteria. Form 
S, which contained the totally nonpersonal 
Rotter items, produced the Jeast maladjusted 
Tesponses. 

This difference due to the form of the test 
was further investigated in the second phase 
Of the experiment, where every S took all 
three forms. A £ test revealed that the means 


TABLE 2 


A comparison or Form MEAN Scores FOR INITIAL 
AND Cross-VALIDATION SAMPLES 


Initial sample Cross-validation samples 


Form 
N M N M 
S 96 57.4 32 57.4 
P 96 68.0 32 67.0 
I 96 72.2 32 70.1 


Note.—Abbreviated: S = standard, nonpersonal sentence 
= personal pronoun sentence stems, I = impersonal 
nal pronouns in the P sentence stems replaced with 


TABLE 3 
INTERCORRELATION MATRICES 


Variable P | s | I | SD Scale 
Initial sample 

P 1.00 .40* .46* —.34* 

S 1.00 04 —.24* 

I 1.00 03 

Cross-validation sample 

P 1.00 .41* 24 No data 

S 1.00 —.03 No data 

I 1.00 No data. 


Note.—Abbreviated: S = standard, nonpersonal sentence 
stems, P = personal pronoun sentence stems, I = impersonal 
form—personal pronouns in the P sentence stems replaced with 
random proper: female names. 

$ <$ .05. 


for the form scores, again, were significantly 
different in the ascending order S, P, I (see 
Table 2). 

Correlations were obtained between the 
forms in both the first and second phases of 
the experiment (see Table 3). It can be seen 
that Form P correlated with Form S mod- 
erately, as reasonably might be expected of 
two selected halves of the same test. This was 
true in both the initial and the cross-valida- 
tion samples. Form P and Form I also corre- 
lated moderately on the initial sample which 
was expected due to their similarity. This sig- 
nificant correlation was not replicated, al- 
though a finding of near significance was ob- 
tained. Form S did not correlate with Form I 
in either sample, which gave rise to the specu- 
lation that whatever was similar between 
Forms P and I was not that which was similar 
between Forms P and S. 

Forms P and S also negatively correlated 
with the Marlowe-Crowne SD scale. This was 
expected, indicating that the higher was an 

"s SD scale score, the lower was his standard 
(P or S) Rotter score. As predicted, a higher 
deference to social desirability (higher SD 
scale score) was related to less maladjusted 
(lower) ISB scores. However, it may be ar- 
gued that when the necessity to express per- 
sonal feelings was removed from the ISB (as 
was attempted in Form I) the responses were 
influenced correspondingly less by social de- 
sirability. Such was the case in this study. 
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Form I did not correlate significantly with 
the SD scale. Form I also elicited the highest, 
or most maladjusted, mean ISB scores of any 
of the three forms. 
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ASSOCIATIVE RESPONSE COMPETITION IN SCHIZOPHRENIA* 


ROBERT KAPCHE 
California State College at Long Beach 


The present study investigated Mednick’s hypothesis that schizophrenics are 
especially susceptible to weak associative response competition. Groups of 
schizophrenics and college students (15 per group) each learned one of three 
paired-associate word lists. Weak intralist assocative response competition was 
varied across the lists. Results showed a general performance inferiority for the 
schizophrenic groups. A significant list effect was also found with several of the 
learning measures. There was no support for the hypothesis, however. Normal 
and schizophrenic Ss were equally affected by the response competition. 


Mednick (1958) has presented an hypothe- 
sis based on Hullian concepts as a partial 
explanation of the unintelligible verbal be- 
havior of schizophrenics. He stated that 
schizophrenics have an intense anxiety level. 
The anxiety tends to raise remote-response 
tendencies to stimuli above the evocation 
threshold. In verbal behavior, this results in 
disruptions of established verbal habits due 
to associative response competition. Since 
these disruptions often reduce anxiety, the 
thinking and speaking of remote associations 
become habitual defenses against anxiety. The 
purpose of this study was to investigate this 
hypothesized relative susceptibility of schizo- 
phrenics to weak associative response com- 
petition, by means of a verbal learning task. 

Previous experiments using verbal learning 
tasks to test Mednick’s hypothesis have 
yielded inconsistent results. Both support 
(eg., Kausler, Lair, & Matsumoto, 1964; 
Spence & Lair, 1965) and nonsupport (e.g., 
Donahoe, Curtin, & Lipton, 1961; Spence & 
Lair, 1964) have been reported. None of 
these studies, however, directly investigated 
the effects of weak associative response com- 
petition arising from preexperimental verbal 
habits. This is the type of competition 
stressed by Mednick. Since his hypothesis is 
that schizophrenics have intense anxiety, an 


1 This study is based on a master's thesis submitted 
to Northwestern University. The author wishes to 
thank his thesis advisor, W. A. Hunt, and the staff 
at the Downey Veterans Administration Hospital for 
their cooperation and aid. During the collection of 
the data, the author was the recipient of a National 
Science Foundation Fellowship. The study was pre- 
sented in part at Western Psychological Association 
Meetings, May 5, 1967, San Francisco. 


appropriate test of the hypothesis would be 
to compare schizophrenics and controls on a 
task on which it could be assumed that little 
associative response competition would be 
present under average levels of anxiety. 

The present experiment studied the effects 
of weak intralist associative response com- 
petition on the learning of paired-associate 
word lists by schizophrenic and normal Ss. An 
interaction between S groups and the lists 
was predicted, with the schizophrenics’ per- 
formance becoming relatively more inferior to 
the normals’ as tHe level of assocative com- 
petition increased over the lists. It was ex- 
pected that the manipulated response com- 
petition would be so weak that it would not 
disrupt the performance of the normals. 


METHOD 
Materials 


Three paired-associate word lists were devised, 
each containing five pairs. The stimuli were constant 
across the lists and were approximately equal in 
frequency, according to the Thorndike-Lorge (1944) 
norms. 

In List 1, the control condition, there was minimal 
associative strength for all possible relationships be- 
tween stimuli and responses. With the two remaining 
lists each response was a weak associate to one of 
the stimuli, but was not paired with it. In Lists 2 
and 3, the associated response had been given an 
average of 11.00 and 19.6 times, respectively, accord- 
ing to Russell and Jenkins’ (1954) norms. 

The Russell and Jenkins’ (1954) norms were com- 
bined with those of Bousfield, Cohen, Whitmarsh, 
and Kincaid (1961) and an additional study by the 
author in order to minimize as much as possible all 
other associative relationships among the items on 
the lists. The author’s study collected associations 
to 11 stimuli from 110 college students in intro- 
ductory psychology classes. 

The sets of responses were also equated for word 


76 ROBERT 


frequency, number of syllables, and number of words 
in each list beginning with the same letter. The 
lists are presented in Table 1. 


Subjects 


Forty-five schizophrenic Ss were drawn from the 
population of the Veterans Administration Hospital 
at Downey, Illinois. Forty were males. Most had 
histories of repeated hospitalizations and were diag- 
nosed as chronic undifferentiated schizophrenics. The 
selection criteria were as follows: at least 8 yr. of 
education, no history of a psychiatric diagnosis other 
than schizophrenia, no indications of CNS pathology, 
no histories of alcoholism or drug addiction, and 
the willingness to cooperate and the ability to under- 
stand the instructions, as estimated by ward person- 
nel. Most of the patients were lightly medicated, No 
systematic drug-level control was attempted, since 
Donahoe et al. (1961) found drug level to be un- 
related to performance, and E found patients would 
often avoid taking their medication. A few potential 
Ss were not used, however, due to their being over- 
medicated. The mean age was 39.38 yr.; the mean 
educational level was 11.87 yr. The three subgroups 
of schizophrenics were equated for age and education. 

The normal comparison group consisted of college 
sophomores fulfilling a course requirement. These Ss 
were assigned randomly to the conditions. 

Comparing two such psychiatrically dissimilar 
groups probably should have accentuated any SX 
List interactions present. The two groups probably 
did differ on mean IQ. The available evidence indi- 
cates, however, that a mean difference between 
groups of about 40 points is needed before a clear 
difference in paired-associate learning is found, with 
the brighter group then showing superior perform- 
ance (Zeaman & House, 1967). It is very doubtful 
that such a large difference could be found here, 
since the difference in mean educational level was 
not extreme. The older age of the schizophrenic 
group would also tend to retard performance (eg., 
Canestrari, 1966), although Canestrari's study did 
not find response competition to be affected by age. 


Procedure 


Each S learned one of the lists. The S saw the 
stimulus alone for 2 sec, then the stimulus and 


TABLE 1 
ParnED-AssocrATE ITEMS 
Response 
Stimulus 
List 1 List 2 List 3 

1.swirr | Load (2)* | Stream (5) | Current (2) 
2. BLoSsoM| Bread (1) | Pretty (3) | Smell (5) 
3. STEM Dark (3) | Lamp (2) | Branch (4) 
4. CARPET | Boy lb House (1) | Walk (3) 
5. prrest | Mountain (5) | Holy (4) | Black (1) 


*The number in parentheses corresponds to th: 
of the stimulus paired with it. Sage gent? tue mumgber 
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response together for 2 sec. Then the next stimulus 
was presented immediately. There was no intertrial 
interval, as an asterisk above the fifth response 
marked the end of each trial Four orders of the 
pairs were used. Standard paired-associate instruc- 
tions were given to the normal Ss. These instructions 
were lengthened and simplified for the schizophrenics, 
and efforts were made to relax them before the 
task was begun. The learning criterion was two 
consecutive correct anticipations of the list. 


RESULTS 


The means of the data to be discussed are 
presented in Table 2. The data were sub- 
jected to 2 X 3 factorial analyses of variance, 
the factors being S categories and degree of 
intralist associative competition across lists, 
The first analysis concerned trials to criterion. 
Due to great differences in variance between 
the normal and the schizophrenic groups on 
this measure (Fmax = 14.28, six groups, df = 
14 per group, ? < .01), a logarithmic transfor- 
mation to the base 10 was used on this data. 
The subsequent analysis showed significant 
main effects for S categories (F — 38.13, 
df = 1/84, p < .001) and for lists (F = 4.00, 
df = 2/84, p < .05). The schizophrenics took 
more trials to criterion than did the normals. 
Scheffé’s test for orthogonal comparisons (Ed- 
wards, 1962) revealed that the list effect was 
due to the poorer performance on Lists 2 and 
3, relative to List 1. There was no support 
for an interaction between S categories and 
lists (F < 1.00). 

Analyses of both the mean number of cor- 
rect responses and the mean number of omis- 
sions per trial yielded the same general re- 
sults. Again, there were significant effects for 
S categories and for lists and no noticeable 
trends toward interactions. The schizophrenics 
averaged fewer correct responses and more 
response omissions. Performance was worst on 
List 2, followed by List 3. 

Analysis of the mean number of overt 
errors of all types per trial showed somewhat 
different results. Here the only significant 
effect was for S categories (F = 15.12, 
df = 1/84, p < .01), with the schizophrenics 
averaging more errors than the normals. A 
weak trend toward an interaction was also 
present. The trend was contrary to the pre- 
dictions, though, as it showed the normal Ss 
tending to make more errors as the level of 
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TABLE 2 
j MEANS OF PERFORMANCE MEASURES FOR SCHIZOPHRENICS AND NORMALS ON THE 
ParrED-ASSOCIATE LISTS 
Group 
Schizophrenic Normal 
Measure 
List List 
1 2 1 2 3 

Trials to criterion (raw scores) 18.07 .80 20.13 6.53 10.60 8.13 
Mean correct 3.24 .69 2.80 4.06 3.41 3.69 
Mean omissions 1.17 17 1.66 15 1.27 .93 
Mean errors .62 54 53 49 .30 37 
"Mean percentage of response errors that 

were associative errors — 42 31" — 41> 53> 


aN = 14. 
SN = 11, N = 15 in all other cells, 

response competition increased from List 1 
to List 3, while the schizophrenics did the 
opposite. 

In order to test for possible effects of 
associative competition more directly, all 
errors consisting of the selection of one of the 
four incorrect responses were tabulated in 
Lists 2 and 3. The percentage of these errors 
that consisted of the selection of the associ- 
ative response was also found. If associative 
competition were not present, the percentage 
of associative errors should not have been 
Significantly greater than 25%. Results 
showed, instead, that 42% of all response 
errors were also associate errors, a difference 
Significantly greater than 25% with a two- 
tailed ¢ test (p < .01). A subsequent 2 X 2 
analysis of variance on Ss’ percentage of 
associative errors on Lists 2 and 3 failed to 
reveal any significant effects. 

Response competition arising from extra- 
experimental sources seemed negligible. Only 
nine such responses were emitted by 90 Ss, 
seven of these by the normal Ss. No S emitted 
more than one. 


Discussion 


It is apparent that the present study offers 
no support for Mednick’s (1958) hypothesis 
that schizophrenics will be especially sus- 
ceptible to weak associative response com- 
petition. None of the predicted interactions 
Was significant, or even in the direction con- 
sistent with the hypothesis. The most direct 


test of associative competition failed to dif- 
ferentiate the S categories. Perhaps the most 
striking feature of the data was the rela- 
tive similarity in performance of the schizo- 
phrenics and the normals across the three lists. 

These results also do not support the posi- 
tions expressed in two recent reviews. Both 
Huff (1964) and Lang and Buss (1965) ac- 
cepted the hypothesis that associative com- 
petition is stronger in schizophrenics than in 
normals, 

The performance decrement of the schizo- 
phrenics found here was reflected in a lower 
rate of correct responses and higher rates for 
both response omissions and overt errors. 
These results differ from two recent studies. 
Donahoe et al. (1961) stated that the inferior 
performance of the mostly chronic schizo- 
phrenics they tested was due largely to a 
general failure to respond. Spence and Lair 
(1964), on the other hand, pointed out a 
tendency for their relatively acute sample of 
schizophrenics to make more overt errors, and 
to show performance equivalent to the normal 
controls when comparisons involving correct 
responses were made. 

Noting this discrepancy, Spence and Lair 
suggested that length of current hospitaliza- 
tion may be related to performance differences 
among schizophrenics, with acute patients 
tending to make overt errors and chronic pa- 
tients tending not to respond. This hypothesis 
was tested in the present study by correlating 
months of current hospitalization with both 
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the omission and the error rates. Pearson r’s 
were tabulated for each list separately, then 
averaged. However, neither correlation co- 
efficient differed significantly from zero. 
Thus, Spence and Lair’s hypothesis was not 
confirmed. 

Despite the negative findings of Donahoe 
et al. (1961) concerning drug effects, medica- 
tion may have reduced the anxiety of the 
schizophrenics enough to significantly lessen 
the strength of the assocative competition. 
Indirect evidence against this hypothesis, 
however, is found in a study by Buss and 
Daniell (1967). They also reported results 
contrary to Mednick’s hypothesis, as they 
found no differences in amount of stimulus 
generalization between groups of schizo- 
phrenics either on medication or on placebos 
compared with a group of normals. Their 
experimental design should now be used to 
study associative competition in schizophrenia. 
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THE HYSTERIA AND PSYCHASTHENIA CONSTRUCTS AS AN 
ALTERNATIVE TO MANIFEST ANXIETY AND 
CONFLICT-FREE EGO FUNCTIONS* 


JAMES STEELE 
San Jose State College 


A total of 128 hysteric, psychasthenic, and control Ss, selected with the 
Hysteria and Psychastenia scales of the MMPI, were employed to duplicate 
the performances of the “conflict-free” cognitive styles of leveling and sharpen- 
ing and scanning under anxiety arousal. Psychasthenics performed like sharp- 
eners and scanners on the Embedded Figures and Stroop Color-Word Tests, 
while hysterics performed like levelers. Duplication of the performance of “con- 
flict-free” cognitive styles by anxiety-based personality defenses and the superior 
performance of the psychasthenic or “high-anxious” group on the complex 
Embedder Figures Test suggested modifications in the positions of Gardner on 
cognitive styles and Taylor on the Manifest Anxiety scale. 


An investigation of the influence of hysteric 
and psychasthenic defensive styles upon the 
learning or performance of simple and com- 
plex tasks was undertaken in this study. 

The purpose of the investigation was to 
demonstrate that hysteric and psychasthenic 
defenses can account for some of the findings 
of J. Taylor and K. W. Spence (1952) in 
relation to the task performances of Ss high 
and low on the Taylor Manifest Anxiety 
(MA) scale (J. A. Taylor, 1953) and the 
findings of Gardner, Holzman, Klein, Linton, 
and Spence (1959) in relation to the task 
performances of Ss having the confiict-free 
cognitive styles of leveling, sharpening, and 
scanning. 

It is a basic hypothesis of this study that 
levelers, whom Klein (1954) described as 
having hysterical defenses and tendencies to 
repress, will be primarily low scorers on the 
Taylor MA scale, while sharpeners and scan- 
ners, who have obsessional defenses, will be 
high scorers (Gardner et al., 1959). This 
hypothesis would be based in part on low 
nonsignificant or negative correlations between 
the Hysteria and Psychasthenia scales of the 
MMPI and the reported correlations of .80 
or .90 between the Psychasthenia scale and 
d Taylor MA scale (Erickson & Davids, 

955). 


1This paper is an abridgment of a dissertation 
submitted in partial fulfillment of the PhD degree 
and sponsored by Irwin Sarason at the University 
of Washington, 1961. 
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Gardner et al. (1959) reported that level- 
ers who could be classified as low on the 
Taylor MA scale perform poorly on the 
Witkin Embedded Figures Test because of 
their inability to articulate stimulus configu- 
rations. They also perform poorly on tasks 
requiring memory for detail and size judg- 
ments. Sharpeners, who are primarily profes- 
sional women, perform in a superior fashion 
on the Embedded Figures Test, and scanners, 
who are primarily male Ss, perform in a 
superior fashion on tasks requiring size judg- 
ments. Scanners have great difficulty, how- 
ever, with a simple task, that is the Stroop 
Color-Word Test (Stroop, 1935). There is 
some evidence that levelers have much less 
difficulty with this task (Gardner et al., 
1959). 

Gardner et al.’s (1959) findings, then, sug- 
gest that sharpener and scanner Ss, who are 
potentially high on the Taylor MA scale, can 
perform in a superior fashion on complex 
tasks such as the Embedded Figures Test, 
while low-anxious Ss (levelers) do poorly on 
a complex task. 

This finding is, by inference, a direct con- 
tradiction of the original Taylor and Spence 
position where high-anxious Ss reportedly 
have inferior performance on complex tasks 
as a function, in Hullian terms, of the high 
drive state and activation of task-irrelevant 
and competing habits (J. Taylor & K. W. 
Spence, 1952). More recently, as a result of 
inconsistencies in her findings, Taylor has 
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stated that her theory is incomplete with 
regard to properties other than drive level 
which may differentiate groups with extremely 
high scores on the MA scale from those with 
extremely low scores (J. Spence, 1963). 

A clarification of both the Taylor and 
Gardner-Klein positions is suggested by the 
original interpretation of the Psychasthenia 
scale of the MMPI as a measure of obses- 
sional traits as well as manifest anxiety 
(Hathaway & McKinley, 1951). 

An intellectual feature of the obsessive, in 
addition to manifest anxiety, is vigilant and 
compulsive attention to detail, retentiveness, 
and what may be termed overinclusion of 
stimuli, while hysterics are contrasted as ex- 
cluders who avoid or forget stimuli (Cameron 
& Margaret, 1951). 

These theoretical and clinical positions have 
received experimental support from the studies 
of Erickson and Davids (1955), Truax 
(1957), Truax and Martin (1957), Ruebush 
(1960), Gordon (1957), and Barron (1953) 
where, in general, Ss classified as hysterics or 
repressors tended to forget failures and were 
less detailed in their memory and perceptual 
functions, while the psychasthenic or sensitizer 
Ss tended to remember unpleasant associa- 
tions and displayed more differentiated 
memory and perceptual functions. 

The general exclusion capacity of the hys- 
teric was suggested also in a recent study by 
Wollitsky (1967) in which he found that Ss 
who obtained low-interference scores on the 
Stroop Color-Word Test denied a relationship 
between cigarette smoking and lung cancer to 
a significant degree when compared to the 
high-interference group. He concluded that 
the facility for ignoring intrusive words 
extends to the area of belief. 

Messick and Fritzky (1963), commenting 
on studies of conceptual styles by Witkin, 
Dyk, Faterson, Goodenough, and 
(1962), and Kagan, Moss, and Sigel (1963), 
noted that these studies have produced a di- 
mension of the analytic in contrast to the 
global attitude. Witkin (1965) related global 
and analytic conceptual styles to the defenses 
of repression and isolation, respectively. Ror- 
schach interpretation has long used global or 
whole responses as indicative of hysterical 
traits and detail responses as indicative of 
obsessional traits (Schafer, 1948). 
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From the studies just mentioned one could 
infer that more exact predictions of task per- 
formance could be made if attention were 
paid both to defensive style and to task 
requirements. 

To test this inference it was predicted that 
psychasthenic Ss or high-anxious Ss would 
perform in a superior fashion on a complex 
task, the Witkin Embedded Figures Test 
under stress, in contrast to a Taylor predic- 
tion of inferior performance. It was also pre- 
dicted that hysteric Ss, who are manifestly 
low-anxious Ss, would perform in an inferior 
fashion on the same complex task and in a 
superior fashion a simple task, the Stroop 
Color-Word Test, in contrast to the Taylor 
position. Superior or inferior performances 
would thus depend upon whether the task 
is compatible with the intellectual differentia- 
tion, retention, and attention to detail of the 
psychasthenic, as in the case of the Witkin 
test, or compatible with the lack of retentive- 
ness, differentiation, and attention to detail 
of the hysteric, as in the case of the Stroop 
test. 

The introduction of stressful instructions to 
increase levels of anxiety was included to in- 
crease the defense and magnify these pre- 
dicted results. Statistically, this would occur 
as a significant Defense X Instructions inter- 
action. 

Duplication of the performance of levelers 
and of sharpeners and scanners by hysterics 
and psychasthenics could be predicted by 
Gardner et al. (1959), but these authors 
would not predict systematic changes under 
experimental stress, as cognitive styles are 
conflict free and not derived from anxiety 
reduction. 

In testing the hypothesis above, groups of 
hysteric, psychasthenic, and control Ss were 
exposed to the Embedded Figures and Stroop 
Color-Word tests. It was predicted that con- 
trol Ss would not show systematic changes 
predicted for the experimental groups. 

Controls for intelligence were included to 
prevent the confounding of intelligence and 
defenses. 


A METHOD 
Selection of Subjects 


The variables of concern to the study were incor- 
porated into a 4X2X2X2 factorial analysis of 
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variance design in which defenses were varied in 4 
ways, sex in 2, instructions in 2, and order of task 
administration in 2 ways. It can be seen from this 
arrangement that 32 cells are involved in the design 
and that, with a minimum of 4 Ss to a cell, 128 Ss 
would be required for the design. The design re- 
quired an analysis of variance for scores obtained 
on both the Witkin and Stroop tests. 

The Ss were selected from 942 beginning students 
in psychology and sociology on the basis of a score 
in the upper or lower 30% of a distribution of scores 
obtained from the administration of the Hy and Pt 
scales of the MMPI to these students. The Ss were 
administered the Hy and Pt scales by a representative 
of E. Two weeks later the same students were asked 
to submit their names for a one-in-five random 
drawing of Ss to standardize some tests. 

Four combinations of defenses were obtained from 
the Hy and Pt measurements, individuals who were 
high on one scale and low on the other or indi- 
viduals high on both scales or low on both scales. 
High on the Hy scale was a raw score of 24 or 
above and 19 or above on the Pt scale, Lows on 
both scales were 15 and below and 9 and below, 
respectively. It was difficult to obtain Ss who were 
high and low on both scales. 


Independent Variables 


The major independent variable was neutral versus 
stressful instructions. Other independent variables 
were defenses, order, and sex. Neutral or stressful 
instructions were given prior to the standard instruc- 
tions which are provided for the Witkin Embedded 
Figures or Stroop Color-Word Tests. One major 
chnage in the standard instructions for the Witkin 
test was made. The Ss were not informed that they 
could ask to look at the simpler designs again if 
they so desired. This change was introduced on the 
basis of Ruebush’s (1960) finding that high-anxious 
Ss asked to see the designs more frequently than 
low-anxious Ss. In addition, Gardner et al.’s (1959) 
data indicated that levelers forgot the design more 
frequently than sharpeners. To eliminate the possible 
effects of these tendencies on performance, Ss were 
reexposed to a simple design every 60 sec. until they 
reached a solution. Failure-induced anxiety was 
avoided by informing Ss that the usual procedure 
was to let them see the design again at regular 
intervals. 


Dependent Variables 


Dependent variables were scores obtained from 
performance on the Witkin Embedded Figures Test 
and the Stroop Color-Word Test. 

Scores for the analysis of the Witkin test were 
derived by obtaining the mean solution time per 
figure among the 24 figures for each S. 

Scores for the Stroop Test were obtained by the 
method suggested in the Gardner et al. (1959) mono- 
graph. Each S’s score was derived by determining 
the difference between his predicted and obtained 
scores on the third phase of the Stroop test. This 
score is a measure of interference. Predictions were 


made on the basis of a regression equation derived 
from the correlation between the second and third 
phases of the Stroop test. Scores obtained directly 
by time on the third phase of the test were also 
employed. 

Each S was administered both of the task discussed 
above to make the most efficient use of Ss. Analysis 
of the data for a possible order effect was arranged 
by alternating the order of administration of the 
tasks. 

Sex differences on the Witkin test have been estab- 
lished by Witkin (1950). Control for this anticipated 
difference in performance was obtained by assigning 
an equal number of each sex to each experimental 
condition, 


Intelligence 


Control for intelligence was accomplished by re- 
stricting the range of intelligence of Ss markedly 
below the mean obtained American Council on Edu- 
cation Test of Intelligence (ACE) scores for experi- 
mental and control groups used in the study (revised 
by the University of Washington Counseling Center, 
1960). This resulted for the most part in the removal 
of several male and female hysterics. After selection 
or exclusion on the basis of these scores, £ tests 
revealed no significant differences among mean total, 
quantitative, and verbal ACE scores of Ss assigned 
to the various experimental and control groups. 


Experimental Arrangements 


The Ss were administered the Witkin and Stroop 
tests individually. Each S was seated facing a 2 X 3 
ft. vertical plasterboard placed on a table upon which 
were mounted two identical blue notebooks contain- 
ing the respective tests. The Z timed S's performance 
with a stopwatch and recorded his score on record 
sheets prepared for that purpose. 

The Ss were screened for color blindness before 
they received instructions. After this inquiry, Ss 
received either the natural or stressful instructions 
followed by the standard instructions for the test. 

Stressful instructions were as follows: 


Each of these notebooks that you see in front of 
you contains a test that has been used to evaluate 
effective decision-making in a short period of 
time. Originally the tests were used to select 
executive personnel. As you know, a person in 
an executive position often has to make an ef- 
fective decision without having a lot of time to 
collect information on a subject. The tests have 
also been used in test batteries designed to measure 
psychological adjustment since effective decision- 
making is one measure of psychological adjustment. 
Our present interest, however; is to use the tes 
to predict a student’s grade point average duri 
his senior year. Before we begin, could you tel 
me the GPA that you expect to earn during your 
senior year. 


(The prediction was obtained, and then the stu- 
dent was asked his current. GPA.) Following this 
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exchange, S was asked if he planned to graduate from 
this university. 
Neutral instructions were as follows: 


The two notebooks you see in front of you each 
contain a test that has been developed for general 
experimental purposes. At this point we are simply 
interested in establishing some norms or averages 
for these tests since we don’t know how college 
students will perform on them. 


The effectiveness of the stressful instructions was 
evidenced by very noticeable increases in random 
motor behavior, for example, shifting in the chair, 
arm, hand, and facial movements. 

Administration of the tests under neutral condi- 
tions should be interpreted as a moderately stressful 
condition since the tests are timed and relatively 
difficult to perform. 


RESULTS 


An analysis of variance conducted with the 
four defense categories resulted in a signifi- 
cant F as a result of the superior perform- 
ances of male Ss on the Witkin test and 
a significant Instructions X Sex interaction 
(table not included). The superior perform- 
ance of male Ss was consistent with Witkin's 
(1950) findings. The significant Instructions 
X Sex interaction can be attributed to the 
superior performance of men under neutral 
instructions, while females tended to dis- 
play superior performance under stressful 
instructions. 

Since hysteric and psychasthenic perform- 
ance differences were obscured by the less 
extreme performances of the control groups in 
the analysis of the four groups (see Table 1), 
it is necessary to turn to the analysis of the 
high-low (hysteric) and low-high (psych- 
asthenic groups alone to provide support for 
the basic hypotheses advanced at the begin- 
ning of this study. 


TABLE 1 


EAN SCORES: TOTAL EMBEDDED FIGURES FOR EIGHT 
EXPERIMENTAL GROUPS 


Hysteric Psychasthenic 


Control Ss Control Ss 


LLS 
29.62 


HLN 
29.81 


HLS |LHN 
40.93 | 33.68 


LHS 
25. 


HHN 
33.06 


HHS 
35.24 


LLN 
M |33.93 


Note.—Abbreviated: L = low, H = high, N = neutral in- 
structions, S — stressful instructions. 


TABLE 2 


ANALYSIS OF VARIANCE FROM SOLUTION TIME oF 24 
EMBEDDED FIGURES FOR Two DEFENSE Mops: 
HYSTERIA AND PSYCHASTHENIA ONLY 


Source of variance df MS F 
Between subjects 

Defense (À) 1 581. 3.489 

Instructions (B) 1 23.76 — 

Sex (C) 1 669.51 4.021* 

Order (D) 1 301.89 1.81 
AXB 1 1,571. 9.435** 
AXC 1 142. .852 
AXD 1 180. 1.081 
CXD 1 722.58 4.339 
BXC 1 1,130.73 6.791** 
BXD 1 425.55 2.555 
AXBXC 1 46.65 .280 
AXBXD 1 17.04 .102 
BXCXD 1 628.67 3.775 
AXCXD 1 76.40 458 
AXBXCXD 1 258.27 1.551 
Within subjects 48 166.5 — 

Total 63 
*p <05. 
LT. $ 01, 


Basic interest should center upon the highly 
significant Defense X Instructions interaction 
(Table 2) which is significant beyond the 
01 point of probability after correction for 
heterogeneity of variance (Lindquist, 1953). 
This significant interaction provides major 
support for two basic hypotheses of the 
study: that the personality defenses of hys- 
teria and psychasthenia can be used to predict 
performance on the Embedded Figures Test 
and that one manifestation of this relation- 
ship is a significant Defense x Instructions 
interaction. 

Mean scores for the four defensive modes 
when performing under neutral and stressful 
conditions are presented in Table 1. Inspec- 
tion of this table indicates that the significant 
interaction obtained from the analysis of the 
hysteric and psychasthenic groups can be at- 
tributed to the poorer performance of the 
hysteric group under stress and the improved 
performance of the psychasthenic group under 
the same condition. In this case, the inter- 
action can be attributed to the inferior per- 
formance of the females under neutral condi- 
tions. In addition, the hysteric males are 
inferior under stressful conditions. 

Differences in performance which are char- 
acteristic of the hysteric and psychasthenic 
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TABLE 3 


MEAN OBTAINED SCORES IN SECONDS or Eromr EX- 
PERIMENTAL GROUPS ON STROOP 
Cotor-Worp TEST 


Control Hysteric | Psychasthenic| Control 


HHS 
104.50 


LLN 
M |99.06 


LLS | HLN 
99.62 | 94.25 


HLS | LHN 
96.87 | 107.43 


LHS | HHN| 
100.7 | 95.62 


Note.—Abbreviated: L =low, H = high, N = Neutral 
instructions, S = stressful instructions. 


groups under neutral and stressful conditions 
do not occur in the low hysteric-psychasthenic 
or control group (see Table 1). This finding 
adds additional significance to the systematic 
fluctuations of the hysteric and psychasthenic 
groups. It is of interest, also, to note that the 
high hysteric-psychasthenic control group does 
not differ under neutral and stressful condi- 
tions in the same systematic manner as the 
two experimental groups. 

As an additional point it is obvious from 
the preceding data that the predicted differ- 
ences between the hysteric and psychasthenic 
groups did not materialize until the introduc- 
tion of stress. While this finding fails to 
confirm one hypothesis of the study—that 
hysterics and psychasthenics would differ 
under neutral conditions—it may be a 
function of controls exercised for intelligence 
in this study. 


Stroop Color-Word Test Results 


Inspection of the distribution of scores on 
the Stroop Color-Word Test indicated hetero- 
geneity of variance. The procedure suggested 
by Lindquist (1953) was also applied to the 
data obtained from this analysis. 

Although the original design of this study 
provided only for an analysis of scores based 
upon the difference between the predicted and 
obtained scores on the third phase of the 
Stroop test as suggested by Gardner et al. 
(1959), an additional analysis was conducted 
to compensate for possible shortcomings in 
the scoring system listed above. The addi- 
tional analysis used time scores for the third 
phase of the Stroop test as a direct measure 
of interference. 

The only significant finding resulting from 
an analysis of variance of all four experi- 
mental groups using the first method, differ- 


ence scores (table not included), was the 
order effect. The significant order effect was 
a result of the higher interference scores 
obtained under Order I administration of the 
tests. This order involved the administration 
of the Stroop test as the second test and 
might be interpreted as the effect of fatigue 
or accumulated anxiety over performance on 
the preceding test. Inspection of the perform- 
ance of hysterics and psychasthenics revealed 
the mean difference scores of the hysteric and 
psychasthenic groups differ in the predicted 
manner, but these differences failed to reach 
statistical significance (table not included). 

Table 3 presents the mean time rather than 
difference scores of the entire eight groups of 
Ss of interest to the study. Table 4 presents 
the results of an analysis of variance using 
scores from the hysteric and psychasthenic 
group only. This analysis indicates that 
the psychasthenic group displayed higher 
interference. 

The significant main effect between de- 
fenses is not accompanied by a significant 
Defense X Instructions interaction (see Table 
4). None of the other variables shows signifi- 
cant effects. Support for one basic hypothesis 
of the study, that the psychasthenic group 
would show higher interference and the hys- 


TABLE 4 


ANALYSIS OF VARIANCE OF Stroop Cotor-Worp TEST 
OBTAINED SCORES: HYSTERIA AND 
PSYCHASTHENIA GROUPS ONLY 


Source of variance df MS m F 
Between subjects 

Defense (A) 1 1,164.51 5.34* 

Instructions (B) 1 66.01 — 

Sex (C) 1 135.89 — 

Order (D) 1 708.89 3.25 
AXB 1 348.50 1.59 
AXC 1 461.85 2.118 
AXD 1 16.60 — 
CXD 4 437.60 | 2.00 
BXC 1 94.30 = 
BXD 1 321.3 147 
AXBXC 1 7410 — 
AXBXD 1 212.12 973 
BXCXD 1 98.37 — 
AXCXD 1 745 — 
AXBXCXD 1 166.56 ceu 
Within subjects 48 218.06 

Total 63: 
*p«.05. 
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teric, a lower interference on the third phase 
of the interference test, was obtained. 

Of interest, also, are the relationships be- 
tween the means where the lowest means 
(lowest interference) are obtained by Ss in 
the high Hy groups, who are in effect high 
scorers on the Hy or repression variable. 


Discussion 


The superior performances of the psych- 
asthenic group on the Embedded Figures Test 
under stress and the superior performance of 
| the hysteric group on the Color-Word Test 
| support the importance attached to personal- 
ity defenses at the beginning of this study. 

The significance of the systematic differ- 
| ences is also supported by the control groups, 
who did not show superior or inferior per- 
formance from task to task. 

Statistically the predicted Defense X In- 
structions interaction has been confirmed. Per- 
formance of the hysteric group showed a 
|| significant decrement under stress, while the 
| psychasthenic group tended to show a signifi- 
cant increment in performance under stress on 
a complex task, the Embedded Figures Test. 
|| In terms of defensive processes, the superior 
|| capacity of the hysteric to exclude stimuli 
and reduce interference from competing stim- 
| uli appears to be an asset in the Color-Word 
Test and a handicap in the Embedded Figures 
Test, while the inclusive capacity of the 
psychasthenic produces directly opposite re- 
sults when the inclusive defenses are activated 
by stress. 

As a general conclusion it appears that 
| there has been a tendency toward over- 
implification of the effects of personality 
imensions upon task performance. It is dif- 
ficult to assess these effects by using single 
variables such as drive level as originally 
suggested by J. Taylor and K. W. Spence 
|| (1952). Specific consideration of task per- 
formance as an interaction between personal- 
|| ity defenses and task requirements has made 
|| it possible to produce superior performance 
|| on a complex task by high-anxious (psych- 
|| asthenic) Ss. Since these findings directly con- 
tradict the Taylor-Spence position, this study 
|| supports interpretation of high and low scores 
|| on the Taylor scale as measures of defensive 


processes rather than of drive. Insofar as tests 
of Hullian theory are tied to the Taylor scale, 
this study also indicates a need for modifica- 
tion of Hullian drive theory to take account 
of traditional clinically derived personality 
defenses. 

An essential result of the study has been 
the demonstration of differences in perform- 
ance as a function of personality defenses on 
tests employed by Gardner et al. (1959) to 
demonstrate cognitive controls. While the 
prediction that nonstress differences could be 
demonstrated on the Witkin and Stroop tests 
was not established for the Witkin test, the 
control for intelligence in this study and the 
consequent elimination of some low-scoring 
hysterics who might perform poorly on the 
Witkin test may be sufficient to explain the 
failure to duplicate Gardner et al.’s (1959) 
cognitive controls of leveling and sharpening 
and scanning under nonstress conditions. 

A functional relationship has been obtained 
between the arousal of anxiety and perform- 
ance in the hysteric and psychasthenic groups 
comparable to Gardner et als (1959) levelers 
and sharpeners, respectively, on the Em- 
bedded Figures Test. This anxiety-based rela- 
tionship casts doubt on the independence of 
cognitive controls from conflict and anxiety 
and suggests that cognitive controls may be 
a manifestation of the specific performance 
characteristics of a particular defensive style. 
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A PROCEDURAL CRITIQUE OF “DESENSITIZATION AND 
THE EXPERIMENTAL REDUCTION OF THREAT” 


GERALD C. DAVISON 
State University of New York at Stony Brook 


It is suggested that conclusions from an experiment by Folkins, Lawson, Opton, 
and Lazarus be tempered with respect to two procedural flaws: (a) Their 
control for relaxation unpaired with aversive stimuli is unsatisfactory because 
Ss relaxed while viewing the test stress film; and (b) Ss in the “simulated 
desensitization” condition were not allowed to escape aversive images when 
they became disturbing, this error in design violating the principle in desensi- 
tization that aversive scenes be paired with relaxation, not with emotional 
upset. The results, therefore, cannot be taken as evidence either for or against 
the importance of any desensitization variables or combinations thereof. 


The experiment by Folkins, Lawson, Op- 
ton, and Lazarus (1968) raises important 
questions regarding the largely neglected role 
of cognitive processes in “behavior therapy 
techniques” such as Wolpe’s (1958) system- 
atic desensitization procedure. Close exami- 
nation of their methods, however, reveals 
two problems which seem to justify consider- 
able caution in interpreting their experi- 
mental results. 

Their principal surprising finding was that, 
on several measures, those Ss who had under- 
gone an analogue of the clinical desensitization 
procedure did not fare as well as either those 
who were merely relaxed or those who re- 
hearsed forthcoming stressful incidents with- 
out benefit of relaxation. Thus, a question 
Folkins et al. are asking is very close to that 
investigated by this writer (Davison, 1968); 
namely, is it necessary to pair anxiety-com- 
peting relaxation with imaginal aversive 
stimuli in order to effect reduction of fear 
(or stress), or can one account for the dem- 
onstrated efficacy of Wolpe’s procedure in 
terms of relaxation alone and/or in terms of 
imagination of aversive scenes without re- 
laxation (“extinction effects,” or, as Folkins 
et al. termed it, “cognitive rehearsal”)? The 
present writer found (as did Rachman, 1965, 
in a similar pilot study) that only when im- 
aginal aversive stimuli are paired with re- 
laxation, that is, during systematic de- 
sensitization, does significant reduction in 
avoidance behavior occur. The Ss who re- 
laxed while imagining scenes unrelated to 
their fears and Ss who imagined aversive 


scenes without relaxation did not improve 
significantly. 

Although the Folkins et al. study is not 
intended as a replication of Davison (1968), 
the fact that their closely related findings 
are so radically different seems quite im- 
portant. Let us therefore examine the pro- 
cedures more closely. 

In their Condition II—'"Relaxation"—they 
hoped to evaluate the effects of relaxation 
alone (ie. without pairing with aversive 
stimuli) in reducing stress reactions to an 
upsetting film. The Ss were given two ses- 
sions of relaxation training by means of a 
recording by A. A. Lazarus (upon which 
this writer based the training procedures in 
his own study), and in the third test session 
they were shown the film immediately fol- 
lowing a third practice session with the re- 
laxation tape. The film itself was preceded 
by these instructions: 


It is felt that the relaxation you are now experi- 
encing and the techniques you have learned to con- 
trol these sensations can be used to your advantage 
to handle the somewhat stressful movie film you 
are about to see. Just remain as relaxed as you can. 
The more you are able to relax, the more you WP 
enjoy the experience [p. 103]. 


Thus, one finds, Ss instructed to rela* 
during the presentation of actual aversive 
stimulus material. This procedure is virtually 
identical to what has been occasionally re- 
ported (e.g., Clark, 1963) with clients who 
are either unable to generate clear images 
for systematic desensitization or with Ss un- 
dergoing what is usually referred to as "in 
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vivo desensitization” (cf. Davison, 1965). 
The crucial fact is that, in contrast to the 
*Relaxation Group" of Davison (1968), 
which imagined unrelated scenes while re- 
laxed, Folkins et al's "relaxation group" 
seems to have experienced the apparently 
crucial pairing of anxiety-competing relaxa- 
tion with stressful stimuli. It is submitted, 
therefore, that this is an unsatisfactory con- 
trol for relaxation alone. 

A second problem, which Folkins et al. 
alluded to, is that, because the "simulated 
desensitization" was conducted via tape re- 
cording, “Ss in the desensitization condition 
may have been sensitized rather than de- 
sensitized to the threat scenes [p. 111].” 
That is, unless the person being desensi- 
tized can signal to the therapist-E that 
a given scene is disrupting the relaxation, 
one is setting the stage for a pairing of im- 
aginal aversive stimuli with emotional upset, 
rather than with emotional calm. Since the 
design of their Desensitization condition does 
not fulfill even the most basic requirements 
of Wolpe’s (1958) original procedure, it does 
not seem reasonable to regard the outcomes 
of this particular experimental group as re- 
flecting anything at all upon the efficacy of 
the technique or upon the variables under- 
lying its efficacy. (See Davison, 1968, for 
a discussion of a similar problem in one of 
his control groups; also see Schubot, 1966, 
for a procedural modification which handles 
the problem.) Lang (in press) has indepen- 
dently voiced this concern. 

In spite of what the present writer feels 
to be serious methodological flaws in this 
study, the questions raised about the role 
of cognitive processes in behavior therapy, 
and particularly in systematic desensitiza- 
tion, are intriguing and important. While this 
writer (Davison, 1968) interprets his main 
findings as support for a counterconditioning 
Process underlying the beneficial outcomes 
of systematic desensitization, a later study, 
by Valins and Ray (1967), has raised a 
rather compelling alternative explanation for 
the beneficial outcomes of Wolpe’s procedure. 
Valins and Ray suggested that, by relaxing 
in the face of hitherto disturbing images, 


Ss reevaluate the significance of these events 
(ie., change their cognitions) and subse- 
quently (consequently as well?) manifest sig- 
nificant reduction in avoidance behavior. 
This is not dissimiliar to London’s (1964, 
pp. 130-131) Tolmanian interpretation of 
desensitization, whereby, 


The closer the imagery comes to representing “real” 
experience of the most complete sort without being 
followed by the actual experience it simulates, the 
more the patient’s expectation of disastrous action, 
with its disastrous consequences, is reduced. . . . 


It is the task of future carefully designed 
experiments to ferret out the active ingredi- 
ents of this fascinating procedure in such a 
fashion as to allow some choice among alter- 
nate explanations. 
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A REPLY TO DAVISON’S CRITIQUE 


CARLYLE H. FOLKINS, KAREN LAWSON EVANS, EDWARD M. OPTON, Jr. 
AND RICHARD S. LAZARUS 
University of California, Berkeley 


We disagree with both of Davison’s (1968) 
criticisms of our paper, “Desensitization and 
the Experimental Reduction of Threat” 
(Folkins, Lawson, Opton, & Lazarus, 1968). 

Davison’s first criticism is that our “relaxa- 
tion” treatment included elements of desensi- 
tization. He writes, “The crucial fact is that 
. . . Folkins et al.’s ‘relaxation group’ seems 
to have experienced the apparently crucial 
pairing of anxiety-competing relaxation with 
stressful stimuli [Davison, 1968, p. 87].” 
This “procedural flaw” might therefore ac- 
count for the fact that our relaxation treat- 
ment was as effective as our desensitization 
treatment, whereas Davison’s (1965) Ss, in 
a design free from this “flaw,” benefited only 
from his desensitization treatment, and not at 
all from his relaxation treatment. 

In fact, however, it was Davison’s pro- 
cedure that permitted “. . . the apparently 
crucial pairing of anxiety-competing relaxa- 
tion with stressful stimuli” during training. 
Davison (1965) exposed his Ss to a snake as 
a test for initial strength of snake avoidance. 
Then he administered experimental treat- 
ments: relaxation, desensitization, etc., and 
finally he readministered the snake-avoidance 
test. Davison’s Ss knew, at the time of their 
relaxation treatment, that the purpose of the 
treatment was to reduce their fear of ap- 
proaching snakes. Indeed, “The incentive for 
participation was [in part, desire] to be rid 
of a particular fear . . . [Davison, 1965, p. 
18].” In consequence, Davison's Ss had ample 
opportunity and motivation to think about 
snakes during the relaxation training and 
hence to contaminate the relaxation treatment 
with elements of desensitization. In contrast, 
our design avoided this particular procedural 
flaw. Our Ss knew only that they would see 
a somewhat stressful film, but they knew 
nothing about the film content until they 
actually saw it. Thus, our procedure more ade- 
qately isolated relaxation from desensitization. 
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Our Ss were explicitly encouraged to make 
use of their relaxation training in dealing 
with the stressful film; Davison’s Ss were 
implicitly encouraged to do so. In this respect 
the relaxation treatments of both Davison 
and the present writers resembled the clinical 
application of relaxation therapy, and both 
were clearly quite different from systematic 
desensitization therapy. The question remains 
open as to why our relatively “pure” relaxa- 
tion treatment was effective, while Davison’s 
relaxation treatment, which may have been 
contaminated to some extent by what he 
calls “in vivo desensitization,” was ineffective. 
Our data remain a challenge to Rachman's 
(1967) conclusion, based on a review of em- 
pirical studies (including Davison’s), that 
pairing of aversive imagery with relaxation 
during training is necessary in order to 
produce salutary effects. 

Davison’s second criticism was anticipated 
and dealt with in our original discussion 
where we pointed out that, since the pro- 
cedure was standardized, Ss in the simulated 
desensitization condition could have experi- 
enced increased anxiety rather than relaxation 
while imagining the threat scenes. But this 
aspect of the simulated desensitization treat- 
ment is equally true of the cognitive rehear 
condition, in which the same threat scenes 
were presented without relaxation. Yet the 
beneficial effects of the cognitive rehearsal 
condition were greater than those of either 
our desensitization or relaxation conditions. 
Thus, our data are inconsistent with the pos- 
sibility suggested by Davison that lack of 
procedural flexibility might have accounte 
for the failure of the full desensitization pro- 
cedure to be superior to its relaxation OT 
cognitive rehearsal components. 

If Davison's criticisms were valid, they 
would vitiate the challenge which our data 
present to the reciprocal inhibition interpreta- 
tion of the efficacy of desensitization therapy. 


= 


REPLY To Davison 89 


Since the data do remain an embarrassment to 
the reciprocal inhibition theory, other theo- 
retical formulations must be explored. Our 
conclusion that the role of cognitive processes 
is probably an important one is a conclusion 
with which Davison seems to agree. 
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STAFF ATTITUDES TOWARD PATIENT CARE 
AND TREATMENT-DISPOSITION BEHAVIOR 


ALAN S. DEWOLFE, ROBERT P. BARRELL, anp FRED E. SPANER? 
Veterans Administration Hospital, Downey, Illinois 


Treatment milieus (staff attitudes) were related to treatment-disposition be- 
havior in four autonomous units with comparable staff and patients. Only 
measures on which units showed significant differences or ratios of these mea- 
sures were analyzed. Attitudes toward rules (p< .01), informing patients 
(p «.001), impact of staff attitudes (p< 01), and patients’ characteristics 
(p < 05), and measures of use of restraint and seclusion (p < .01), trial visits 
(P «.05), and maximum hospital benefit discharges (p< .001) met this cri- 
terion. The main focus was on three related attitudes interpreted as indicating 
openness, frankness, and trust in interactions with others and their significant 
relationships to an index of trust in patients’ capabilities (p < .05 in all cases). 
These attitudes appeared sufficiently important in the treatment climate to 


merit further exploration. 


Recently, considerable research energy has 
been turned to a multidimensional approach 
to measuring staff attitudes toward patient 
care in the psychiatric setting. The custodial 
versus treatment orientation has been sub- 
divided into a number of more specific di- 
mensions, With varying success, attempts 
have been made to relate staff attitudes to 
effectiveness in working in the psychiatric 
setting, for example, Carstairs, Gilbert, 
Heron, Levinson, and Pine (1957), Canter 
(1963), Cohen and Struening (1964), and 
Ellsworth (1965). 

The present study evaluated the relation- 
ships between different treatment milieus, as 
measured by staff attitudes toward patient 
care, and treatment-disposition behavior in 
a large psychiatric hospital. Four autono- 
mous treatment units, comparable in com- 
position of staff and patients, provided the 
variation in treatment milieu or unit-staff 
attitudes toward patient care. 


METHOD 
Subjects 


The attitudes of 418 staff members in the four 
units were evaluated three times over the period 
of 1 yr. Monthly measures for a 2-yr. period of 
treatment and disposition variables were analyzed. 


1The authors wish to express appreciation to 
Winfred F. Hill of Northwestern University, who 
served as consultant, for his many helpful sugges- 
tions, and Albert Erlebacher of Northwestern Uni- 
versity, for advice on statistical procedures. 


These measures involved approximately 1,600 psy- 
chiatric patients under treatment in the four units. 


Measures 


Staff attitudes toward patient care were measured 
by the Philosophy of Treatment Form (POT), ex- 
plained in Barrell, DeWolfe, and Cummings (1965). 
The units were compared on the following patient- 
care attitude areas: (A) interprets rules in an authori- 
tarian manner, (B) thinks patients should be told 
their condition, (C) accepts the idea that staff be- 
havior affects patients, (D) is aware of patients’ 
needs, (E) perceives a need for staff congeniality, 
(F) views patients unfavorably, (G) thinks staff 
should be critical about performance. 

This treatment-disposition behaviors evaluated 
were the (a) average hours of restraint and se- 
clusion, (b) number of unauthorized absences and 
elopements of patients, (c) number of maximum 
hospital benefit (MHB) discharges, (d) number of 
patients released on trial visit, (e) number of ir- 
regular discharges, and (f) total dispositions in- 
cluding all releases and discharges. The analyses of 
these treatment-disposition behavior variables were 
based on the monthly figures of the units for the 
first 2 yr. under the unit system, corrected for the 
average monthly patient census. 


Procedures 


The attitude questionnaire (POT) was adminis- 
tered to as many of the staff members as possible 
three times with 6 mo. between administrations. 
Since not all of the staff members who took the 
questionnaire took it all three times, an anal 
was used which permitted maximum use of thé 
available data. The mean score was computed for 
each individual who took the test whether he took 
the test once, twice, or three times. The unit mea” 
of this measure was selected as the best estimate ° 
the overall staff attitude of the units, The sample © 
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418 included 82 staff members from Unit I, 141 from 
Unit II, 95 from Unit III, and 100 from Unit IV. 
‘The average participation across all three adminis- 
trations of the test was about 60% for Unit I, 85% 
for Unit II, 70% for Unit III, and 75% for Unit 
IV. The proportion of the different professions repre- 
sented in the samples from each unit was com- 
parable, and these proportions in each unit were 
comparable to the proportions of the professions in 
the total staff. 

In order to focus attention on the more meaning- 
ful variables, staff attitude and treatment-disposition 
behavior measures were screened on their ability to 
discriminate among the units. Only measures show- 
ing significant differences among the units, or ratios 
of these measures, were analyzed in the main com- 
parisons relating staff attitudes to each other and 
to treatment-disposition behavior measures at the 
unit level. 


RESULTS AND DISCUSSION 


Selection of Staff Attitude and Treatment- 
Disposition Behavior Measures 


Results of analyses of variance screening 
the seven attitude areas on the ability to dis- 
criminate between units are shown in Table 
l. Statistically significant differences were 
found between the units in the following 
areas: interprets rules in an authoritarian 

- manner (p < .01), thinks patients should be 
told their condition (p < .001), accepts the 
idea that staff behavior affects patients (p < 


TABLE 1 


STAFF ATTITUDES TOWARD PATIENT CARE 
AMONG THE UNITS 


EE. ... SA 


.01), and views patients unfavorably (p < 
.05). 

Table 2 presents the results of analyses of 
variance screening the treatment and disposi- 
tion behaviors on ability to differentiate 
units. Significant differences between units 
were found in the number of MHB dis- 
charges (p< .001), the average hours of 
restraint and seclusion (p< .01), and the 
number of trial visits (p< .05). The dif- 
ferences among the units in MHB discharges 
and trial visit releases did not seem to be 
the result of differences in total number of 
dispositions, as the units did not differ sig- 
nificantly on this variable (F <1, df — 3/ 
69). 


Relationships: among Personnel Attitudes and 
Treatment-Disposition Behavior Measures 


Only those staff attitude and treatment- 
disposition behavior measures which indi- 
cated statistically significant differences 
among units or ratios of these measures 
were further evaluated. Since there were only 
four units, a very strong relationship be- 


TABLE 2 


TnEATMENT-DisPOSITION BEHAVIOR MEASURES 
AMONG THE UNITS 


Variable Source | df | MS F 


Hr. of restraint and | Units 19.77 | 4.41** 
seclusion Mo. 8.50 | 1.90* 
Residualf| 69 | 4.48 
No. of patients absent | Units 083| «1 
without official o. 020| «1 
leave Residual 1.06 
No. of trial visit re- | Units 3.93 | 2.75* 
leases Mo. 3.08 | 2.15* 
Residual| 69 | 1.43 
No. of maximum hos- | Units 17.63 |26.71*** 
pital benefit dis- Mo. 4,56 | 6.91*** 
charges Residual | 69 | 0.66 
No. of irregular Units 15.30 | 1.15 
charges Mo. 12.56] «1 
Residual | 69 | 13.35 
Total no. of discharges | Units 163| «1 
En releases Mo. 23 | 9.23 | 1.55 
Residual | 69 | 5.95 


Attitude Area Source | df MS 
BEER foetal 
A, Interprets rules in 3 | 128.67 | 4.29** 
an authoritarian Units |414| 29.99 
manner Within 
B. Thinks patients 3| 89.33 | 7.08*** 
Should be told their | Units | 414 | 12.61 
condition Within 
C. Accepts the idea 3| 24.67 | 5.54** 
that staff behavior | Units |414| 445 
affects patients Within 
- Is aware of pa- Units 3| 31.00 | 1.32 
tients' needs Within | 414 | 23.44 
E, Perceives a need — | Units 3| 26.00 | 1.07 
for staff congeniality | Within | 414 | 24.22 
- Views patients Units 3| 78.11 | 2.60* 
unfavorably Within | 414 | 30.10 
— G. Thinks staff should 017| «1 
beself-criticalabout | Units | 414] 0.32 
performance Within 
Note.—The average scores for all individuals who took 
the Philosophy of Treatment Questionnaire at least once were 
Used in the analyses. 
*5«. 
M p <.01 


Note.— The 24 monthly measures of all six variables were 
corrected for the average monthly patient census for each of 
the four units. 

*p <.05. 

»* 5 c .01. 
> < .001. 
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tween variables was required to attain statis- 
tical significance. Nonetheless, a number of 
significant relationships were found. All of 
the correlations between unit means on Areas 
A, B, and F were positive and statistically 
significant, indicating that units high on one 
variable were high on all three. Correlations 
of .99 (p<.01), .98 (p< .02), and .99 
(p «.01) were found for the relationships 
between unit means of Areas A with B, A 
with F, and B with F, respectively. These 
results are shown in Table 3. As indicated 
above, the V was 4 for all these correlations. 
A strong belief that a patient should be in- 
formed of his psychiatric condition may be 
surprising in those units which endorse more 
authoritarian rule interpretation and less fa- 
vorable attitudes toward patients. 

More authoritarian interpretation of rules 
and less favorable views of patients are cer- 
tainly not in vogue, and groups of people 
willing to endorse such views on a question- 
naire could logically be considered relatively 
frank and open. When these same groups of 
people endorse attitudes of informing patients 
about their condition, they are endorsing 
openness and frankness in interactions with 
patients. Thus, a possible common thread 
among the three empirically related areas 


TABLE 3 
CORRELATIONS OF UNIT MEANS OF STAFF ATTITUDE AND TREATMENT-DISPOSITION BEHAVIOR MEASURES 


A. S. DEWorrr, R. P. BARRELL, AND F. E. SPANER 


would seem to be a generalized open and 
frank approach.* 

The hypothesis that relationships among 
these area scores resulted from a generalized 
orientation of frankness and openness was 
evaluated further by correlating these with 
other variables. Openness and frankness are 
likely to be found in conjunction with trust 
of others. Specifically, if the endorsement of 
these three attitude areas (Areas A, B, and 
F) by unit staffs does indicate openness, 
frankness, and trust, then these attitudes in 
a unit atmosphere should be negatively cor- 
related with hours of restraint and seclusion 
of patients. If trust is not indicated, positive 


2 The relationships of Areas A, B, and F in the 
unit comparisons might have been the result of à 
generalized response bias among the individuals. 
However, further analysis indicated that this was 
not the case. There were nine possible correlations 
(three on each of the three occasions the POT was 
administered to the staff) when Areas A, B, and 
F could be related, using a comparison of individ- 
uals rather than units. None of these nine corre- 
lations was significantly positive. On the contrary, 
only two were positive and the average of the nine 
was —.07. Thus, the strong positive relationships 
between Areas A, B, and F in the unit comparisons 
were not simply a reflection of the response ten- 
dencies of the individuals involved, but appeared to 
be a unit-level treatment-atmosphere phenomenon. 


1. Area A score* 
Authoritarian attitudes 


2. Area B score* 
Informing patients 


3. Area C score* 
Staff behavior importance 


4. Area F score? 
Unfavorable views of patients 


5. Hour of restraint and seclusion> 

6. Trial visits? 

7. Maximum hospital benefits (MHB) discharges» 
8. MHB/MHB + trial visits 

9. % of staff participation 


1 2 3 4 5 6 7 8 
.993** — 

936 932 — 

.982*  .994** .953* — 
—407 —.319 —.154 —.239 — 
—.631 —.561 —.775 —.579 2296 — 

.692  .772  .600  .776 .092 —.1060  — 
.953*  .953*  .996** .971* .166 —.726 .655 — 
460 792 919  .884 251 —.503 .642 9! 


* The average scores of all individuals who took the Philosoph: tment d. 
b These measures were corrected for average patient census. ees prec EIU S ore ers ey 


*p <.05. 
» 5 «.01. 


t» 


STAFF ATTITUDES AND PATIENT CARE 93 


relationships would be anticipated. The cor- 
relations with hours of restraint and seclu- 
'sion were —.41, —.32, and —.24 for Attitude 
Areas A, B, and F, respectively. Although 
mot statistically significant, these negative 
correlations tend to support the trust inter- 
pretation. 

Trust might also be reflected in the condi- 
tions under which patients were released from 
the hospital A trial visit release does not 
sever the patient-hospital relationship and 
facilitates rehospitalization, usually indicat- 
ing less trust in the patient's capabilities. 
An MHB discharge severs the patient's re- 
lationship with the hospital, ordinarily in- 
dicating greater trust in his capabilities.* 
The ratio of MHB discharges to the sum of 
MHB discharges plus trial visits in a unit 


"was used as the index of trust of patients’ 


capabilities. When the mean scores of the 
units on Areas A, B, and F (the attitudes 
hypothesized to indicate a frank, open, and 
trusting orientation) were related to this 
measure of trust, there were significant posi- 
tive relationships between all three attitude 
areas and the trust measure. The correlations 
of the areas with the mean of the ratio of 


‘number of MHB discharges divided by the 


sum of the MHB discharges plus trial visits 
were .95 (df = 2, p < .05) for Areas A and 
B, and .97 (df —2, p< .05) for Area F. 
"These statistically significant findings also 
indirectly support the openness, frankness, 
and trust interpretation of the relationships 
found between this group of endorsed atti- 
tudes of the staff toward patient care. 

The frankness, openness, and trust dimen- 
Sion interpreted from the unit-level treat- 
ment-atmosphere data of the present study 
could be viewed as the antithesis of the pro- 


8 The possibility that MHB discharges which sever 
the patients’ relationship with the hospital were 
actually a desire to get rid of the patients rather 
than a trust in the patients’ capabilities was evalu- 
ated. When MHB discharges were related to ir- 
Tegular discharges on a unit level, the correlation 
Was significantly negative (r=—.92, ~p<.05). As 
itregular discharges separate patients from the hos- 
Pital and preclude readmission for 90 days, the 
negative relationship between MHB and irregular 
discharges was interpreted as support for the hy- 
Pothesis that MHB discharges are indicative of trust 
rather than a desire to get rid of patients. 


tective benevolence factor for individuals 
found by Ellsworth (1965). What he called 
protective benevolence attitudes toward pa- 
tients were the endorsed beliefs that patients 
should be made to feel at home and be ap- 
proached in a generally nonpunitive manner. 
However, patients perceived people who en- 
dorsed these attitudes as aloof, distant, and 
noninteracting individuals. Ellsworth’s inter- 
pretation of the discrepancy was that the 
high-protective-benevolence individual pre- 
fers to “go along with" patients and to ac- 
cept them as they are without any attempt 
to change them. Thus, confrontations (dis- 
agreeing with patients’ statements or de- 
mands) and the potential stresses and differ- 
ences of opinion that could result from 
confrontations can be avoided. Ellsworth 
concluded that the high-benevolence indi- 
vidual apparently strives for superficially 
comfortable relationships with patients by 
accepting them as they are and avoiding con- 
frontations. 

In contrast, the frankness, openness, and 
trust orientation in the present study in- 
volves acceptance of a more authoritarian 
and less laissez-faire approach (higher Area 
A scores), less acceptance of patients as they 
are (less favorable ratings of patients, Area 
F scores), but greater trust in their abilities 
to handle stress (stronger belief that patients 
should be informed about their condition, 
Area B scores and a higher MHB discharge 
to trial visit release ratio). Thus, it is possi- 
ble that the proposed openness, frankness, 
and trust orientation might also include an 
orientation toward helping patients change. 

The different rates of participation of staff 
in the units indicated a differential level oi 
cooperation among staffs in the units. The 
high positive (albeit not significant due to 
the small number of units) correlations be- 
tween the units’ percentage of staff participa- 
tion and their endorsement of the Attitude 
Areas A, B, and F (Table 3) support the 
interpretation that high scores by units on 
these areas indicate trust of others. That the 
units high in scores on thes areas tend to 
trust others more was indicated by the will- 
ingness of more people to expose their atti- 
tudes. 

Although differential participation would 
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usually necessitate greater caution in inter- 
pretation of results, this did not seem to be 
the case in the present study. First, differen- 
tial participation did not result in inequalities 
in professional representation in the samples 
from the units, indicating that this potential 
confounding was not present. 

In addition, differential participation 
probably reduced the strength of the critical 
correlations between staff attitudes and 
treatment-disposition behavior by restricting 
the range of the attitude means for the units: 
(a) It would seem likely that nonresponders 
would tend to be low in frankness, openness, 
and trust of others; (D) complete returns 
from all units would be expected to spread 
the unit means of the frankness, openness, 
and trust of others attitudes, since the units 
with the fewest respondents tended to have 
lower means on these attitudes; and (c) the 
actual range of unit-staff attitude score means 
was probably reduced. Reduced range of at- 
titude means would result in lower correla- 
tions between staff attitudes and the be- 
havioral measure of trust of patients’ 
capabilities, thereby decreasing, rather than 
increasing, the probabilitiy of falsely reject- 
ing the null hypothesis. 

In light of the findings above, the sig- 
nificant positive relationships between all 
three measures in the openness, frankness, 
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and trust cluster of endorsed attitudes and 
the objective measure of trust of patients 
would seem to indicate that openness, frank- 
ness, and trust could be important dimen- 
sions in the treatment climate and merit 
further exploration. Specifically, evaluation 
of the relationships between these dimensions 
and an orientation toward helping patients 
change might be fruitful. 
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EFFECTS OF LABELS ON THE PERCEPTION AND 
DISCRIMINATION LEARNING OF RETARDATES + 


PHYLLIS A. KATZ anD SUE ROSENBERG 
New York University 


The role of labels in perception was assessed in 54 familial retardates and 48 
normal IQ children at two matched MA levels. Three experimental manipula- 
tions were employed at each level in a factorial design. One group learned to 
associate a common label to random visual forms; a second group learned 
distinctive labels to the same forms; finally, a third group viewed the forms 
without labels. Following verbal training, all Ss received perceptual judgment 
and discrimination-learning tasks. No significant differences emerged between 
the retarded and normal Ss on paired-associate learning, discrimination learning, 
or perceptual accuracy without labels. In contrast to the normal children, 
however, the retardates’ perceptual and learning processes were relatively 
unaffected by the learning of verbal levels. 


A recent controversy in the area of mental 
retardation has been centered around the 
question of whether or not language cues 
have the same effect on normal children and 
familial retardates. One widely accepted theo- 
retical position advanced by Luria (1963) 
suggests that the cognitive and motor be- 
havior of retarded children is not effectively 
regulated by language. This relative lack of 
verbal mediation is assumed to result from 
underlying neurological defects. A second 
view, espoused by Zigler (1966), maintains 
that familial retardates are not linguistically 
deficient, but are instead intellectually com- 
parable to normal children of the same MA 
level. This latter position holds that differ- 
ences in intellectual performance previously 
obtained between retarded Ss and matched 
MA normals (e.g., Ellis, 1963; Kounin, 1941; 
Spitz, 1963; Zeaman & House, 1963) 
are attributable to differential motivational 
patterns rather than to innate organismic 
defects. 

Research relevant to these two positions 
has focused primarily upon the effects of 


1This study was partially supported by a New 
York University Arts and Science Research Grant 
and by National Institute of Mental Health Grants 
HD-2672-01, Phyllis A. Katz, principal investigator 
and HD-MH-03008, Edward Zigler, principal investi- 
gator. The authors would like to thank Edward 
Zigler for assisting in obtaining Ss and Jacob 
Schneider and Thomas McCulloch for their assist- 
ance in providing them. Additionally, particular grati- 
tude is expressed to Aron B. Katz for his many in- 
sightful suggestions. 
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verbal training on learning and problem solv- 
ing in retardates (e.g, Barnett & Cantor, 
1957; Milgram & Furth, 1963; O'Conner & 
Hermelin, 1959; Smith & Means, 1961), and 
the findings have not provided clear-cut sup- 
port for either viewpoint. Despite the widely 
acknowledged importance of perception as 
the process underlying learning and problem 
solving, there have been no investigations, to 
date, of the direct effects of language upon 
the perceptual processes of retarded indi- 
viduals. The purpose of the present study 
was to compare the effects of verbal train- 
ing upon the perception of retardates and 
matched MA normals. Based upon the rea- 
soning outlined above, Luria's (1963) posi- 
tion would generate the prediction that the 
perceptual responses of retardates will be less 
influenced by language training, whereas 
Zigler's (1966) viewpoint would predict no 
difference in the degree of modification ex- 
hibited by the two groups. These two possi- 
bilities were assessed in the present investi- 
gation. 
METHOD 

Subjects 

Fifty-four familial retardates were selected from 
a New York State public institution for the men- 
tally retarded. Half of the Ss had a mean MA of 
6 yr., 10 mo. on the Stanford-Binet Intelligence 
Scale at the time of the experiment, and the other 
half had an MA of 8 yr., 9 mo. These groups will 
be referred to hereafter as the low- and the high- 
MA groups, respectively. The mean IQ of these 


Ss were 47.7 and 602, respectively, with a total 
range extending from 40 to 73. The mean CA of 
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both the groups was 19 yr., 6 mo. This sample 
comprised all of the higher MA retardates available 
at the institution. No S whose records indicated any 
evidence of organic etiology was included in the 
sample. 

The performance of the retarded Ss was compared 
to scores obtained by a group of 48 Ss of normal 
intelligence in an earlier study (Katz, 1963). The 
normal Ss were randomly drawn from the first-, 
second-, and fourth-grade classes of a public ele- 
mentary school in New Haven, Connecticut. Half 
of these Ss were 7 yr. old, and the other half 
averaged 9 yr., 6 mo. Thus, the normal and re- 
tarded Ss were of roughly comparable MA levels. 
It was not possible to match the higher level re- 
tarded Ss exactly to the older normal sample (who 
were run earlier) since there were not enough re- 
tardates at the institution with MA levels of 9 yr. 
or more. 


Procedure 


Al Ss were run individually in the school or in- 
stitution. The tasks were introduced to Ss as games. 
Upon entering the experimental room, they were 
Shown "prizes" and were told they could win the 
object of their choice if they played the games well. 
The prizes were appropriate to the CA of Ss. For 
the normal elementary school children, the reinforce- 
ments included such things as jacks, modeling clay, 
toy guns, marbles, books, and small games. For 
the retardates, the prizes included such objects as 
comic books, combs, stationery, and ball-point pens. 

Within the retarded and normal groups, at both 
MA levels, an equal number of Ss was randomly 
assigned to three experimental condition, depending 
upon the type of verbal training received. A com- 
mon label group was taught to associate two non- 
sense syllables with four highly similar random 
forms. Thus, for this group, there were two forms 
per label. A distinctive label group learned to asso- 
ciate four distinctive labels to the same four forms. 
Finally, a control group viewed the figures without 
labels. The control Ss were asked to count the 
figures aloud to minimize the possibility of spon- 
taneous labeling. For all training conditions, the 
forms were individually presented on slides for 150 
trials, A more detailed description of the stimuli 
and the apparatus can be found elsewhere (Katz, 
1963). The verbal labels were chosen from nonsense- 
syllable lists of 95% association value and were 
randomly assigned to the visual forms for each S. 
A correction procedure was used for the label groups. 
Verbal support was given to all groups after every 
tenth trial, that is, “You're doing fine"; “You really 
know how to play this game”; etc. In order to 
maintain interest in the task, the nonsense forms 
were introduced to all Ss as “pictures of funny- 
looking animals from Mars,” and Ss were asked 
to pretend they were space explorers. 

Following the verbal training, a perceptual judg- 
ment task was introduced in which pairs of the 
previously employed forms were tachistoscopically 
presented on slides at a .2-sec. exposure. The S was 


instructed to make a judgment of "same" if the 
members of the pair looked like twins (ie, were 
identical) and "different" if they did not. Judgments 
were indicated by S’s depression of one of two 
appropriately marked telegraph keys, which stopped 
a clock when pressed. The apparatus allowed for 
the recording of both choice scores and judgment 
latencies. The slides which were shown contained 
8 pairs of identical forms and 20 pairs of two. 
different forms. Ten slides in this latter group con- 
tained pairs of different figures previously associated 
with the same label for the common label group. 
The actual forms used were matched across groups, 

Following the perceptual judgment task, a dis- 
crimination-learning task was introduced to all Ss, 
Three of the four previously employed forms were 
used as discriminanda. For this task, the forms were 
cut out of Masonite and mounted on blocks of 
wood. The S was instructed as follows: "One of 
the animals has a paper goldfish like this under it. 
You pick up the one you think has the fish under 
it. If you're right, you can keep the fish and at the 
end, you can trade them in for prizes." 

The reinforced stimulus was always one of the 
two forms previously associated with the same label 
for Ss in the common label group. The position 
of the figures was changed behind a vertical screen 
before each trial. "Testing was discontinued after 
five consecutive correct choices or 50 trials. 


RESULTS 
Verbal Learning 


A four-way analysis of variance (Type of 
Label X Type of Ss X MA Level x Blocks of 
Trials) was conducted on the number of cor- 
rect responses made by the various label 
groups. The only significant (p < .01) effect 
to emerge from the analysis was the blocks 
of trials, indicating that all Ss improved over 
trials. None of the other main or interaction 
effects was significant. The normal and te 
tarded Ss at each MA level learned the 
labels equally well. ] 


Perceptual Judgments 


The measure of major interest to the posl" 
tions initially advanced is the number 0 | 
“same” judgments made to pairs of physically 
different forms which were -associated w! 
the same label in the common label group. 
It would be expected that if labels did modify 
perception, then a common name sho 
elicit more “samé” judgments made to the 
two figures, whereas distinctive names shoul 
elicit more “different” judgments relative t° 
a no-label control group. 
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DISCRIMINATION LEARNING OF RETARDATES 


The mean number of “same” judgments 
made by the various groups is presented in 
Table 1. -A three-way (Type of S X Treat- 
ment X MA) analysis of variance was con- 
ducted on these scores. This analysis re- 
vealed the type of S effect to be significant 
(F = 7.93, df=1/90, p< .01), indicating 
that the retarded Ss were somewhat more in- 
accurate in their perception than the normals. 
A comparison of the normal and retarded no- 
label control groups was not significant, how- 
ever, suggesting that the type of S effect 
is primarily attributable to the labeling train- 
ing. The Treatment X S interaction was sig- 
nificant at the .05 level (F = 4.71, df= 
2/90), reflecting the finding that the verbal 
training conditions had differential effects on 
"retarded and normal Ss. In addition, the 

triple. interaction of Treatment X S X MA 
- level was significant (F = 3.77, df = 2/90, 
?«.05). 

In view of these interactions involving type 
of S, separate analyses were conducted for 
the normal and retarded Ss. For the normals, 
the main effect of treatment was statistically 
significant (F = 4.70, df = 2/42, p < .05), 
"indicating that labels influenced the judgment 
of stimulus similarity in the predicted direc- 
tion. Analysis of the retardates’ error scores, 
however, revealed no significant overall treat- 
ment effect. For the retarded group, a sig- 
nificant Treatment X MA interaction emerged. 
The means involved in this interaction, con- 
"tained in Table 1, reflect the finding that 
„ labels did not influence the judgments of the 

low-MA retarded Ss at all, whereas they did 
have some effect on the higher MA Ss. For 
TABLE 1 


Mean NUMBER or “Same” JUDGMENTS MADE TO 
PHYSICALLY DIFFERENT Parrs or Forms Asso- 
CIATED WITH THE SAME LABEL FOR THE 

i Common LABEL Groups 


Type of subject 
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TABLE 2 
MEAN RESPONSE LATENCIES ON PERCEPTUAL 
Jupement Task 
SN Type of subject. 
Group 
Normal Retardate 
Label 1.63 141 
Control 1.18 1.57 


this latter group, having an equivalent label 
for two stimuli made the stimuli appear less ' 
distinctive, as was true for the normals. Hav- 
ing distinctive labels, however, which theo- 
retically should decrease similarity, had an 
opposite effect for the high-MA retardates. 

The data regarding response latencies to 
these stimulus pairs paralleled the choice 
scores, for the most part. When the scores of 
normal and retarded Ss were analyzed to- 
gether, a significant Treatment X Type of 
S interaction emerged. The means involved 
in this interaction are presented in Table 2. 
It can be seen that with the normal children 
the label groups took longer to make a judg- 
ment than did the control group (F — 4.88, 
df = 2/36, p < .05), suggesting that verbal 
mediation was operative. For the retarded Ss, 
however, no such difference between the label 
and control groups in response latencies was 
evident, 
Discrimination Learning 

The mean number of correct responses is 
contained in Table 3. An analysis of variance 
conducted on these measures revealed a sig- 
nificant Treatment X Type of S interaction 
(F = 3.40, df = 2/90, p < .05), indicating, 
once again, that the labels had differential 
effects on retarded and normal Ss. Analyses 
conducted separately on these two popula- 
tions revealed a significant treatment effect 


TABLE 3 


MEAN NUMBER oF CORRECT RESPONSES ON 
DiscRIMINATION-LEARNING TASK 


Group Normal Retardate Normal Retardate 
Group 
i i High 
oot ee i eru due 
Common label 4.00 4.62 543 444 Common label 15.5 17.1 23.0 21.0 
No label (Control) | 2.35 3.88 5. 241 No label (control) 19.4 23.1 20.1 30.1 
` Distinctive label 2.25 1.62 | 5.33 5.55 Distinctive label 279 | 265 | 157 | 231 
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for the normal Ss. It can be seen in Table 3 
that this effect is in the expected direction; 
that is, the common label group made more 
errors than a control group, whereas the 
distinctive label group made fewer errors. 
An analysis of variance conducted on the 
retardates’ scores yielded no significant main 
or interaction effects. Thus, it may be con- 
cluded that labels did not influence the dis- 
crimination learning of the retardates. 


Discussion 


The major finding of the present investiga- 
tion is that labeling training has differential 
effects on the perceptual responses of familial 
retardates and normal children of matched 
MA levels. For normal 7- and 9-yr.-old school 
children, the association of a common label 
with two objects elicited increased perceptual 
similarity and more difficulty in learning to 
discriminate between them relative to a no- 
label control group. Conversely, having dis- 
tinctive labels for these stimuli increased dis- 
tinctiveness and facilitated the learning of a 
discriminatory response. Furthermore, groups 
having labels took longer to make perceptual 
judgments than groups without labels. This 
latter finding offers additional corroboration 
for verbal mediation effects upon perception. 
With the retarded population, however, label- 
ing training did not significantly alter, for 
the most part, either the discriminability of 
the stimuli or the time required to make a 
perceptual response. This clearly suggests 
that verbal mediation training was not par- 
ticularly effective in modifying the perception 
of retarded Ss. 

Although it is clear from the findings that 
labels have less salience in the discriminatory 
processes of retarded Ss than in normals, it 
should be noted that several possible inter- 
pretations of this difference exist. 

One possibility is that Luria's (1963) gen- 
eral thesis is correct, and that some as yet 
unspecified organic linguistic defect exists in 
all retardates. Logically, however, such an 
assumption could not be unequivocally sup- 
ported in the absence of direct neurological 
evidence. 

A second possible explanation is that verbal 
cues are less effective for the retardate be- 


cause he has received less reinforcement for 
verbal mediation in the past. A caretaker's 
verbal interactions with a retarded child 
might be minimal, particularly under cer- 
tain types of institutional conditions, and 
thus words may not acquire much secondary 
reinforcing value for the retardate. This so- 
cial reinforcement interpretation suggests that 
verbal mediation differences would be less 
pronounced between a noninstitutionalized re- 
tarded sample and a normal group of chil- 
dren. Furthermore, it offers more potential 
for behavior change than the cognitive struc- 
tural defect position postulated by Luria. 

A third possibility which should be con- 
siderd is that the retarded Ss were not verb- 
ally deficient at all. It is typically assumed 
by most investigators that a retardate be- 
having differently than a matched MA normal 
is necessarily functioning at a lower develop- 
mental level. Such an assumption completely 
ignores the greater opportunities for learn- 
ing that the chronologically older retardate 
has had. Although the devlopmental trends 
regarding verbal mediation in normal indi- 
viduals are not completely clear, extrapola- 
tion from a considerable body of research in 
this area suggests that the salience of verbal 
cues may be more pronounced in the per- 
ceptual processes of children (e.g., Cantor, 
1955; Katz, 1963; Norcross, 1958) than of 
adults (e.g, Arnoult, 1953; H. C. Ellis & 
D. G. Muller, 1964; Robinson, 1955). Thus, 
it is conceivable that the retardates in this 
study were behaving more like their CA 
counterparts, Further testing is indicated to 
evaluate these various possibilities. 
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A BRIEF MENTAL HEALTH INDEX" 


E. K. ERIC GUNDERSON 4x» RANSOM J. ARTHUR 


Navy Medical Neuropsychiatric Research Unit, 
San Diego, California 


The Cornell Medical Index (CMI) was administered to 630 Navy psychiatric 
patients and 454 healthy controls. Patient and control samples were split into 
two groups for cross-validation purposes, and two methods, regression analysis 
and a new item selection technique called SEQUIN, were applied to the 
problem of selecting the most discriminating set of CMI items. The per- 
centages correctly classified “sick” or “well” when results from Sample 1 were 
used to predict Sample 2 and vice versa were 82% and 85% by the regression 
method and 86% and 86% by the SEQUIN method. Seven items, perhaps 
representing general attributes defining mental illness in the Navy culture, 
contributed significantly to the predictive scales regardless of particular item 


selection method or sample. 


The Cornell Medical Index (CMI) has 
been widely used as an aid to clinical diag- 
nosis (Brodman, Erdmann, & Wolff, 1956; 
Erdmann, 1959) and as a measure of the 
prevalence of somatic and emotional symp- 
toms in various populations (Abramson, 
Terespolsky, Brook, & Kark, 1965). The In- 
dex has repeatedly demonstrated validity as 
an indicator of general health or of emo- 
tional health when compared with clinical 
ratings, but has proved less useful as an in- 
dicator of the presence of specific diseases 
or in comparing health status in different 
populations and cultures (Abramson, 1966). 
In a previous study of the Navy population, 
the total CMI score (the number of “yes” 
responses over the entire 195-item health 
questionnaire) discriminated efficiently be- 
tween psychiatric inpatients and healthy con- 
trols using a cutting score of 15 (Arthur, 
Gunderson, & Richardson, 1966). This re- 
sult suggested that the CMI might have po- 
tential value as a psychiatric screening de- 
vice and as an epidemiological tool in the 
Navy setting. 

The next logical step in developing a use- 
ful screening index was to identify the specific 


1 Report No. 67-18, supported by the Bureau of 
Medicine and Surgery, Navy Department, under 
Research Work Unit MF 022.01.03-9002. The opin- 
jons or assertions contained herein are the private 
ones of the authors and are not to be construed as 
official or as necessarily reflecting the views of the 
Department of the Navy. 

Statistical assistance was provided by Frank 
Thompson. 


items in the CMI which best differentiate 
patients from healthy controls and to deter- 
mine the most efficient combination of these 
items. It seemed plausible that a short scale 
of highly selected items might be devised 
which could discriminate mentally ill from 
healthy individuals in the Navy population 
as well as the entire CMI inventory. 


METHOD 
Subjects 


The Ss for the study were 630 psychiatric in- 
patients from two naval hospitals and a normal 
control group of 454 enlisted men who were cur- 
rently on active duty and were not medical or 
psychiatric patients at the time of the study. The 
control group consisted of students from an elec- 
tronics technicians school, Marine guards, hospital 
corpsmen, and Navy enlisted men from several occu- 
pational categories on duty at a West Coast naval 
station, and participants in underwater demolition 
training. The mean ages of the patient group and 
the control group were 22 yr. and 21 yr., respec- 
tively. Distributions in pay grade (rank) were ap- 
proximately the same for both patients and controls, 
although seamen recruits were represented in the 
patient group but not in the control group. 


Procedure 


CMI questionnaires were administered individually 
or in groups by medical personnel at the various 
facilities included in the study. The total sample of 
1,084 patients and controls as divided into tW? 
subgroups for purposes of cross-validation by sort- 
ing on odd or even last digits of service numbers. 
By this method, 560 cases were selected for Sam- 
ple I, of whom 335 were patients and 225 were 
controls. The composition of Sample II was 2% 
patients and 229 controls, for a total of 524 Ss. 
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The frequency of “yes” responses given by pa- 
tients and controls for all 195 CMI items was ex- 
amined, and, on the basis of the differences in 
percentages answering “yes,” 60 items were chosen 
for intensive analyses. The regression technique was 
used to select the most discriminating of the 60 
items and to determine regression weights for sets 
of items derived separately from Samples I and 
II, The regression weights from Sample I then were 
used to predict the patient versus control criterion 
in Sample II (0=control, 1=patient), and, simi- 
larly, regression weights from Sample II were used 
to predict the illness criterion in Sample I. An al- 
ternative method of item selection and weighting, 
called SEQUIN (Moonan & Pooch, 1966), also was 
used for cross-validation, The most important dif- 
ference between the two methods is the assignment 
of unit weights to items selected by the SEQUIN 
method as opposed to the assignment of variable 
weights in the regression method. 

Correlations were computed by the product-mo- 
ment method for the cross-validities as well as the 
regression analysis and by the biserial method for 
the SEQUIN analysis; the two methods of correla- 
tion yielded identical results in the present analyses. 


RESULTS 


Table 1 presents the items, correlations 
with the criterion, and the regression coeffi- 
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cients derived from Samples I and II. The 
items most highly correlated with the cri- 
terion in both samples were, “Does worrying 
continually get you down?” and “Are you 
considered a nervous person?” The item, 
“Were you ever a patient in a mental hos- 
pital (for your nerves) ?” had relatively large 
regression weights in both samples. Of all 
CMI questions, these items most directly refer 
to severe emotional disturbance. Substantial 
weights were received by the items, “Do you 
smoke more than 20 cigarettes a day?” “Do 
you suffer badly from frequent severe head- 
aches?” and “Do you need glasses to read?” 
The relationship of these items to mental ill- 
ness is less obvious than the relationships of 
the aforementioned items; however, head- 
aches, reading difficulties, and excessive smok- 
ing are commonly recognized concomitants 
of emotional distress. Various specific emo- 
tional symptoms, for example, “shy or sensi- 
tive,” “frightening thoughts,” and “unhappy 
and depressed” contribute to discrimination 


TABLE 1 


CORRELATIONS AND REGRESSION COEFFICIENTS FOR CORNELL MeEpicat Inpex (CMI) 
ITEMS SELECTED BY THE REGRESSION METHOD 


Sample I Sample IT 
Item CM 
no. ect Item Regres- Regres- 
r^ sion co- r sion co- 
efficient efficient 
1 A Do you need glasses to read? 23 13 eek 46 
79 | G | Do you suffer badly from frequent severe headaches? Al 43 | 38 44 
113 I Do you suffer from severe nervous exhaustion? 39 Sit 
133 | K | Are you definitely underweight? 30 12), | 8d 41 
142 L Do you smoke more than 20 cigarettes a day? 42 .23 .39 ak; 
143 L Do you drink more than six cups of coffee or tea a day? .30 42 
146 M Do you get nervous and shaky when approached by a 
5s. [AN | Do odas feels d depressed? n d 
o you usually feel u: an T : : 
162 N Do you often wish you ES dead and away from it all? 40 446 
163 o Does worrying continually get you down? AS 09 | .49 48 
166 o Are you considered a nervous n? AT .16 46 .14 
168. O. | Did you ever have a nervous breakdown? 335 43 
169 o Did anyone in your family ever have a nervous break- 
down? 20 —.08 
170 o Were you ever a patient in a mental hospital] (for your 
nerves)? .28 21 432, 19 
172 P Are you extremely shy or sensitive? 36 13 
180 | Q | Are you easily upset or irritated? bse pta à 
m Q | Do people often annoy and irritate you? 445 E 
Hee R Do sudden noises make you jump or shake badly? | 43 10 
3 R Do frightening thoughts keep coming back in your mind? | .42 13 
194 R Do you often become suddenly. scared for no good reason? .28 —.13 
Multiple r .69 69 


Note.—Items which contributed significantly ( < .05) to an increase in the multiple correlation were included. 


“Correlation between item responses (yes-no) and 


it versus control 


criterion. 


Note.—The correlations are between item responses (yes-no) and patient versus control criterion. 


in one or the other sample, but are less con- 
sistent than the previously mentioned items. 
Three suppressor items appeared in the 
Sample I analysis and one in the Sample II 
analysis. These items correlated positively 


A multiple correlation of .69 was achieved 
with this set of items against the illness cri- 
terion. When the regression weights in Sam- 
ple I were used to predict “well” versus “sick” 
status for Sample II, a cross-validity coeffi- 
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TABLE 2 
CORRELATIONS OF IrEMs SELECTED BY THE SEQUIN METHOD 
Cornell 
Item No. EXER Item Sample I | Sample II 
section 
1 A Do you need glasses to read? 22 27 
79 G Do you suffer badly from frequent severe headaches? 40 38 
85 G Have you fainted more than twice in your life? 28 
133 K Are you definitely underweight? 30 32 
139 L Do you usually have great difficulty in falling asleep or staying ie 
asleep? . 
142 L Do you smoke more than 20 cigarettes a day? .42 39 
143 L Do you drink more than six cups of coffee or tea a day? 31 30 
146 M Do you get nervous and shaky when approached by a superior? 40 
147 M Does Beppe fall to pieces when the or a superior is watch- / 
ing you E 
158 N Do you usually feel unhappy and depressed? 44 
162 N Do you often wish you were dead and away from it all? 39 
163 [0] Does worrying continually get you down? EG 49 
166 (0) Are you considered a nervous person? 46 AS 
167 [0] Does nervousness run in your family? 29 
168 [6] Did you ever have a nervous breakdown? 35 
170 oO Were you ever a patient in a mental hospital (for your nerves)? 28 32 
175 P Does criticism always upset you? 37 
184 Q Do people often annoy and irritate you? 44 
189 R Do sudden noises make you jump or shake badly? 43 
193 R Do frightening thoughts keep coming back in your mind? 42 
Cumulative r 67 68 
J 
| 


with the criterion, but in combination with 
the entire set of significant items they re- 
ceived negative regression weights for the 
prediction of the patient-control criterion. 
The fact that none of the negative (suppres- 
sor) weights were replicated in the other sam- 
ple would recommend caution in interpreting 
these results. 

Items selected by the SEQUIN method 
and their correlations with the criterion for 
Samples I and II are shown in Table 2. All 
but five of the items selected by the SEQUIN 
method also had been selected by the regres- 
sion method in at least one sample. Thus, the 
item content of the scales derived by the 
two methods was quite similar. 

Multiple or cumulative correlations and 
cross-validation results are shown in Tables 
3 and 4. Fifteen items significantly con- 
tributed to discrimination of patients from 
controls in Sample I by the regression method. 


cient of .69 was obtained. Regression weights 
for 12 significant items were derived from 
Sample II in order to predict patient-control 
status for Ss in Sample I. The multiple 


TABLE 3 


MULTIPLE CORRELATIONS AND CROSS-VALIDITY 
COEFFICIENTS FoR Two ITEM-SELECTION 


METHODS 
Method | No. of items | Multiple, | tiation 
Regression 
Sample I 15 69 69 
Sample II 12 169 e 
Cumulative 
correlation 
SEQUIN 
SampleI 14 67 D 
Sample IL 14 .68 n 


* Items and weights from Sample I were used to predict 
patient or control status in Sample II and vice versa. 
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TABLE 4 


Cross-VALIDITy DISTRIBUTIONS FOR 
Two IrEM-SELECTION METHODS 


Sample I Sample II 
Method Actual Actual 
Well | Sick | Well | Sick 
Regression 
Predicted* 
Sick 47 | 259 42 | 244 
Well 170 | 45 | 180 34 
Correct predictions 82% 85% 
SEQUIN 
Predicted* 
Sick 39 | 295 38 | 261 
Well 186 40 | 191 34 
Correct predictions 86% 86% 


* Predictions for Sample I were based upon items and weights 
derived from Sample II and vice versa. Cutting points for pre- 
dicted criterion scores were based upon actual proportions 

ick" and '*well" in each sample. 


correlation and cross-validity coefficients were 
.69 and 64, respectively, for this set of items. 
Using the SEQUIN technique for item 
selection, 14 items were found to cumulatively 
increase the correlations with the criterion for 
both Samples I and II. Cumulative correla- 
tions of these sets of items with the illness 
criterion were .67 and .68, respectively, and 
cross-validity correlation coefficients were .72 
and .71, respectively, for Samples I and II. 
The distributions obtained when the re- 
gression or SEQUIN results from Sample I 
were used to predict Sample II and vice versa 
are given in Table 4. The percentages classi- 
fied correctly, that is, “sick” or “well,” were 
82% and 85% by the regression method and 
86% and 86% by the SEQUIN method. 


Discussion 


The development of a short, self-adminis- 
tered mental health index which provides a 
reasonable approximation of the results of 
a brief psychiatric examination would have 
obvious importance for both research and 
Practice. Opportunities for comparative stud- 
les of populations using clinical examination 
Methods are extremely limited, and investi- 
gations of prevalence of symptoms and ill- 
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ness almost always must rely upon Ss’ own 
reports of health status. Important relation- 
ships of personality, cultural, and environ- 
mental factors to emotional illness and in- 
validism can be readily adduced by means 
of survey methods if valid questionnaire tech- 
niques can be devised. 

In military psychiatric practice, decisions 
concerning fitness for military duty or the 
need to hospitalize often must be made 
rapidly and without the benefits of extended 
clinical study. A standardized screening in- 
strument obviously could have value in set- 
tings with rapid patient turnover, for ex- 
ample, an outpatient clinic, where allocation 
of the psychiatrist's or physician's time is an 
important consideration. Also, the specific 
questions identified as most significant in 
differentiating “sick” from “well” individuals 
may suggest particularly useful areas to be 
explored in brief clinical interviews because 
these items have a clearly demonstrated rele- 
vance for estimating the probability of mental 
illness. 

The findings of the present study are 
viewed as encouraging with respect to the 
possibility of developing a brief but efficient 
mental health index for the Navy population. 
It is clear that selection of items by linear 
multiple regression or similar techniques can 
provide great economies in scoring the pres- 
ent CMI questionnaire and at the same time 
slightly increase validity as well. Scoring only 
from 12 to 15 items by means of regression 
weights or unit weights resulted in concur- 
rent validities consistently higher than those 
attained with scores based on unit weights 
for all 195 items in a previous study.” Split- 
half cross-validation results indicated high 
stability and comparability for the various 
brief predictor scales obtained. 

The apparent superiority of the SEQUIN 
method of item selection for scale cross-vali- 
dations suggests that simple counting of diag- 
nostic answers (“yes”) would be preferable 


2When scores based upon all 195 items were 
dichotomized at 15 and were correlated with a sick- 
well criterion, a concurrent validity coefficient (phi) 
of .65 was attained. Details concerning the dis- 
criminating power of scores based upon the entire 
195-item CMI were given in the study by Arthur 
et al. (1966). 
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to more complex differential weighting pro- 
cedures in scoring a mental health index. 
This simplicity of scoring would offer prac- 
tical advantages for many clinical settings. 

A large proportion of the items included 
in the various experimental scales was drawn 
from Sections M-R of the CMI questionnaire. 
These items were designed to elicit psycho- 
logical symptoms, and it is not surprising that 
they are the most discriminating with re- 
spect to mental illness. Three items from Sec- 
tion L—items usually not scored as emotional 
symptoms—also were highly significant cor- 
relates of diagnosed mental disorder. These 
items—'Do you usually have great difficulty 
in falling asleep or staying asleep?” “Do 
you smoke more than 20 cigarettes a day?” 
and “Do you drink more than six cups of 
coffee or tea a day?”—should be included in 
scales designed to estimate mental health 
status in the naval population. 

Seven “universal” items emerged from the 
various analyses; that is, items which con- 
tributed significantly to the predictive scales 
regardless of particular item-selection method 
or sample, These questions perhaps represent 
a set of general attributes which are most 
important in defining mental illness in the 
Navy culture. The items, in order of im- 
portance, are, “Does worrying continually 
get you down?” “Are you considered a ner- 
vous person?” “Do you smoke more than 
20 cigarettes a day?” “Do you suffer badly 
from frequent severe headaches?” “Are you 
definitely underweight?” “Were you ever a 
patient in a mental hospital (for your 
nerves)?” “Do you need glasses to read?” 
These items appear to offer a useful nucleus 
of questions for a brief psychiatric index. 
Other questions could be added as their rele- 
vance was consistently demonstrated. 

Using different methods of item selection, 
Abramson et al. (1965) devised a scale of 10 
“key” CMI items (Hebrew version) which 
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correlated highly (r = .63) with physicians’ 
ratings of emotional health in a randomly se- 
lected sample of Jerusalem males. Seven of 
these 10 key items appeared in the experi- 
mental scales of the present study, suggesting 
a surprising generality in the present findings, 

Studies of the relationships of brief mental 
health index scores to psychiatrists’ decisions 
concerning fitness for duty and need for 
hospitalization are in progress, and the effects 
of social and cultural differences upon health- 
questionnaire responses also are being evalu- 
ated, 

Because the predictive value of the brief, 
CMI scales derived in this study has not yet 
been tested, the accuracy and usefulness of 
these scoring techinques for diagnostic pur- 
poses with new or suspected patients remain 
to be demonstrated, and they obviously should 
not be used for intake screening until such 
verification can be achieved. 
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PERFORMANCE OF RETARDED READERS ON THE 
WISC AND EDUCATIONAL TESTS 


J. G. LYLE anp JUDITH GOYEN + 
University of Sydney, New South Wales, Australia 


The WISC results of 54 retarded readers (experimentals) of average intel- 
ligence from the six primary school grades were compared with those of a 
matched group of 54 adequate achievers (controls). The experimentals had 
greater Verbal (V) ~ Performance (P) differences than the controls in both 
directions. This was due mainly to their significantly poorer performances on 
Information, Arithmetic, and Coding subtests, and to their significantly better 
performances on Comprehension, Picture Arrangement, and Block Design sub- 
tests after adjustments had been made by covariance analyses for Full Scale 
IQ differences. The experimentals also performed significantly less well than 
controls on untimed tests of Spelling and Arithmetical Calculation. No tendency 
was found for the differences between groups on the WISC or educational 
tests to increase with age, as was the case with their reading achievement. 
Hence there was no support for Neville’s hypothesis that certain of the 
differences are effects of reading retardation rather than correlated symptoms. 


The concept of reading retardation as a 
symptom of minimal cerebral dysfunction 
(Money, 1962) has led psychologists to seek 
correlates of this disability in cognitive and 
perceptual tests. As with known cases of 
brain damage, retarded readers are found 
with gross Verbal ~ Performance discrepan- 
cies on the WISC or sometimes with gross dis- 
crepancies on just one or two of the WISC 
subtests. For example, the authors have 
found one child? of 8 yr. who averaged a 
scale score of 8 on all subtests excepting 
Vocabulary, on which he scored 0; and an- 
other (see Footnote 2) aged 17 yr. who, de- 
spite an overall IQ of 94, could not, even with 
prompting, attain a raw score of above 3 on 
Picture Completion. 

In the clinic situation it is very tempting 
to infer correlations between such discrep- 
ancies and reading disability; but the clinical 
psychologist labors under two serious handi- 
caps: he lacks a control group, and he is 
dealing with a selected clinic sample. 

This report is part of a wider study de- 
signed to determine: (a) whether Verbal 
~ Performance discrepancies on the WISC 
are characteristic of retarded readers; (5) 

1The cooperation of the New South Wales De- 
Partment of Education, in particular, of O. R. Jones, 
former Director of Primary Education, and C. H. 
von Sturmer, Principal Research Officer, is grate- 
fully acknowledged. 

? Not included in these samples. 


whether retarded readers manifest any char- 
acteristic scatter on the WISC subtests; and 
(c) whether reading retardation is a spe- 
cific educational problem or whether it is 
merely part of a generalized school back- 
wardness. 

A number of studies of WISC patterns of 
retarded readers has been published, most of 
them suffering from the defects of small 
sample size (N < 30) and/or from lack of 
a control group. Studies falling within one 
or other of these categories are those of Altus 
(1956), Burks and Bruce (1955), Coleman 
and Rasof (1963), Dockerell (1960), Hirst 
(1960), Graham (1952), and Robeck (1962). 

Experimentally more satisfactory studies 
are those of Neville (1961), McLeod (1965), . 
and Belmont and Birch (1966), the two last 
mentioned being the only ones with large 
samples. Briefly, the findings in all three of 
these more satisfactory studies were that re- 
tarded readers perform relatively poorly on 
the subtests of Information and Arithmetic. 
Vocabulary was also found to discriminate 
significantly by McLeod (1965) and. Bel- 
mont and Birch (1966), Digit Span by Mc- 
Leod (1965) and Neville (1961), and Coding 
by McLeod (1965). Reverse discrimination 
(ie., retarded readers better than controls) 
was found on the following subtests: Picture 
Completion (McLeod, 1965), Object Assem- 
bly (Belmont & Birch, 1966), and Picture 
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Arrangement and Block Design (Neville, 
1961). 

Most published studies have drawn samples 
from specialized reading clinics, in which, 
because of their selective nature, factors of 
a sensory, neurological, emotional, or socio- 
economic nature are likely to have been un- 
duly represented. This sampling bias has 
usually been compounded by a failure to ex- 
amine Ss clinically in order to screen out 
those with easily explicable causes, such as 
culturally poor home backgrounds, histories 
of broken schooling, or even simple dullness. 
For example, McLeod (1965) may be criti- 
cized for the selection of both of his groups 
since he compared children from an Aus- 
tralian reading clinic (apparently without any 
screening of the individuals selected from the 
files) with a group of successful readers in 
England, who, because of differences in the 
educational systems, would have had a year’s 
longer tuition than the experimental group. 
Belmont and Birch (1966), although select- 
ing their samples from state schools, included 
in their retarded group 60 children who were 
under IQ 90, as against only one such normal 
reader, Not surprisingly, the discrepancy be- 
tween the two groups in Full Scale IQ was 
accompanied by significant differences in all 
but one of the subtests. Equating the groups 
for IQ resulted in two attenuated samples 
of only 22 each. 

But the major defect with all published 
studies is that they fail to define adequately 
the nature of reading retardation. The defini- 
tion proposed here is that reading retardation 
is a syndrome of learning disability in chil- 
dren who are bright enough to learn, and 
who have had the usual learning opportun- 
ities, There is obviously no point in inquiring 
why dull children, or children with sensory 
handicaps, or children who have missed 
schooling, have progressed less rapidly in 
reading than their fellows. 

The term “reading retardation" may in 
fact have four meanings: (a) slow reading 
speed, (5) limited vocabulary, (c) failure to 
master basic processes such as letter recog- 
nition and sound blending, and (d) lack of 
comprehension despite adequate fluency in 
the mechanics of reading. If d, termed 
“true dyslexia,” is ignored as a rare con- 
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dition,? it is c which should form the basis 
of the selection criteria; a and b can 
arise simply through lack of informal reading 
practice at the older age levels. These distinc- 
tions are ignored in all the studies reviewed. 
Thus McLeod (1965) set up as his criterion 
at least 14 yr. of retardation; but since his 
children were above CA 10} yr., many would 
presumably have been reading beyond RA 9 
yr., that is, at a level adequate for many 
practical purposes. In fact, some of his duller 
Ss would have been reading at about their 
MA level; but McLeod (1965) published 
no details of the reading achievements of 
his Ss. 

The question of whether retarded readers 
are also backward in the other basic sub- 
jects of spelling and arithmetic does not seem 
to have been investigated much since Burt’s 
(1961) investigation of poor school achievers. 
He found that such children tended to be 
weak in all school subjects; but this was to 
be expected since his Ss were mostly duller 
than average. Also, of course, reading retarda- 
tion would retard achievement in all other 
verbal subjects, and even in arithmetic in the 
later grades, when pupils are confronted with 
written problems, 


METHOD 
Subjects 


Two groups of children, 54 retarded readers (ex- 
perimentals) and 54 adequate achievers (controls) 
were selected from primary schools in Sydney. Each 
group consisted of nine Ss from each of the six 
primary school grades, which range in CA from 
6 yr. to 12 yr. To ensure that the groups were 
matched as closely as possible for the important vari- 
ables of teaching method, curriculum, and socioeco- 
nomic class, both groups were drawn from the same 
classrooms in the same schools and at the same time 
of year. The schools were all situated in middle- 
class suburban areas to reduce the probability of 
selecting children whose poor achievement was due 
primarily to adverse conditions for home study. 
All Ss in both experimental and control groups 
were male. 

All children were tested by the two authors. Re- 
tarded readers were first identified through class 
teachers. Each S was then individually screened 0n 
tests of visual and auditory acuity, and on Schonell's 
Graded Word Reading Test which involves only 
the untimed oral reading of single words. Children 


?Only one such case, an epileptic, has ever been 
found by the authors. 
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who fitted the criteria for reading retardation were 
individually administered 11 subtests of the WISC, 
(Mazes omitted), those failing to achieve a pre- 
determined level of intelligence being excluded from 
the sample. (The criteria for reading retardation and 
WISC achievement are detailed in a later section). 
The selected sample was further given the Australian 
Council for Educational Research (A.C.E.R.) test 
of Mechanical Arithmetic (ie. of arithmetical cal- 
culation) and a spelling test which were both indi- 
vidually supervised for each child. 

Adequate achievers were first identified by rank 
in class at the last term examination. Where classes 
were divided into A and B streams on the basis 
of school achievement, the lower stream was chosen, 
since the retarded readers would also be there. 
Overall examination performance close to the median 
of the class was the criterion set for inclusion in 
the sample. Those selected in this way were screened 
first on the tests for sensory acuity, then the WISC, 
reading test, spelling test, and arithmetic test were 
all administered in the same way as for the experi- 


- mental group. 


Means for the controlled variables and for read- 
ing achievement are listed in Table 1. 


Criteria for Reading and Intelligence 


Failure to recognize single words, or to sound 
them phonetically, was the basic criterion of read- 
ing retardation, hence the choice of the oral reading 
test. The degree of retardation required for inclusion 
in the sample followed a sliding scale, that is, Grade 1: 
3 yr. retarded, Grade 2: 1 yr. retarded, Grade 3: 
1} yr. retarded, Grade 4: 2 yr. retarded, Grade 5: 
23 yr. retarded, and Grade 6: 3 yr. retarded. In 
fact, none of the Grade 1 children could read more 
than half a dozen two- or three-letter words; and 
none of the Grade 6 children was less than 33 yr. 
retarded in reading. The sample could therefore be 
described as a group of very retarded readers who 
manifested learning difficulties in the basic reading 
processes. 

The criteria for selection on the basis of intelli- 
Bence posed a problem. One of the aims of the study 
Was to examine the amount of scatter between verbal 
and nonverbal subtests, while the a priori defini- 


TABLE 1 


SAMPLING DATA FOR EXPERIMENTAL 
AND CONTROL GROUPS 


Variable Experimentals Controls 

CA (yr.) 

M 9.54 9.23 
"n : 1.98 1.88 

eading age (yr.) 

M 6.94 8.98 

SD 85 1.52 
Socio'class 

M 341 3.59 

SD 1.61 1.67 


Note.—N = 54 in each group. 
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tion of reading retardation demanded children of 
at least average intelligence. Obviously, to insist 
upon an average Full Scale IQ would be to limit 
the amount of possible scatter, for large differences 
between Verbal (V) and Performance (P) Scales 
would result in below-average scores on the com- 
bined scale in some cases. As a compromise, it was 
decided to insist on an IQ of at least 90 on either 
the V or P Scale, but not necessarily on both com- 
bined, so that children with widely discrepant scores 
could be included in the sample. As it eventuated, 
6 of the 54 experimentals and one of the controls, 
had Full Scale IQs below 90. The lowest of these 
was IQ 80. 


RESULTS 


The means of both samples were at about 
the middle of the average range of intelli- 
gence, the difference in mean Full Scale IQ 
being only 5.68 points, (Table 2). This small 
difference could hardly explain the great dis- 
crepancy in reading achievement between the 
groups. Nevertheless the difference proved to 
be statistically significant: for Full Scale IQ, 
t= 3.21, p< 01; for V IQ, £= 3.34, p< 
.01; and for P IQ, 7 = 1.81, df = 106, ns. 

Differences between groups for all standard 
deviations shown in Table 2 were also sig- 
nificant beyond the .01 level. 

Further investigation of WISC scatter con- 
sisted of an analysis of V — P IQ differ- 
ences, and also of analysis of individual sub- 
test differences. 


Differences between Verbal and. Performance 
7Q 

Distributions of V and P scale scores are 
listed in Table 3. Kruskal-Wallis one-way 
analyses of variance were carried out for the 
various pairs of distributions. For the V > 
P distributions, the ranks for the experi- 
mentals were significantly greater than for 
the controls (x? = 4.02, df = 1, p < .05). For 
the P >V distributions, ranks for the ex- 
perimentals were also greater than for the 
controls (x? = 4.55, df = 1, p < .05). Within 
the experimental group, however, there was 
no significant difference between V > P and 
PV (22.75, ns). In other words, the 
experimentals had greater V — P discrepan- 
cies than the controls in both directions, but 
the experimentals as a group were about 
evenly distributed in either direction. 

From Table 3 it appears that the V ~ P 
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TABLE 2 


IQ, MEANS, AND STANDARD DEVIATIONS FOR 
EXPERIMENTAL AND CONTROL GROUPS 
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TABLE 3 


FREQUENCY OF VERBAL IQ AND PERFORMANCE 
IQ DIFFERENCES 


Experimental Control 

Full scale 

M 100.85 106.53 

SD 11.22 6.62 
Verbal 

M 99.70 105.85 

SD 11.29 7.38 
Performance 

M 101.93 106.15 

SD 14.54 8.77 


index becomes discriminating at differences 
beyond about 15 points of IQ. However, 
further analysis shows this index to be an in- 
sensitive one, since the majority of retarded 
readers scatter below the test norms on sub- 
tests contained in both V and P scales. This 
would of course reduce the V — P difference 
scores. The number of Ss who in the experi- 
mental group had at least one subtest score 
which was one standard deviation or more 
below the WISC subtest mean, (i.e., a score 
of 7 or below) was tallied. Of the 43 Ss who 
had at least one subtest score of 7 or less, 
12 were entirely on V subtests, 11 were en- 
tirely on P subtests, and 20 were on both 
scales. This finding also disposes of any idea 
that there may have been two syndromes of 
reading retardation contained in the distri- 
butions, that is, one V > P syndrome and one 
P » V syndrome. 


v>P P»V 
Dieren 
ferences H " 
Control | Experi: | Control | Experi: 

14 10 2 5 9 

5-9 6 9 8 3 

10-14 rg 6 6 7 

15-19 2 4 3 2 

20-24 2 2 1 2 

25-29 0 1 0 2 

30-34 0 0 1 3 

35-39 0 1 0 0 

Total 27 25 24 28 


Note.—Zero differences: controls, 3; experimentals, 1. V = 
verbal IQ; P = performance IQ. 


Comparison of Subtests 


Means, standard deviations, and ż-test re- 
sults for the 11 WISC subtests are shown in 
Table 4. 

From Table 4 it is apparent that the IQ 
difference between the two groups was due 
principally to differences in the Information, 
Arithmetic, Coding, and to a lesser extent, 
the Vocabulary and Digit Span subtests. The 
failure to find a consistent V < P trend in 
the previous analysis was due mainly to the 
importance of the Coding subtest in dif- 
ferentiating between the two groups. 

Both McLeod (1965) and Neville (1961) 
explained the consistently found low Infor- 
mation score as an effect of reading disabil- 


TABLE 4 
MEANS AND STANDARD DEVIATIONS or WISC SUBTESTS 


Experimentals Controls 
Subtest 
2 SD x SD 
Information 9.74 2.74 11.59 7 Cam 
Comprehension 10.83 2.68 10.52 238 E 
Arithmetic 9.82 2.61 11.87 2.59 4.266" 
Hir vac ied AR 10.26 1.81 39. 
igit span ^ 4 10.02 2.48 2.25 

Vocabulary — 10.13 2.75 11.39 2.08 2.73" 
Picture completion 11.00 3.49 11.61 2.88 99 
Picture arrangement 10.67 2.84 10.79 2.66 25 
Blocks 10.67 2.81 10.61 2.22 01 
Object assembly 9.39 2.43 10.04 2.07 1.45 
Coding 9.50 2.93 11.30 2.15 3,64*** 
* 5 <.05, df = 106. 

** p < .01, df = 106. 

*P$5 «.001, df = 106. 
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ity rather than as a symptom. That is, they 
claimed that much of the information con- 
tained in the items may be gained through 
reading, so that the retarded reader is at a 
disadvantage on this subtest. However, if 
this were so, one would expect the discrep- 
ancy between the groups on this subtest to 
increase with school grade. That is to say, 
relevant information is unlikely to be gleaned 
from books in the first two grades of primary 
school (CA 6 yr. through to 7.9 yr.) when 
the children are in the early stages of learn- 
ing to read; but as reading becomes fluent 
in the later grades, children might be expected 
increasingly to read books for interest and 
pleasure, thus gaining information relevant 
to the subtest. The same claim and the same 
argument might be applied to the Vocabulary 
subtest. However, Table 5 shows no indica- 
tion of any trend toward increasing discrep- 
ancies either for Information or Vocabulary; 
the data do not warrant a test for significance 
of Grade X Group interactions. 

The difference between the groups on Cod- 
ing could also be explained as an effect of 
reading retardation, since the task involves 
the rapid reading and writing of a code. 
Again, if this were true, the gap should in- 
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TABLE 5 
Mean SUBTEST Scores ror SCHOOL GRADES 
Information Vocabulary 

pope 
grade : T 

Experi | Controls | Experi. | Controls 
1+2 9.89 11.22 10.11 11.61 
3+4 9.83 11.94 10.67 11.73 
5+6 9.45 11.33 9.61 10.83 


Note.—N = 18ineach group. 


crease with age, as the adequate achievers 
gain relatively more practice in reading and 
writing fluency. An analysis of variance was 
carried out to test whether the Groups X 
Grades interaction was significant; but an F 
test was unnecessary since the interaction 
mean square was smaller than that of the 
remainder. 

If it could be assumed that the retarded 
readers’ group and the adequate achievers’ 
group were representative samples, the find- 
ings in Table 3 could be taken as a valid 
indication of the differences between the 
groups. That the two groups should differ in 
Full Scale IQ is not surprising; since a down- 
ward scatter among certain of the subtests 


TABLE 6 


ADJUSTED MEANS, 


SIGNIFICANCE OF DIFFERENCES BETWEEN GROUPS, 


AND CORRELATIONS OF SUBTESTS WITH FULL Scare IQ 
Verbal subtest 
Group 
Information | Comprehension | Arithmetic Similarities Digits Vocabulary 

Experimentals 

? 10.17 11.188 10.26 10.46 9.14 10.49 
Controls 

E 11.10 10.178 11.47 9.89 9.82 11.03 

F 5.19* 4.92*R 7.52** 2.27 1.64 1.95 

r 579 AAT .499 .536 .254 484 

Performance subtest 
ots Aene Blocks Object Assembly Coding 

Experimental: 

T E 11.29 11.188 11.098 9.80 9.76 
Controls 

2 11.08 10.298 10.198 9.63 11.04 

F 145 4.65*R 4.36*R <1.00 6.72* 

r 347 .583 .541 .585 327 

Note.—R = reverse direction. 

b <.05. 


"x.t. 
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was postulated, an overall IQ difference was 
to have been expected. 

However, it could be argued that the ade- 
quate achievers’ group may have been sub- 
ject to sampling error, since the mean IQ 
was 6 points above the mean of the general 
population. This would not be a particularly 
good argument, first because the WISC was 
not standardized on an Australian population, 
and second, because it does not follow that 
average class achievement corresponds ex- 
actly to average IQ. But it is certainly true 
that adequate achievement was a somewhat 
elastic concept, based as it was upon rank 
order in class, rather than on an absolute 
Score. For this reason, covariance analyses, 
by which adjustments for mean IQ differ- 
ences could be made, were carried out. The 
adjusted means and significance of differ- 
ences are shown in Table 6. 

As the assumption of homogeneous vari- 
ances required for these tests was not met, 
a square-root transformation of the Full Scale 
IQ scores was undertaken, and this reduced 
the differences to acceptable limits. For all 
tests, the assumption of parallel regression 
lines was substantiated. 

It will be seen from Table 6 that, when Full 
Scale IQ differences are taken into account, 
the controls are superior only on the Infor- 
mation, Arithmetic, and Coding subtests. The 
experimentals are shown to have been sig- 
nificantly better than expected in view of 
their lower IQ on the Comprehension, Block 
Design, and Picture Arrangement subtests. 


Differences in Spelling 


Since the A.C.E.R. Spelling Test, the only 
available one with Australian norms, has a 
lower limit at Grade 3 level, this test was 
supplemented by an ad hoc list of words 


TABLE 7 
ANALYSIS OF VARIANCE: SPELLING TEST 


Source SS df MS F 
Between groups (A) 6,611.34 1 | 6,611.34 | 137.74* 
Between grades (B) 7,287.94 5 | 1,457.59 30.02* 

AXB 254.82 5 50.97 1.05 
Remainder 4,661.78 96 48.56 
Total 18,815.88 107 


*p « .001. 
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TABLE 8 
ANALYSIS OF VARIANCE: ARITHMETICAL COMPUTATION 


Source SS df MS F 
Between groups (A) 564.90 1| 56490 | 24.59* 
Between grades (B) 9,967.19 5 | 1,993.44 86.76* 

AXB 216.38 5 43.28 1.88 
Remainder 2,205.78 96 22.98 


Total 12,954.25 107 


* p « 001. 


drawn from the reading primers of Grades 
1 and 2. The analysis of variance shown in 
Table 7 was therefore carried out on the raw 
scores, that is, on the number of words cor- 
rectly spelled. The test was administered 
individually, and there was no time limit. 

The results shown in Table 7 show a 
highly significant difference in spelling 
achievement between the two groups in favor 
of the controls, and no Groups X Grades in- 
teraction effect. 


Differences in Arithmetic 


The A.C.E.R. test of Mechanical Arith- 
metic was administered individually to each 
child. This test is untimed, and consists solely 
of calculation sums, The reading of words 
is required for only one item, and this was 
read out to the Ss as necessary. The reading 
and writing of numbers was involved, but it 
was determined that all children could recog- 
nize numbers from 0 to 9. No child failed 
because of inability to read, but some of the 
experimental Ss failed on one or two items 
through a tendency to write number sequences 
in reverse, (eg, 137 written as 731). A 
number of the younger experimental Ss either 
did not recognize arithmetical signs, or did 
not understand subtraction, multiplication, 0r 
division processes. For the older ones, errors 
occurred mainly in calculation. 

Since the test norms have a lower limit at 
Grade 3 level, ad hoc items were added from 
elementary text books appropriate to Grades 
1 and 2. Raw scores were used in the analysis 
of variance in Table 8. 

The results in Table 8 show a highly sig- 
nificant difference between the computational 
achievement of retarded readers and adequate 
achievers, and no Grades x Groups interac: 
tion. 
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DISCUSSION 


The inferiority of the experimentals on the 
subtests of Vocabulary, Digit Span, and, 
more importantly, of Information, Arithmetic, 
and Coding may be referred to the factor 
analytic findings of Witkin, Dyk, Faterson, 
Goodenough, and Karp (1962). Analyzing 
the performance of 10-yr.-old boys, they 
found that Information and Vocabulary 
loaded highly on a factor of Verbal Com- 
prehension, whereas Arithmetic, Digit Span, 
and Coding loaded highly on a factor iden- 
tified as Attention and Concentration (though 
it seems that this could also have been con- 
sidered as a Memory factor). 

The hypotheses of McLeod (1965) and 
Neville (1961) that lower performances on 
the Information and Vocabulary subtests of 
the WISC are effects of reading retardation 
rather than correlated symptoms were not 
confirmed in the present investigation. The 
relevant analyses were based on the assump- 
tion that if they were effects, the discrep- 
ancies between the groups should become 
greater with age, owing to the increasing 
reading fluency of the controls. 

The hypothesis put forward by the present 
authors is that the WISC deficits are not 
effects, but symptoms correlated with reading 
disability, and primarily of a verbal nature. 
All of the significantly low subtests except 
Coding require verbalization; but it is pos- 
sible that the experimentals’ poorer perform- 
ance on that subtest reflect a failure to use 
an effective labeling strategy as a memory 
aid, resulting in increased time taken in 
checking the code key for each symbol. Simi- 
larly it is possible that their poorer perform- 
ance on Digits and Arithmetic was due to 
failure to use subvocal rehearsal as a strategy 
for keeping the items in mind. Whether these 
speculations are valid can of course be proved 
only by further experimentation; the reason 
they are put forward at all is that these same 
children were found in another, as yet un- 
published, part of the investigation, to have 
had histories of delayed speech development 
and articulation difficulty in infancy signifi- 
cantly more often than the controls. 

It is more difficult to account in these 
terms for the relatively good showing of the 


experimentals on the Comprehension subtest. 
Despite the negative findings of Neville 
(1961), McLeod (1965), and Belmont and 
Birch (1966), there is support for this find- 
ing from the earlier studies of Burks and 
Bruce (1955), Coleman and Rasof (1963), 
Dockrell (1960), and Robeck (1962). Some 
light may be thrown on this problem by refer- 
ence to the factor analytic study of Cohen 
(1959), who found two verbal factors, one 
of retained knowledge impressed by formal 
education, upon which Information and Arith- 
metic had high loadings, and one of informal 
verbal learning, upon which Comprehension 
loaded highly. It may also be noted that 
this subtest seems to require less verbal 
precision on the part of the testee than any 
of the other subtests. 

The findings that retarded readers did rela- 
tively well on Block Designs and Picture Ar- 
rangement are supported by Neville (1961). 
Other investigators have found other P sub- 
tests to favor retarded readers: Picture Com- 
pletion (McLeod, 1965) and Object Assem- 
bly (Belmont & Birch, 1966). 

The finding that retarded readers were so 
much poorer in simple arithmetical calcula- 
tion than adequate school achievers was some- 
what surprising. Certainly this failure cannot 
be explained as effects of reading disability, 
for no reading was involved. The authors can 
think of nothing in terms of differential cur- 
ricula or selection criteria which would have 
accounted for such a difference. In fact, a 
number of experimentals in Grades 4, 5, and 
6 were repeating grades because of reading 
disability, and hence would have been exposed 
to more than 1 yr.’s learning of arithmetic at 
that grade level. On this basis one might 
have expected a superiority in Arithmetic. 
This is then another example of failure in 
formal learning, and again the hypothesis 
concerning verbal strategies may be tenta- 
tively invoked, since verbal labeling of num- 
bers and processes is probably an important 
part of arithmetical calculation. Incidentally, 
this inferiority, as for reading, was present 
consistently from the earliest school grade. 

That retarded readers were greatly inferior | 
to controls in spelling is less surprising. In 
acquired brain injury, loss of function may 
be relatively specific; but in developmental 
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dysfunction, as reading retardation appears 
to be, it is to be expected that where there 
is an input disorder, there will be a corres- 
ponding output disorder; that is, where recog- 
nition (ie. reading) does not take place, 
neither can recall (ie., spelling) take place. 

Retarded readers, it seems, can generally 
be regarded as retarded learners in all the 
basic school subjects at all primary grade 
levels, though of course there may be indi- 
vidual exceptions to this rule. 
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CONTRIBUTION OF SELECTIVE POSITIVE REINFORCEMENT 
AND THERAPEUTIC INSTRUCTIONS TO SYSTEMATIC 
DESENSITIZATION THERAPY* 


HAROLD LEITENBERG, W. STEWART AGRAS, DAVID H. BARLOW 
AND DONALD C. OLIVEAU 


University of Vermont 


The contribution of therapeutically oriented instructions and verbal positive 
reinforcement to systematic densensitization therapy was investigated. Ss were 
30 female college students with severe fear of snakes. Following treatment, the 
increase in approach behavior was significantly greater for Group 1 (graded 
hierarchy, relaxation, reinforcement, and therapeutic intructions) than for 
Group 2 (graded hierarchy and relaxation only). Group 2 did not differ sig- 
nificantly from a no-treatment control group. These results pose certain prob- 
lems for reciprocal inhibition conceptions, and suggest that the effects of 
systematic desensitization cannot be attributed solely to the variables of a 


graded hierarchy plus relaxation. 


According to Wolpe (1958) and Wolpe 
and Lazarus (1966) the effects of systematic 
desensitization therapy can be explained by 
two variables: (a) the temporal association 
of a state of relaxation with scenes that evoke 
anxiety and (5) the presentation of these 
scenes in a graded hierarchy. Relaxation is 
assumed to inhibit anxiety and to become 
conditioned to the formerly anxiety-evoking 
stimuli after a sufficient number of pairings. 
When the patient is asked to imagine himself 
in a series of situations that progress from the 
least to the most anxiety evoking, each suc- 
ceeding item is presumed to elicit minimal 
amounts of anxiety which can be overcome 
by relaxation. Using severe fears of snakes 
and spiders as assumed prototypes of neurotic 
phobias, several investigators have confirmed 
the importance of combining relaxation with 
step-by-step exposure to the items in the 
hierarchy (Davison, 1968; Lomont & Ed- 
wards, 1967; Rachman, 1965). 

The question remains, however, whether 
this combination alone can account for the 
effects of systematic desensitization therapy 
or whether there are other contributing fac- 
tors. A review of the procedures used in sys- 
tematic desensitization (Wolpe, 1958; Wolpe 
& Lazarus, 1966) suggests that therapeu- 
tically oriented instructions and positive re- 

1This research was supported in part by a grant 


(MH-13651-01) from the National Institute of Men- 
tal Health, United States Public Health Service. 


inforcement of approach behavior may also 
be important factors. 

The usual therapeutic instructions indicate 
to the patient that a favorable outcome is 
likely and that gradual progress will occur 
during therapy. By themselves, such instruc- 
tions may be relatively ineffective (cf. Lang, 
Lazovik, & Reynolds, 1965) but when added 
to the other basic ingredients of the sys- 
tematic desensitization procedure they may 
increase the likelihood of a desirable out- 
come. In the area of hypnosis, Orne's research 
(1959) provides impressive evidence that Ss 
tend to fulfill explicit and implicit expecta- 
tions in a situation. 

In systematic desensitization, positive rein- 
forcement in the form of selective attention 
and praise may be provided for reports of 
no anxiety, for completion of items in a hier- 
archy, and for reports of improvements out- 
side the office. In an uncontrolled case report, 
Lazarus, Davison, and Polefka ( 1965) have 
previously argued that combining operant 
reinforcement procedures with desensitization 
procedures can facilitate the reduction of 
phobic avoidance behaviors. Truax (1966b) 
suggests that selective positive reinforcement 
is present in most forms of psychotherapy 
and may account for the similarity in out- 
comes often reported with different therapies. 
He has found (1966a) that response con- 
tingent reinforcement is present in Rogerian 
nondirective therapy. 
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The purpose of the present experiment is 
to investigate whether instructions and verbal 
positive reinforcement are factors contribut- 
ing to the success of desensitization therapy. 


METHOD 
Subjects 


Thirty female Ss were selected from the under- 
graduate student body at the University of Vermont 
as follows: Questionnaires were distributed to the 
undergraduates at their weekly dormitory meetings. 
They were told that the Departments of Psychiatry 
and Psychology were looking for volunteers for an 
experiment investigating different methods of psycho- 
therapy and the physiology of fear. They were then 
given a questionnaire asking them to rate their 
fear of nonpoisonous snakes on a 5-point scale. 
Following later evaluation of the questionnaire, all 
those students who noted that they were at least 
"definitely tense" at the sight of a nonpoisonous 
snake and who indicated a willingness to take part 
in the experiment were telephoned for an appoint- 
ment. Since very few males met the criteria, the 
study was restricted to female undergraduates, ages 
18-22. 


Pretest 


Before the pretest, Ss were interviewed to clarify 
their responses on the questionnaire and to gain 
some history about their fear of snakes. Possible 
Ss were then asked to enter a 15 X 10 ft. room and 
told: 


At the end of the room there is a harmless green 
snake securely enclosed in a glass cage. The pur- 
pose of this session is simply to find out how 
afraid you are of the snake. To do this, I shall 
ask you to try to do various things with the 
snake. At no time will I force you to do anything 
which you are too afraid to do. 


The glass cage was 6 X 10X 8 in. and was covered 
by wire grating. The green snake (North American 
colubrid snake, Opheodrys Aestivus) was small 
and thin, measuring about 1 ft. in length. The floor 
of the room was marked off at 15-, 10-, 5-, 4-, 3-, 
2-, and 1-ft, intervals from the cage. In order to 
insure some uniformity of activity, the snake was 
poked just before the assessment to provide some 
movement during the test. Approach to the snake 
was measured on a 14-point scale and the S qualified 
for the experiment by failing to reach Point 10, 
which consisted of reaching into the cage barehanded 
and touching the snake once. If the S qualified, she 
was matched with two other Ss who achieved similar 
scores and the three Ss comprised one block. Each 
S in this block was randomly assigned to one of 
three groups. The Es who administered the pre- and 
posttests did not know to which group the S was 
assigned. Similarly, the Es who performed the ther- 
apy were not aware of the pretreatment scores of 
the Ss on the scale. The Ss in Groups 1 and 2 
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were distributed equally between the two therapists, 
At the end of the pretest all Ss were told, *We are 
administering this test to a number of students and 
we will shortly decide which students will be selected 
for the experiment. The secretary will contact you.” 
They were all asked not to discuss the experiment 
with other students. 


Treatment 


Systematic desensitization was administered to 
Groups 1 and 2 with different instructions and dif- 
ferent reinforcement procedures. Group 1 (graded 
hierarchy, relaxation, therapeutic instructions, and 
positive reinforcement): Ss in this group were given 
therapeutically oriented instructions, the essentials 
of which were: 


During these sessions we are going to be using 
a therapy that has been found effective in curing 
various fears, such as fears of sankes. In order to 
gain more understanding of how it works we are 
going to be recording physiological responses. . . . 
The way this therapy works is that by learning 
to relax deeply while imagining these scenes you 
will become less afraid of them. This will be true 
not only in imagination but also in the real 
situation. 


After the initial instructions were completed and 
it appeared that S understood them, electrodes for 
monitoring galvanic skin response (GSR) were at- 
tached, and possible fears of being electrically 
shocked were allayed. GSR readings were taken 
throughout each entire session using a Grass Poly- 
graph, model 7, with a direct current of 50 w pass- 
ing between silver, silver chloride palm, and wrist 
electrodes. In Session 1 Ss were also told that the 
button resting near their right thumb was to be 
pressed when they felt any anxiety. Pressing the 
button marked the recording paper. 

In the first two sessions Ss were also trained in 
deep muscle relaxation using a 15-min. tape edited 
from a longer tape, originally made by Jacobson 
(1938). Ss were instructed to practice the relaxation 
procedure between these two sessions and during the 
second session the therapist inquired about the suc- 
cess and frequency of relaxation and answered any 
questions that S had. In the third session desensitiza- 
tion was begun. 

A hierarchy of 27 items containing scenes involv- 
ing snakes was used. A large number of items was 
employed to avoid big “gaps” in the hierarchy. This 
reduced the need for forming a subhierarchy of 
breaking down the item into more gradual steps 
for different S’s and kept the number of items that 
each S could be exposed to about the same. 

The following is an example of a subhierarchy. 
Item 4 in the hierarchy consisted of a scene in 
which S imagined himself standing 10 ft. from 4 
snake, and Item 5 consisted of a scene in which 5 
imagined standing 5 ft. from a snake. If after 20 
presentations S continued to report anxiety to Item 
5, then a subhierarchy was employed consisting of 
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scenes in which S imagined himself standing 9, 8, 
7, and 6 ft. from a snake. 

Before introducing the first item, Ss in Group 1 
were given the following instructions: 


I am going to ask you to visualize various scenes 
involving the snake. I want you to report any 
anxiety or disturbance caused by a scene by 
pressing the button. This procedure has been found 
io be successful in reducing fears not only in 
the process of visualization, but also in the real 
situation. 


This last statement was repeated until E was sure 
that S understood it. 

Desensitization was begun with Item 1. Each item 
was read exactly as written from the work sheet 
and 10 sec. were allowed to elapse before S was 
instructed to "stop visualizing that and go on re- 
laxing.” Relaxation was continued for 30 sec. at 
which time the item was again presented. If S did 
not signal anxiety for 2 successive presentations of 
a particular item, E proceeded to the next item: 
“Now I want you to imagine a different scene.” 
Tf, on the other hand, the S signaled anxiety, he was 
instructed to “stop imagining that and go on re- 
laxing,” and the same item was presented again. If 
anxiety was still aroused after 10 presentations, E 
dropped back to the previous item and presented 
that until it was imagined twice without anxiety, 
and then attempted the arousing item once again. 
The 30-sec. interval was kept constant between 
items. If, after 20 presentations of the same item 
either in 1 session or more, S still reported anxiety, 
a subhierarchy for that item was entered. All Ss 
who participated in the experiment were seen until 
either they completed the hierarchy or for 10 ses- 
sions (excluding the 2 sessions used for training in 
relaxation). Each session was half an hour in length. 

Verbal praise such as “very good” or you're doing 
fine” was given for any successful completion of an 
item, and at the end of the session progress was 
praised. Inquiries as to the clarity of visualization 
and depth of relaxation were frequently made. 

At the end of the third desensitization session, S 
was given the following instructions. 


As I told you at the beginning, this method has 
been useful in reducing fears. You have been pro- 
gressing well in the visualization of the snake. 
I would like you to have an opportunity today to 
practice with the live animal. The snake is in a 
cage in the next room. The secretary will show 
you in. You may do as you like for not more than 
15 minutes. Don’t do anything that you find 
frightening; this is simply practice with the real 
thing. 


The E inquired about success at the next session 
and any positive change was praised. The same in- 
structions were repeated at the end of Sessions 6 
and 9 if S had not completed the hierarchy by then. 

Group 2 (graded hierarchy and relaxation only): 
Ss in this group were treated in the same manner 
with the following exceptions: (a) The instructions 
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were designed to eliminate the expectation that 
therapeutic benefit might be present in the treatment. 
(b) Positive reinforcement was not given for com- 
pletion of an item on a hierarchy, nor was it given 
for reports of progress during the practice sessions. 
The essential instructions given Group 2 were the 
following: 


You are taking part in an experiment on the 
visualization (or imagining) of snakes. We are 
going to record your physiological response on this 
machine while you visualize these objects. Because 
we want you to be able to visualize each scene 
very clearly we are going to teach you to relax 
during this session and the next. In this way there 
will be a minimum of extraneous factors in our 
recordings. 


At the end of the third systematic desensitization 
session the rationale given for the practice session 
was: “I would like you to have an opportunity 
today to refresh your memory with the live animal 
in order to help you form a clear image.” 

Group 3 (no treatment): Ss in this group served 
as a control to determine the amount of improve- 
ment on the posttest in the absence of intervening 
treatment. After the other two Ss in each control 
S's block finished the treatment, the control S was 
contacted for the first time since the pretest. 


Posttest 


The Ss in Groups 1 and 2 were assessed as soon 
as possible after treatment was completed, The me- 
dian time was 3 days. The instructions given to all 
Ss were the same as those for the pretest. The Es 
administering the posttests were not aware of the 
conditions under which each S had been treated 
and the tests were conducted in such a way as to 
prevent the E from discovering in conversation 
prior to the test which Ss had been treated and 
which had not. One S from each group failed to 
complete the experiment. The data for these Ss were 
replaced according to Yates’ formula for missing 
data (Cochran & Cox, 1957). The less appealing 
alternative was to discard the data from two com- 
plete blocks of matched Ss. 

At the completion of the posttest, E asked ques- 
tions designed to determine what group the S had 
been in. Twenty-six out of 27 Ss were correctly 
identified, One of the Group 2 Ss explained the 
purpose of the experiment in terms more suitable 
for a Group 1 S. Her data, nevertheless, were in- 
cluded in the analysis. 


RESULTS 


Since there was some question as to whether 
the data met the interval scale requirements 
of parametric statistical analyses, both para- 
metric and nonparametric tests were carried 
out. In all instances both tests led to iden- 
tical conclusions, so that only the values of 
the parametric tests will be reported. 
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Table 1 presents the mean scores the Ss 
received on tests of their ability to look at 
the live snake, put their hand close to the 
snake, touch the snake, etc. These tests were 
given before and after treatment and higher 
scores indicate greater approach behavior to 
the snake. For example, a score of 10 means 
the S touched the snake momentarily, whereas 
a score of 6 means that the S looked down 
at the live snake, but refused to put her 
gloved hand in the cage to the point where 
the knuckle would be below the top of the 
cage. 

The Ss in the three groups were matched 
on the basis of their pretest scores and as 
can be seen the means on the pretest are 
very close together. The slight differences did 
not approach statistical significance (F = .88, 
df = 2/18, p > .05). 

On the posttest nine Ss in Group 1 (graded 
hierarchy, relaxation, therapeutic instructions, 
and positive reinforcement), six Ss in Group 
2 (graded hierarchy and relaxation), and only 
two Ss in Group 3, the control group, showed 
an increase in their approach scores. Differ- 
ence scores were obtained for each member 
of each group by subtracting the pretest 
score from the posttest score. An analysis of 
variance for randomized blocks was then per- 
formed comparing the mean difference scores 
for the three groups. This analysis yielded 
a significant F ratio (F — 6.96, df — 2/15, 
p < .01). (Three degress of freedom for the 
error mean square were lost because Yates’ 
formula for computing missing data was em- 
ployed.) A Duncan multiple-range test indi- 
cated that the mean difference score for Group 


TABLE 1 


MEANS AND STANDARD DEVIATIONS OF PRETEST, 
POSTTEST, AND DIFFERENCE SCORES 


Dee 
ween 
Pretest Posttest Pre- and 
Group Posttest 
M | SD| M | SD| M | SD 

Graded hierarchy, 


y, re- 
laxation, therapeutic | 6.40 | 2.17 | 8.61 | 2.35 | 2.21 |2. 
instructions, and P5 


positive reinforce- 

ment 

Graded hierarchy and 

relaxation only 6.20 | 2.66 | 7.18 | 2.64 | .98 | .92 
No treatment 6.60 | 1.96 | 6.80 | 1.75 | .19 | .92 
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1 was significantly greater than the mean dif- 
ference score for the control group (? < .01) 
as well as significantly greater than the mean 
difference score for Group 2 (p < .05). The 
mean difference score for Group 2, the group 
which received neither therapeutic instruc- 
tions nor selective positive reinforcement, did 
not differ significantly from the mean differ- 
ence score of the control group (? > .05). 

Although the difference score for Group 2 
did not differ significantly from the difference 
score of the control group, a within-Ss analy- 
sis indicated a significant increase from pre- 
to posttest scores for Group 2, ¢ (9) = 3.50, 
p< .005, but no such significant increase 
was noted for the control group, ¢ (9) = 
68, p > .05. 

Data collected during therapy sessions con- 
cerning Ss ability to progress through the 
hierarchy and her verbal and physiological 
reactions to items in the hierarchy are also 
of some interest. The mean number of hier- 
archy and branch items completed by Group 
1 was 26.55 while the mean number of items 
completed by Group 2 was 22.88. A ¢ test for 
matched samples indicated, however, that this 
difference was not statistically significant, 
t (8) = 1.34, p> .05. This trend is related 
to the finding that Ss in Group 1 had to 
have each item presented a fewer number of 
times than Ss in Group 2 before the criterion 
for going on to the next item was reached. 
(The criterion was two successive presenta- 
tions without a report of anxiety.) The mean 
was 4.44 for Group 1 and 10.16 for Group 2, 
t (8) = 1.15, $7.05. Although neither of 
these differences are statistically significant, 
they are in the expected direction. Reinforce- 
ment for successful visualization of an item 
and for completion of an item may have in- 
creased the number of items completed and 
decreased the frequency of successive anxiety 
reports. The GSR data did not yield any 
significant difference between experimental 
groups, but it was quite clear that there was 
No one-to-one correspondence between verbal 
reports of anxiety and the presence of a GSR. 
Within each group a ¢ test indicated that the 
percentage of item presentations which elicited 
an anxiety report was significantly greater 
than the percentage of item presentations 
which elicited a GSR (p< .05). This re- 
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sult is most likely due to adaptation of GSR 
activity following repeated presentations of 
an hierarchy item. Martin (1956) has pre- 
viously reported a decline in skin conduc- 
tance in successive free association segments 
of experimental interviews, and as Lacey 
(1959) points out “adaptation is a very com- 
mon and striking phenomenon in all auto- 
nomic responses [p. 166].” Another possible 
interpretation is that physiological indexes of 
anxiety extinguish more rapidly than do ver- 
bal indexes of anxiety. (Also, verbal report 
may have a lower reaction threshold, that is, 
be more “sensitive,” than GSR in cooperating 


Ss.) 


Discussion 


The results of the present study suggest 
that therapeutically oriented instructions and 
selective positive reinforcement can enhance 
the beneficial effects of systematic desensitiza- 
tion therapy. The instructions given to Group 
1 Ss indicated that the experiment would 
reduce their fears of snakes, while those 
given to Group 2 Ss indicated that the ex- 
periment was designed to learn about their 
physiological reactions while they imagined 
fearful stimuli. The therapist provided rein- 
forcement by remarking “good,” “excellent,” 
“you're doing fine" when an S in Group 1 
was able to visualize a scene without re- 
porting anxiety, when an item in the hierarchy 
was completed, and when S reported progress 
in approaching the snake during practice 
sessions. Although both experimental groups 
Showed a significant increase in their ability 
to approach the snake after treatment was 
completed, the magnitude of this increase was 
significantly greater for Group 1 (graded 
hierarchy, relaxation, reinforcement, and 
therapeutic instructions) than for Group 2 
(graded hierarchy and relaxation). In fact, 
Since Group 2 did not differ significantly from 
a control group which received only the pre- 
and posttests, it can be argued that either 
reinforcement or therapeutic instructions or 
their combination were major rather than 
minor contributors to the observed increase 
in approach behavior. 

Although the combined absence of these 
variables significantly reduced the effective- 
hess of the systematic desensitization pro- 
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cedure, no attempt was made in this study 
to determine the relative effects of instruc- 
tions and reinforcement in systematic desen- 
sitization. Presumably they are most effective 
when combined (cf. Ayllon & Azrin, 1964). 
This hypothesis is currently being tested in 
the authors' laboratory. 

If selective reinforcement and therapeu- 
tically oriented instructions of the kind used 
for Group 1 are customarily a part of the 
systematic desensitization procedure, and ex- 
amination of the detailed protocols provided 
by Wolpe and Lazarus (1966) suggest that 
they are, then the usual theoretical explana- 
tions of the effects of systematic desensitiza- 
tion are brought into question by these find- 
ings. Theoretical statements which stress 
almost exclusively the notion that relaxation 
physiologically inhibits anxiety and then be- 
comes conditioned to the formerly anxiety- 
arousing stimuli would seem to be incomplete. 
Leitenberg and Agras (in press) have argued 
that relaxation facilitates the neutralization 
of anxiety-arousing stimuli but does not be- 
come conditioned to these stimuli. Following 
successful desensitization therapy, formerly 
anxiety-arousing stimuli seldom evoke a re- 
laxed state—they simply no longer evoke 
anxiety. Relaxation may aid in the extinc- 
tion of anxiety but the authors’ findings sug- 
gest that therapeutic instructions and/or 
the step-by-step reinforcement procedure are 
largely responsible for the acquisition of 
adaptive behavior. ; 

The results of the present experiment 
should not come as a surprise. In their own 
right both instructions and reinforcement have 
previously been shown to have significant 
behavioral effects. In psychotherapy and other 
therapylike situations Ss show changes in 
behavior in accordance with what they have 
been led to expect by past experience, in- 
structions, and other placebo cues (e.g, Gold- 
stein & Shipman, 1966; Orne, 1959). And 
operant conditioning has been employed to 
reduce the frequency of various abnormal 
behavior patterns (e.g. Ayllon & CLR 
1962; Barlow, Agras, & Leitenberg, 1967; 
Ullman & Krasner, 1965). 

A possibly important aspect of the sys- 
tematic desensitization procedure, related to 
both instructions and reinforcement, is gen- 
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erally ignored in the literature. The precise 
instructions, an emphasis on behavioral meas- 
urement, and the structured design of system- 
atic desensitization define for the patient 
exactly what behaviors are of interest and 
provide clear evidence throughout therapy 
that these behaviors are indeed changing in 
a gradual and systematic manner. The pa- 
tient receives constant feedback from his own 
observations that the behavior in question 
can be changed in small steps, that he is 
being successful and is on the road to re- 
covery. These changes are made possible by 
the small requirements of the graded desen- 
sitization procedure, and the importance of 
these changes is suggested to the patient by 
initial instructions, as well as by reinforce- 
ment during therapy when the therapist 
praises progressive change. Such self-observed 
signs of improvement may account for much 
of the success of all graded behavioral thera- 
pies, not only systematic desensitization. This 
question remains to be experimentally in- 
vestigated. 
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In previous research, therapists’ A-B 


of Kentucky 


status has predicted differential therapist 


effectiveness with schizoid versus neurotic patients, but patients’ A-B status 


has not been studied. This study rel 
clinic patients (As, controls, and Bs) 


lated the A-B scores of 60 male college 
to therapist ratings of presenting symp- 


toms and patients’ own complaints and role expectancies. As exhibited turning 
against the self, intropunitive symptoms, whereas Bs showed extrapunitive 
tendencies, Role expectancy data suggested, surprisingly, that As enter therapy 


expecting to unburden themselves in 
Bs appear to seek rational guidance 
adjustive modes and role expectancies, 
than similarity, may partially mediate 


Recent research on therapist variables re- 
lated to the outcome of psychotherapy points 
to the critical nature of the so-called A-B 
variable (see the review by Carson, 1967). 
On the basis of their scores on a small num- 
ber of items taken from the Strong Voca- 
tional Interest Blank (the A-B scale), thera- 
pists categorized as “As” have been shown 
to achieve significantly better outcomes than 
“Bs” with schizophrenic patients (see re- 
views by Betz, 1962, 1967), whereas B ther- 
apists have performed better with neurotic 
Outpatients (McNair, Callahan, & Lorr, 
1962). Since the content of the A-B scale 
Ostensibly involves varying degrees of interest 
in selected activities of a manual, technical, 
or mechanical nature (As dislike these ac- 
tivities, Bs like them), the A-B variable, 
despite its promise, it little understood. 

Therapists’ A-B status was initially re- 
Ported to denote differences in “clinical 
Style” in the treatment of schizophrenic pa- 
tients in the Whitehorn-Betz studies (re- 
viewed by Betz, 1967), where the As ap- 
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an active, productive manner, whereas 
and correctives. With respect to these 
therapist-patient complementarity, rather 
the effectiveness of therapy interactions. 


peared to behave in an active and experiential 
manner whereas the Bs tended to be passive 
or didactic in their conduct of therapy. An- 
other correlate of A-B status appears to 
involve the  field-dependence-independence 
dimension (Witkin, 1962): Bs were signifi- 
cantly more field independent and less vari- 
able than As in studies using actual therapists 
(Pollack & Kiev, 1963) and college students 
(Shows & Carson, 1965). The hypothesis that’ 
the A-B scale measures, beyond therapeutic 
“effectiveness,” important dimensions of per- 
ceptual, cognitive, or communicative styles 
requires further examination. 

The basic hypothesis implied by the origi- 
nal clinical studies, that there is an inter- 
action between therapist type (A or B) and 
patient type (schizoid or neurotic), has re- 
ceived surprising support in several therapy 
analogue studies, using untrained male under- 
graduates as subject therapists (Carson, Har- 
den, & Shows, 1964; Sandler, 1965). In these 
studies, the patient-type stimulus materials 
were usually adapted from Phillips and Rabi- 
novitch’s (1958) “avoidance of others” 
(AVOS) and “turning against the self” (TAS) 
symptom clusters as prototypic of schizoid 
and neurotic modes of adjustment, respec- 
tively. Results yielding significant Thera- 
pist X Patient Type interaction effects, but 
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opposite to the direction predicted by clini- 
cal studies, have been obtained when the 
amount of interaction between the partici- 
pants has been attenuated, for example, by 
asking As and Bs to respond evaluatively to 
tape-recorded or written stimulus materials 
descriptive of the AVOS and TAS clusters 
(Carson & Klein, 1965; Kemp, 1966; Kemp 
& Sherman, 1965). 

What variables mediate the apparent po- 
tency of A x AVOS and B x TAS pairings? 
Initial speculations have included reference 
to the possibility that sex role or social class 
related variables may be involved (Lorr & 
McNair, 1966). However, when Sandler 
(1965) asked male undergraduates to write 
out their typical reactions to stress, he found 
that the statements of As were significantly 
more often consistent with a trusting, intro- 
punitive, collaborative mode of adjustment, 
whereas Bs more often described themselves 
in suspicious, extrapunitive, avoidant terms. 
In other words, As and Bs resembled, in their 
respective modes of adjustment, those pa- 
tients with whom they would be presumably 
less likely to be effective in actual therapy 
interactions. 

Systematic differences in mode of stress 
adjustment, if reliably related to the A-B 
variable, should be highlighted within an ac- 
tually disturbed population, that is, among 
patients rather than undergraduate Ss. The 
present study was designed to examine the 
relationship of patients A-B status to (a) 
therapist ratings of patient symptomatology 
evident in the first interview, (5) patients’ 
own presenting complaints, and (c) patients" 
role expectancies regarding psychotherapy. 
The latter variable was selected to gain per- 
spective on how the A or B patient con- 
strues his own and the therapist's probable 
behaviors prior to treatment. 

Although this is the first time that the 
A-B scores of patients have been related to 
their own symptomatology and role expec- 
tancies, from Sandler's data, as previously 
cited, one would expect As to show symptoms 
consistent with a TAS (turning against the 
self, intropunitive), and the Bs, with an AVOS 
(avoidance of others, extrapunitive) mode of 
adjustment. The role-expectancy measures 


J. I. Berzins, W. H. FRIEDMAN, AND E. SEIDMAN 


were expected to extend the known correlates 
of the A-B variable in a therapy-specific area, 


METHOD 


Patient and therapist samples. A battery of pre- 
therapy measures (see the following section) was 
filled out by 68 consecutive male patients prior to 
being assigned to eight therapists for initial inter- 
views at the Student Health Service of Indiana Uni- 
versity. The therapists comprised five psychiatrists, 
one psychologist, and two social workers. They in- 
terviewed from 2 to 16 patients each; four therapists 
accounted for 45 patients rated. No therapist was 
aware of the purpose to which his ratings were 
to be put. 

Measures. The A-B scale version filled out by 
patients was a 21-item scale (comprising 15 items 
from the Strong Vocational Interest Blank and 6 
items from the MMPI) resulting from a recent cross- 
validation 2 of the Kemp modification (Kemp, 1966) 
of the original Whitehorn-Betz scale. It was scored 
in the A direction (high scores yield A status, low 
scores yield B status). The A-B scale mean for the 
entire group was 12.02, with scores ranging from 
5 to 19, suggesting that the lower (B) scale points 
may be somewhat underrepresented in this sam- 
ple. Eight patients at the cutting points were ran- 
domly eliminated to yield two extreme groups (As 
and Bs) and a midmost group (ABs or controls) 
of 20 patients each. The As were defined as those 
Ss having scores between 14 and 19, Bs—scores 
between 5 and 9, and controls—scores between 10 
and 13, inclusive. 

The primary measure of presenting symptomatol- 
ogy was a therapist-rated checklist (15 items)" 
obtained immediately following the initial interview. 
Each item was followed by a 5-point scale assessing 
the extent to which a given complaint was chara 
teristic of the patient (1 equals not at all; 3 equals 
moderately; 5 equals extremely). Three ratings were 
designed as AVOS criteria (withdrawn, hallucina- 
tions or delusions, externalizes anger) and three 8$ 
TAS criteria (depression, suicidal thoughts, inter- 
nalizes anger). The remaining ratings (degree D 
disturbance, prognosis, likability, rigidity, etc.) fun 
tioned as fillers. 

Patients’ own complaints were assessed by a Pf 
visional checklist (32 items including fillers) with 
S-point scales (1 equals not at all; 3 equals modet- 
ately; 5 equals extremely) asking patients how mUC 
a given problem “distressed, bothered, or worre 
you during the past week, including today.” The 
items of this checklist were loosely adapted {rom 
the Psychiatric Outpatient Mood Scales (MeNait 
Lorr, 1964) as well as the Mooney checklist (Mooney 
& Gordon, 1950) and included items related to tie 
TAS and AVOS symptom clusters. Sample 
items are: depressed, dejected, feel worthless, Ia 


2Robert C. Carson, personal communication 
March 1967. ined 

8 All measures used in this study may be obtain 
from the senior author upon request. 
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of interest; AVOS: awkward in meeting people, 
slow to get acquainted, few or no close friends. 
Patients expectancies regarding their first hour of 
psychotherapy were assessed by a revised version of 
a 27-item Psychotherapy Epectancy Scale or PEI, 
described by Berzins (1966). The provisional ver- 
sion of this scale used here (60 items, including 
fillers) asks patients to indicate how strongly they 
expect a given patient or therapist behavior to take 
place during the next (here, the first) hour of treat- 
ment (1 equals not at all; 4 equals moderately; 7 
equals very strongly). While ordinarily this scale is 
scored for four patient-role categories (Nurturant- 
Coercive, Critic, Self-Reliant, Cooperative), the 
validity of the scale for male patients is questionable 
at this stage, and therefore items are simply used 
as indications of self-reported expectancies among 
patients varying in A-B status. 


RESULTS 
Patients’ A-B Status and Symptoms 


The relation of patients’ A-B status to 
therapists’ ratings in the TAS and AVOS 
categories is summarized in Table 1. It is 
evident that patients scoring in the A direc- 
tion on the A-B scale are rated higher than 
ABs (controls) and Bs on the TAS criterion 
items of depression, suicidal ideation, and 
internalization of anger. This finding clearly 
Supports Sandler’s results on the stress reac- 
tions of undergraduates. 

Patients scoring in the B direction on the 
A-B scale, however, are not classifiable as 
Showing AVOS symptoms, excepting their 
ostensibly greater (than for As) tendency to 
externalize anger. It is apparent that the re- 
lation of A-B status to internalization-exter- 
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nalization of anger is not linear: the middle 
group (ABs) is rated as least internalizing 
(most externalizing) of the three groups; the 
regression for internalization of anger departs 
significantly from linearity (Fin = 5.99, p < 
.01), with a correlation ratio (eta) of .40. 

Turning to the patients’ own presenting 
complaints, as reflected in their checklist re- 
sponses prior to the first interview, Table 2 
reports the items on which Pearson :'s sig- 
nificant at or beyond .10 were found, 

The majority of complaints associated with 
A-B status confirm therapists! criterion rat- 
ings: a status appears related to feelings of 
depression and  worthlessness, tearfulness, 
apathy, a tendency to sleep instead of study- 
ing, and thoughts which cannot be confided, 
These symptoms are consistent with the TAS 
symptom cluster. The one item related to 
scoring in the B direction on the A-B scale 
(have keen sense of moral obligation) does 
not conform to the AVOS category, nor is its 
meaning clear. This item is megatively re- 
lated to therapists’ ratings for hallucinations 
and delusions (r = —.34, p < .01) and posi- 
tively related to good prognosis (r = +.27, 
p<.05), suggesting that this “complaint” 
may have an adaptive meaning. 

The lack of association between B status 
and the AVOS symptom cluster may partially 
result from a heterogeneity of referents for 
AVOS-like symptoms in the measures used. 
Extremely low means for therapist ratings of 
hallucinations and delusions (Table 1), for 


TABLE 1 
RELATIONSHIP BETWEEN PATIENTS’ A-B STATUS AND THERAPIST-RATED SYMPTOMS 


Therapist ratings rie ipe 
ae 
TAS criteria 
Depression TA 
E seno +.29** 
izes anger -F.27** (.40*) 
AVOS criteria Y fe) 
ithdrawn —12 
Hallucinations, delusions —.10 
ernalizes anger —.28** 


M 

As ABs Bs F 
3.30 2.74 2.30 6.38** 
2.40 1.89 1.60 SEY he 
4.05 3.16 3.65 5.440" 
2.35 2.21 2.65 <1.00 
1.10 1.21 1.25 «1.00 
145 2.05 1.95 2.54* 


Note.—f values hei d elsewhere are for two-tailed tests. One patient in the AB group was not rated, therefore N = 59. 
2 ‘This isa restos fatio (eta), calculated because of significant (p < .01) nonlinearity of regression. ‘All other correlations 


are Pearson r's. 
*? < 410, 
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TABLE 2 
RELATIONSHIP BETWEEN PATIENTS’ A-B STATUS AND SELF-REPORTED SYMPTOMS 
rrelation with B 
Abbreviated content patients’ A-B 
moe As ABs Bs F 
Feel depressed 1 -.38*** 4.05 3.95 3.15 5.13 
Feel worthless +.29"* 3.70 3.30 2.63 3.83** 
Often feel like crying +.28** 2.75 2.74 1,95 2.64* 
Lack of interest +.26** 3.30 2.90 2.35 1.92 
Sleep not study +.24* 3.40 2.75 2.53 1.93 
Can't confide thoughts +.24* 2.95 2.15 2.05 2.82* 
Have keen sense of moral obligation —.2 2.88 3.20 3.50 <1.00 


Note.—Due to omissions, N = 19 for three means, and N = 17 for one. 


*p < 10. 
** p < .05. 
wrk p < 01. 


example, suggest that the incidence of psy- 
choticlike symptoms was low in this sample 
of college clinic patients. On the other hand, 
patients’ self-reports in the AVOS category 
addressed themselves to more “benign” forms 
of AVOS-like problems (e.g., “awkward with 
other people,” “few or no close friends,” “slow 
to get acquainted,” etc.). In almost all cases 
involving AVOS-type complaints, B patients 
exceeded ABs and As, although the means 
did not reliably differentiate the groups. 
Therapists and patients do not appear to have 
addressed themselves to the same issues, as 
indicated by the differential validities (cor- 
relations between patients’ complaints and 
criterion ratings) of the TAS and AVOS cate- 
gories. The patient-therapist correspondences 
on TAS symptoms were satisfactory (7's 
ranged from +.24 to +.47, median +.34) 
while the coefficients for AVOS symptoms 
were consistently lower (7s ranged from —.07 
to +.32, median 4.16). 

In a number of instances, the association 
between A-B status and patients’ complaints 
significantly departed from linearity (p< 
.05). The roughly U-shaped functions call 
attention to the possibility that the middle of 
the A-B score distribution (ABs) may con- 
stitute a meaningful group in its own right, 
but the characteristics of ABs are not easily 
interpreted. Thus, while on some self-report 
items which dealt with TAS contents (Table 
2), ABs were intermediate between As and 
Bs, it was seen (Table 1) that therapists 
rated ABs as least internalizing of anger 
among the three groups. Curvilinear relation- 


ships between A-B status and patients’ com- 
plaints included the following: relative to As 
and Bs, ABs reported themselves as less easily 
influenced by other people, less influenced 
by others’ moods, less irritated by others, less 
prone toward escapism, and less unsure of 
others’ reactions, even though they com- 
plained more of insomnia (correlation ratios 
ranged from .25 to .41 for these items). But 
what at first might appear to be the “better 
adjustment” of ABs was not supported by 
two therapist ratings which designated ABs 
as having poorer prognoses and being less 
trusting than As and Bs (etas of .37 and .34, 
respectively). 

The expected association between patients’ 
A-B status and the TAS mode of adjustment 
appears to be generally supported in this 
sample, while the findings with regard to B 
and AB patients are unclear. 


Patients’ A-B Status and Psychotherapy 
Expectancies 


The correlations between patients’ A-B 
scores and their expectancies regarding ther- 
apy are given in Table 3, along with the 
means for the trichotomized sample. While 
no hypotheses were made with respect to €X 
pectancies, the findings regarding symptoms 
would suggest that As would display little 
energy in therapy interviews, if the clinical 
association of depression and anergy be taket 
as a guideline. That this is not the case 5 
evident in the upper portion of Table 3. P? 
tients scoring in the A direction on the AB 
scale appear to see the therapist as basically 


j 
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TABLE 3 
RELATIONSHIP BETWEEN PATIENTS’ A-B STATUS AND THEIR EXPECTANCIES REGARDING PSYCHOTHERAPY 


M 
Patient expectancy (How strongly do you expect...) | correlation’ | 
As ABs Bs F 

your therapist to mostly listen to you talk 

out your problems? +.40** 5.45 4.83 4.30 $43** 
to feel like opening up without any help 

from your therapist? T3759 5.26 4.83 4.00 3.62** 
to be the one who begins the talking? +.29%* 4.50 4.00 3.75 1.03 
to feel that it’s all right to talk about your 

feelings toward your therapist? +.26 5.80 5.50 5.00 1.73 
to “carry the ball” conversationally? +.26** 4.85 4.05 4.30 1.52 
to initiate the conversation? +.24* 4.45 4.28 3.75 1,21 
to lead the way in bringing up topics to 

talk about? T.22* 4.60 4.39 3.75 Ma: by) 
your therapist to level with you? —.33** 5.45 6.45 6.55 3.93** 
your therapist to clearly announce his value 

judgments about your behavior? —.22* 4.05 5.33 4.85 2.84* 
your therapist to find out what's the cause 

of your problems? —.22* 4.80 5.39 5.50 142 


Note.—It was belatedly discovered that two patients in the AB group did not complete this measure, therefore N = 58. 


^? 
min 

a good listener. In complement, they antici- 
pate exerting considerable initiative in begin- 
ning and maintaining the flow of conversation, 
and they expect to talk about their feelings. 
This emphasis appears to be changed for B 
patients (lower portion of Table 3). While, 
as with symptoms, few items are related to 
A-B scores in the B direction, B patients ap- 
pear to expect the therapist to “level with 
them," to make evaluative comments, and 
to be concerned with the etiology of the pa- 
tient's problems. 


DISCUSSION 


Individual differences on the A-B variable 
appear to have some relation to the present- 
ing complaints and therapy-related expectan- 
cies of college clinic patients. The results 
Supported the expected association between 
A status and the TAS mode of adjustment, 
and A patients also appeared to expect them- 


- Selves to play verbally active and productive 


Toles in treatment. With the exception of a 
tendency of B patients to externalize anger, 
the expected relationship between B status 
and the AVOS mode of adjustment was not 
clarified by this study. The B patients’ ex- 
Pectancies suggested that they anticipated 
encountering and dealing with a straightfor- 
ward, analytical, teacherlike figure. 


* Bur are Pearson r’s since no significant departures from linearity of regression were observed. 


The role-expectancy data appear to be 
somewhat consistent with the A-B differences 
in “clinical style” reported by Betz (1967) 
as characteristic of A and B therapists. Betz 
pointed out that B therapists tended to be 
either passive or instructional in their inter- 
actions with schizophrenic patients; in the 
present study, B patients appeared to en- 
visage a somewhat “didactic” interaction. The 
B patients’ expectancies regarding "structure" 
in psychotherapy are also congruent with the 
presumably high level of cognitive differen- 
tiation (greater field independence) of B 
therapists (Pollack & Kiev, 1963) and un- 
dergraduates (Shows & Carson, 1966). The 
A-type therapists, according to Betz, were ac- 
tively and experientially involved with their 
patients; here, A patients expected themselves 
to “carry the ball" conversationally and did 
not appear to expect much activity from the 
therapist. 

The unexpected aspect of the present re- 
sults regarding the A patient was that, despite 
presumably high levels of depressive suffer- 
ing, the A patient anticipated being quite 
vocal about his troubles! Speculatively, this 
tendency toward active, productive communi- 
cation, which seems incompatible with the 
clinical picture of depression, bespeaks a 
readiness to turn to other people in times of 
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distress. If Bs evidence such tendencies to a 
lesser extent than As, the failure to obtain 
a full replication of Sandler’s data on under- 
graduates might partially be due to variables 
which themselves are a function of A-B status, 
for example, an interpersonally avoidant mode 
of adjustment to stress. If these speculations 
have merit, then one would expect more As 
than Bs to come for psychotherapy (it will 
be recalled that no S scored within the lower 
five “B” points on the A-B scale), whereas 
one would not necessarily expect any dif- 
ferences in rates of volunteering for psycho- 
logical experiments (e.g., in Sandler’s study). 
Since it is generally acknowledged that neu- 
rotic patients often refer themselves for treat- 
ment, while schizoid or schizophrenic patients 
seldom do so, one might hypothesize that the 
A-B dimension, when interacting with levels 
of subjective distress, predicts differential 
rates of turning to other people for help, with 
As exceeding Bs in this regard. 

The actively approaching treatment styles 
of A therapists, reported by Betz (1967), 
thus would appear similar to the tendencies 
toward interpersonal approach inferred from 
A patients’ role expectancies. If such con- 
tinuities are genuine, then a most important 
hypothesis, raised by even a partial replica- 
tion of Sandler’s results, is that therapist- 
patient complementarity, rather than simi- 
larity, on the A-B variable may mediate the 
“effectiveness” results obtained in prior re- 
search. Swensen (1967) has also suggested 
from a differing point of departure that the 
A-B dimension may in part be an interper- 
sonal approach-avoidance dimension: A ther- 
apists’ approaching behaviors would mesh 
well with the needs of withdrawn schizoid 
patients, whereas the neurotic (TAS) patient 
may not require an especially approaching 
therapist. If A-type therapists, like A patients, 
show tendencies to turn against themselves 
under stress, the finding that A therapists 
perform less well with neurotic patients (Mc- 
Nair, Callahan, & Lorr, 1962) may be due to 
A therapists’ “blind spots” which interfere 
with therapeutic communication. Effective in- 
teraction would again be indexed by comple- 
mentarity rather than similarity. The argu- 
ment with respect to B-type therapists and 
patients is more tenuous but bears further 


J. I. Berzins, W. H. FRIEDMAN, AND E. SEIDMAN 


investigation. The extent to which personal 
therapy and professional training mitigate 
against such blind spots is of course an em- 
pirical question. 

Briefly put, the complementarity hypoth- 
esis leads to the prediction that A therapists 
would perform better with B patients, and 
B therapists—with A patients (than As with 
As and Bs with Bs), on both the approach- 
avoidance and blind spot bases. The improve- 
ments in outcome resulting from such match- 
ing may of course be slight. The data of the 
present study indicated only moderate re- 
lationships between A-B status and the vari- 
ables found associated with it. Also, the 
measures used were not behavioral and the 
AVOS category in particular may have been 
inadequately measured. Some of the curvi- 
linear relationships observed as a function of 
including the AB (middle) group, which has 
few precedents in work with this scale, sug- 
gests, however, that it would be desirable to 
include groups of AB therapists and patients 
in future studies. 

While the present study, thus, can only 
point toward promising further work, it 
should be pointed out that there is continu- 
ing evidence regarding intriguing continuities 
between diverse samples (ie. practicing 
therapists, college students, and, here, pa- 
tients) with respect to psychological variables 
associated with A-B status. 
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INCONSISTENT COMMUNICATIONS AND PSYCHOPATHOLOGY’ 


NANCY G. BEAKEL anp ALBERT MEHRABIAN 
University of California, Los Angeles 


The present study investigated the presence of possible incongruity between 
verbal and nonverbal components of parental messages to a disturbed child. 
Video-taped family interactions of two groups of families in which there were 
two degrees of psychopathology of an adolescent member were employed. There 
were two different measures of incongruity between attitude conveyed in 
verbalizations and posture. The findings did not support the hypothesized 
relationship between incongruity and degree of pathology. The results did 
however indicate a relationship between degree of negative attitude conveyed 
by parental messages and rated psychopathology of the adolescent in the 
family. It was found that parents of the more disturbed adolescents verbally 
communicate more negative attitudes toward the adolescents than parents of 


the less disturbed adolescents. 


The present study is concerned with in- 
congruity between the verbal and postural 
attitude communications of parents of dis- 
turbed children. A number of investigators 
have suggested that inconsistent attitude 
communications of parents contribute to the 
psychopathology of their children. While in- 
itial theorizing along these lines, as exempli- 
fied by double-bind theory, related inconsis- 
tent attitude communications of parents to 
schizophrenia in their offspring (e.g., Bate- 
son, Jackson, Haley, & Weakland, 1956; 
Weakland, 1961), investigators subsequently 
suggested that there is a correlation between 
the frequency of occurrence of inconsistent 
attitude communications in a family and the 
degree of psychopathology of the most dis- 
turbed member of that family (e.g., Haley, 
1963). In the present context, “communica- 
tion of attitude” refers to the degree of lik- 
ing, preference, or positive evaluation ex- 
pressed by one individual toward another. 
In addition to verbalizations (i.e., contents 
and tone of voice), facial expression, body 
position, or gesture can, for example, also 
convey attitudes. Thus, differences between 
attitudes conveyed simultaneously in differ- 


1This research was supported by National Insti- 
tute of Mental Health grants MH 08744 and MH 
13509. Special thanks are due Elliot Rodnick, Michael 
J. Goldstein, Edward Gould, and Sigrid McPherson 
for their ratings of adolescents! psychopathology. The 
cooperation of Daphne Bugenthal in providing an- 
chor audio and video-tape recordings is greatly 
appreciated. 


ent components of a communication can yield 
incongruous attitude communications. 

Assessment of the effects of consistent and 
inconsistent attitudes in content and tone of 
voice was studied by Mehrabian and Wiener 
(1967). Three degrees of attitude (i.e., posi- 
tive, neutral, and negative) in the tone of 
voice were each combined with three degrees 
of attitude in content (ie. meanings of 
words). They found that when vocal com- 
munication of attitude is inconsistent with 
verbal communication of attitude, normal 
addressees respond to the two-channel com- 
munication by subordinating the verbal con- 
tent to the vocal component. If, for example, 
the word “dear” is said in a tone of voice 
which is independently judged as communi- 
cating a negative attitude toward the ad- 
dressee, the consensual interpretation of the 
total communication is negative, at least for 
normal Ss who are addressees of inconsistent 
communications. 

The decoding of inconsistent communica 
tions was also investigated by Mehrabian an 
Ferris (1967). Their Ss rated all possible 
combinations of positive, neutral, and neg 
tive attitudes conveyed facially and vocally. 
Inferred attitude in this case was found t° 
be a linear function of the independent effects 
of the two components, with the facial com- 
ponent receiving 3/2 the weight received by 
the vocal component. Thus, the findings ° 
the Mehrabian and Wiener and Mebrabial 
and Ferris studies suggest that normal ad 
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dressees resolve inconsistency conveyed by 
various components of an attitudinal message 
by taking a weighted sum of the components. 
Both studies, therefore, do not provide sup- 
port for the double-bind theory assumption 
that inconsistent attitude communications 
cannot be resolved (i.e., are confusing for the 
addressee). 

The present study deals with a second as- 
pect of the double-bind theory, namely, the 
assumption that inconsistent communications 
of attitude are more frequent in the families 
of children who exhibit greater degrees of 
psychopathology. Thus, the present study 
was designed to explore the incongruity be- 
tween attitude communicated in verbalization 
and attitude communicated posturally for 
two groups of parents who differed in the 
degree of psychopathology manifested by 
their adolescent child. 

The communication of attitude via posture 
requires some consideration prior to the pres- 
entation of the study. Mehrabian (1968) 
hypothesized that a more open position, 
greater relaxation, and a more forward orien- 
tation of the body toward an addressee com- 
municates a more positive attitude. He 
found that a more forward orientation and 
greater relaxation communicate a more posi- 
tive attitude to an addressee. In addition he 
found that an open position conveys a posi- 
tive attitude only when the communicator is 
female. His results can be used to measure 
either the actual degree of positive-negative 
attitude communicated by the posture of a 
seated communicator, or to obtain an esti- 
mate of the kind of posture that an addressee 
expects provided he only hears the verbaliza- 
tion accompanying that posture. The latter, 
anticipated posture, when compared to the 
actual posture, provides one measure of in- 
congruity in communication. 

In the following study, attitudinal mes- 
Sages were taken from two groups of parents 
who differed in the degree of psychopathol- 
ogy manifested in their adolescent child. In- 
dependent judgments of attitude in verbal- 
ization and posture were obtained, on the 
basis of which measures of incongruity of 
attitude communication were computed. It 
was hypothesized that parents of the more 
disturbed adolescents exhibit a greater de- 
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gree of incongruity between their verbal and 
postural attitude communications to their 
children than parents of the less disturbed 
adolescents. 


METHOD 


Subjects. Eight Ss, four males and four females 
in the 20-30 age range were selected from volunteers. 

Materials. Verbalizations and postures were taken 
from among audio- and video-tape recordings of 
adolescents and their parents. The families were 
part of a sample of 21 adolescents and their parents 
who discussed a family problem stemming from 
the child’s disturbance. In no case were any of the 
participants aware of the recordings which were 
made through a one-way mirror. All family mem- 
bers were seated during the recording. 

Three clinical psychologists who were familiar 
with the individual problems represented in the en- 
tire sample ranked the adolescents concerned ac- 
cording to severity of pathology. The clinicians were 
requested to rank order the 21 adolescents from 
most severe to least severe without regard to diag- 
nostic classification. Kendall’s Coefficient of Con- 
cordance (Winer, 1962) indicated satisfactory agree- 
ment among the three judges (W=.85). Rank 
scores were averaged over the three judgments in 
order to select pairs of parents to serve in the ex- 
perimental groups. Parents of the 5 adolescents re- 
ceiving the lowest average rank scores were assigned 
to the more pathological group, while parents of 
the 5 receiving the highest average rank were as- 
signed to the less pathological group. 

For each parent of both pathology groups, four 
to six statements averaging approximately 35 words 
in length were selected from the audio portion of 
the family discussion tapes to be used as verbaliza- 
tions. In selecting these audio communications, the 
judge was presented only with the audio tapes and 
was instructed to select statements which satisfied 
the following criteria: (a) statements should be 
complete sentences, (b) statements should involve 
the parent speaker addressing the child directly, 
either by name or by use of the pronoun "you," 
and (c) the statements should concern some aspect 
of the child's behavior problem or personality. Due 
to the selectivity of these criteria, the number of 
statements obtained for each parent ranged from 
four to six. It should be noted that in selecting these 
verbal communications, the judge had no access to 
the nonverbal correlates of these communications 
and therefore could not have been influenced by the 
degree of consistency or inconsistency of the total 
communications. 

In the subsequent part of the selection of ma- 
terials, the judge identified the video portions cor- 
responding to the already selected verbal communi- 
cations. In those instances where there were more 
than four verbal communications for a parent, the 
judge selected the four communications in which 
the parent was maximally visible, and if there still 
were extras, randomly eliminated one or two of the 
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set to obtain only four video communications. In 
sum, these procedures yielded a final set of four 
audio-video communications for each parent. 

The total set of 80 communications consisted of 
four combined audio-video messages of each of 10 
mothers and 10 fathers. These communications were 
coded and randomized. 

Procedure. Judgments of attitude conveyed in 
verbalizations were obtained from four Ss who used 
a 7-point positive-negative attitude scale. The Ss 
could indicate the overall attitude expressed in each 
verbalization on a scale ranging from “extreme 
negative,” 1, to “extreme positive,” 7. Before presen- 
tation of the verbalizations, Ss were given four 
anchor stimuli and the use of the scale was ex- 
plained. 

The same four Ss subsequently rated video seg- 
ments on the same scale to provide “global judg- 
ments” .of attitude conveyed by posture. A practice 
session was given using four sample video-taped 
postures as anchors for the scale. In both practice 
and rating sessions, only that portion of the monitor 
screen showing the body of the parent speaker was 
exposed to the Ss. Other members of the family 
and the head of the parent speaker were masked by 
a black shield placed over the screen. The position 
of the child receiver relative to the parent speaker 
was indicated by a white “X” on the shield. This 
shielding device restricted judgments of attitude con- 
veyed by posture to positions of the body only. 

Four additional Ss provided “weighted judg- 
ments” of attitude conveyed by a posture antici- 
pated for a given verbalization, These Ss listened 
to the taped verbalizations. Their instructions were 
to imagine the posture assumed by the parent 
speaker of each verbalization in a situation in which 
the parent and the child are seated. For each ver- 
balization, Ss described the posture they imagined 
by selecting the appropriate descriptive word from 
each of the following four pairs: open-closed, tense- 
relaxed, forward-back, and toward-away from. “Open” 
was defined as a position of the limbs allowing 
access to the speaker's body and "closed" as a posi- 
tion denying access. “Forward” was defined as a 
forward inclination of the torso and "back" as any 
other position, including erect positions of the body. 
"Toward" was defined as a position of the body 
in which the speaker's chest is oriented toward the 
child receiver and "away from" as a position in 
which the speaker's chest is turned away from the 
child. “Tense” and “relaxed” were considered self- 
explanatory and were not specifically defined. A 
practice session was included in which drawings were 
used to explain the four pairs of body positions. 

This second set of four Ss subsequently viewed 
and provided weighted judgments of the video seg- 
ments. As in the case of the global judgments of 
posture, the monitor screen was shielded, affording 
Ss a view of the parent speaker’s body only. 


RESULTS 


The following set of measures were ob- 
tained for each of the 80 communications. 
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Judgment of attitude in verbalization: This 
was the rating of the degree of positive-nega- 
tive attitude communicated by the verbal 
component of each communication, as rated 
on a 7-point scale. 

Global judgment of attitude in posture; 
This was the rating of the degree of positive- 
negative attitude communicated by the pos- 
tural component of each communication, as 
rated on a 7-point scale. 

Weighted judgment of attitude in actual 
posture: This was the rating of the degree 
of positive-negative attitude communicated 
by actual posture, using a set of weightings 
for postural components taken from Mehra- 
bian’s (1968) study. 

Weighted judgment of attitude in antici- 
pated posture: This was the rating of the 
degree of positive-negative attitude in the 
posture which was imagined for a verbaliza- 
tion. These scores were computed on the 
basis of the weightings employed in the pre- 
ceding measure. 

Besides these four primary measures, two 
additional composite measures of incongruity 
were also computed for each of the 80 com- 
munications. 

Global incongruity score: To obtain à 
measure of incongruity between judgments of 
attitude in verbalization and global judg- 
ments of attitude in posture, the absolute 
difference between the two attitude scores 
was used. These absolute difference Scores 
ranged from zero to six. 

Weighted incongruity score: A second meas- 
ure of incongruity employed the weighted 
judgment of attitude from posture. This was 
the absolute difference between the weigh 
judgment of attitude in the actual posture 
accompanying a verbalization and of the at 
titude in posture anticipated for that verbal 
ization. j 

In the case of each of the above S5 
measures, the scores obtained for a give 
communication from each of four raters were 
averaged. 

Analysis of variance for reliability coeffi- 
cients (Winer, 1962) was computed for the 
four primary measures. For the judgments of 
attitude in verbalizations, r4 = .78; for glo- 
bal judgments of attitude in posture, = 
47. Weighted judgments of attitude in act! 
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posture yielded r4 = .63, and weighted judg- 
ments of attitude in anticipated posture 
yielded 74 = .48. 

Computation of a correlation matrix for 
all dependent variables yielded a significant 
positive correlation between global judgments 
of attitude in verbalizations and weighted 
judgments of attitude in anticipated posture 
(r = 36, p < .01). A significant positive cor- 
relation was also found between global and 
weighted judgments of attitude in actual 
posture (r = .40, p < .001). 

A 2 (Pathology) X 2 (Parent Sex) X 5 
(Individuals) x 4 (Communications) analy- 
sis of variance was performed on each of the 
following four dependent measures. In this 
design, individuals are nested under the 
pathology and sex conditions, and communi- 
cations are nested under pathology, sex, and 
individuals conditions. 

Analysis of variance of global incongruity 
scores indicates only one significant effect 
due to sex of parent (F — 5.21, df — 1/60, 
MS, = .28, p < .05). The mothers are more 
incongruent (1.24) than the fathers (.98). 

Analysis of variance of weighted incon- 
gruity scores indicates significance due to 
pathology group (F = 4.12, df = 1/60, MS. 
71.53, p < .05), sex of parent (F = 12.06, 
df = 1/60, MS, = 1.53, p < .01), and Path- 
ology Group X Sex of Parent (F — 4.12, df 
= 1/60, MS, = 1.53, p < .05). Comparisons 
of the four means corresponding to the Path- 
ology Group X Sex of Parent interaction ef- 
fect, using ¢ tests, indicates that the mothers 
of the less pathological group are more incon- 
gruent (4.63) than mothers of the more 
pathological group (3.50) and the fathers of 
both groups (3.08, each). The other three 
groups do not differ significantly among 
themselves. 

Analysis of variance of attitude scores 
based on judgments of attitude in verbaliza- 
tion indicates only one significant effect due 
to pathology group (F= 7.77, df = 1/60, 
MS, = 61, p<.01). Attitude communica- 
tions of the more pathological group (3.05) 
are less positive than attitude communica- 
tions of the less pathological group (3.54). 

Analysis of variance of attitude scores 
based on global judgments of attitude in pos- 
ture indicates no significant effects. 
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Discussion 


The results of the above experiment do 
not support the hypothesis that there is 
greater incongruity in the postural verbal 
communications of parents of more disturbed 
adolescents compared with the incongruity in 
the communications of parents of the less 
disturbed adolescents. Two different measures 
of incongruity were employed. For the first, 
global, incongruity measure which was based 
on direct judgments of the verbalizations and 
postures of the parents, there is no signifi- 
cant difference in the incongruity of commu- 
nications from parents of the more and less 
disturbed groups. For the second, weighted, 
incongruity measure which is based on antici- 
pated postures in comparison to actual pos- 
tures for a verbalization, the mothers of the 
less disturbed group of adolescents are found 
to show a greater amount of incongruity. In 
sum, the data fail to support the hypothesis 
and for one of the measures provide con- 
tradictory evidence for the communications 
of mothers. 

The correlations among the various de- 
pendent measures provide validational sup- 
port for the weighting of postural components 
employed in the present study. It will be 
recalled that only two significant correlations 
were obtained; one between judgments of 
attitude in verbalizations and weighted judg- 
ments of attitude in anticipated posture, and 
another between global and weighted judg- 
ments of attitude in actual posture. 

The second correlation indicates that the 
weighted judgments of attitude in posture 
are comparable to the global judgments of 
attitude. The first correlation indicates that 
the quality of anticipated posture in terms 
of its components is comparable in attitude 
to the quality of the verbalization upon 
which the anticipated posture is based. 

While the findings involving the incon- 
gruity measures are difficult to interpret in 
terms of double-bind conceptualization, the 
findings involving measures of degree of posi- 
tive-negative attitude communication can be 
interpreted. The parents of the more dis- 
turbed adolescents show more negative atti- 
tudes towards these adolescents (in their 
verbalizations, but not in their posture) than 
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parents of the less disturbed adolescents. An 
extrapolation of the multicomponent attitude 
communication model proposed by Mehra- 
bian and Ferris (1967) suggests that the 
total attitude communicated verbally and 
posturally is more negative for the parents 
of the more disturbed adolescents. If the 
total attitude communicated verbally and 
posturally can be considered to be represen- 
tative of the overall quality of attitude com- 
munications of these parents toward the 
adolescents involved, then the findings of the 
present study suggest a positive correlation 
between degree of negative attitude commu- 
nication of parents toward a child and degree 
of psychopathology of the child. 

The relationship between psychopathology 
of children and the negative attitude com- 
munications of their parents can be due to 
either, or both, of the following. The parents 
may have more negative attitudes toward 
these more disturbed children because the 
children create more problems for them than 
the less disturbed children do for their par- 
ents. Alternately, initially negative attitudes 
of the parents may have contributed to the 
psychopathology of the children. In either 
case, negative attitudes of parents could be 
considered to at least contribute to the 
maintainance of the children’s maladjusted 
patterns of functioning. In discussing their 
findings, Mehrabian and Wiener (1967) sug- 
gested that “it could be argued that unus- 
ually frequent negative attitude communi- 
cating messages do contribute to severe 
psychopathological functioning. . . . For ex- 
ample, indiscriminate negative reinforcement 
is not conducive to learning the numerous 
interpersonal and social skills which are lack- 
ing in individuals classed as schizophrenics 
[p. 114].” Rogers’ (1959) conceptualization 
of psychopathology also suggests a relation- 
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ship between negative attitude communica- 
tions of parents and psychopathology of their 
children. In his theory, greater psychopath- 
ology of a child is associated with greater 
degrees of “conditional positive regard” of 
parents toward the child. Conditional positive 
regard refers to the conditional quality of 
the love or liking of one person toward an- 
other, as would, one assumes, be manifest in 
the frequency of negative attitude communi- 
cations. In sum, the findings of the present 
study suggest that the study of the overall 
quality of positive-negative attitude rather 
than the incongruity in attitude communica- 
tion, may be a more helpful avenue for the 
investigation of the relationship between com- 
munication patterns and psychopathology. 
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PSYCHOLOGICAL DIFFERENTIATION AND PERCEPTUAL 
DEFENSE: 
STUDIES OF THE SEPARATION OF PERCEPTION FROM EMOTION * 


JAMES G. MINARD? 


Staie University of New York 
Downstate Medical Center, Brooklyn 


AND 


WILLIAM MOONEY 


Western Psychiatric Institute 
University of Pittsburgh 


Psychological differentiation (field independence, analytical thought) is as- 
sumed to involve separation of emotion from perception. Findings supported 
this assumption as well as previously criticized clinical investigations and an 
interpretation of perceptual defense. Perceptual measures of differentiation 
correlated (p < .025) with the degree to which emotion altered the probability 
of tachistoscopic recognition. Recognitions by poorly differentiated Ss included 
22.4% fewer emotional than neutral stimuli (p < .05). This type of perceptual 
defense correlated positively, as predicted, with psychological defense serving 
to blot out emotional experience. Verbal response bias, general accuracy, and 
verbal intelligence’ were controlled or unrelated to findings. Discussion empha- 
sized relevance to findings of certain research on the process of stimulus 
representation and on situations involving rapid response to fleeting stimuli. 


Both the critics and Es involved have 
recognized that contamination of variables is 
a significant problem in study of psychological 
differentiation; that is, study of behavior cor- 
related with the ability to be independent of 
a perceptual and social field and to think in 
an analytical way (Witkin, Dyk, Faterson, 
Goodenough, & Karp, 1962). Contamination 
(bias in the evaluation of one variable caused 
by information regarding another) has been 
a particularly difficult problem in testing the 
important assumption that more differenti- 
ated people are characterized by greater sepa- 
ration of their perception from their emotion 
(the separation assumption). 

Problems of contamination have not come 
from indicators of differentiation, since these 
have included objective perceptual tasks in 
which an S adjusted himself or a rod to a 


| This study was initiated and all measures of 
differentiation obtained in the Psychology Labora- 
tory, Department of Psychiatry. Research reported 
in this paper was supported in part by a grant 
(M-628) from the United States Public Health 
Service, National Institutes of Health. The writing 
of the paper was aided by the reading and com- 
ments of Kathleen M. Young, Robert Loiselle, John 
Dulaney, Gene Ingledue, and Donald Coleman. 

? Requests for reprints should be sent to James 
Minard who is now at the Western Psychiatric In- 
stitute, University of Pittsburgh Medical School, 
3811 O'Hara Street, Pittsburgh, Pennsylvania 15213. 
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vertical position or recognized a figure within 
an embedding context. Rather, the possibility 
of contamination has arisen when emotional 
perceptual separation has been evaluated by 
clinical procedures, since these procedures 
also permit the evaluation of differentiation. 
Though attempts have been made to reduce 
possible contamination, data such as that pro- 
vided by Rorschach tests, interviews, or diag- 
noses do not permit convincing elimination of 
information on differentiation. So long as clin- 
ical procedures are used, critical readers may 
be convinced differentiation could have con- 
taminated all other evaluations and may use 
this as a basis for rejecting much research 
on correlates of differentiation (e.g., Zigler, 
1963a, 1963b). 

Critical test of the separation assumption 
requires evaluation of emotional perceptual 
separation by a procedure which is not known 
to provide information on differentiation, The 
investigation reported here used an objective, 
restricted procedure: measurement of visual 
perceptual defense. Perceptual defense is de- 
fined as a description of the relationship be- 
tween stimulus emotionality and the ease of 
recognition of stimuli (Brown, 1961; Minard, 
1965). In the present study, perceptual de- 
fense was indicated by a difference in the 
probability of accurate responses attributed 
to the emotional significance of stimuli. The 
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accurate responses were forced choices made 
as words were repeatedly presented at 
a threshold level. The perceptual defense 
difference involved ePa, the probability of 
accuracy when the presented word was emo- 
tion arousing, and mPa, the corresponding 
probability when the word was nonemotion 
arousing, but was otherwise similar: per- 
ceptual defense = mPa — ePa. 

The possibility that this difference was 
caused by nonperceptual variables was ex- 
perimentally and statistically controlled by 
previously developed techniques (Minard, 
Bailey, & Wertheimer, 1965). These tech- 
niques and the selection of emotional stimuli 
are described in the Method section. 

Differentiation was evaluated by three fre- 
quently used perceptual (or at least non- 
verbal) tasks (Witkin, 1965). In each task 
it was advantageous to S to have ready access 
to a stable representation of some aspect of 
his previous experience, an aspect such as the 
feel of being vertical or the shape of a visual 
stimulus. For the Body Adjustment Test 
(BAT) he adjusted his body to the gravita- 
tional vertical while in a tilted room. For the 
Draw a Person (DAP) rating scale, he drew 
a human figure of each sex and was evaluated 
on the basis of the sophistication of body 
concept indicated by his drawings? For the 
Embedded Figures Test (EFT) he recog- 
nized previously presented geometric figures 
when they were included as integral parts 
of patterns. 

One hypothesis clearly follows from the 
separation assumption and the definition of 
perceptual defense: The less the differentia- 
tion of an S, the greater the effect of emotion 
on perception, and the greater the absolute 
nPa — ePa difference. Because the separation 
assumption does not specify whether emotion 
facilitates or disrupts perception, the hypothe- 
sis refers only to the absolute zPa — ePa dif- 
ference and not to its direction. 


8 Drawings were rated by Hanna Marlens, who 
developed the scale and participated in study of its 
reliability and validity (Witkin et al., 1962). She saw 
only the drawings and was not otherwise informed 
about this study. Other differentiation measures were 
administered and scored by Judy Birnbaum and 
William Fishbein. 
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Additional hypotheses regarding the direc- 
tion of this difference can be based on previ- 
ous clinical investigations (Witkin, 1965) and 
on the assumption that perceptual and psycho- 
logical defenses are related (Brown, 1961, pp. 
68-72). Well-differentiated Ss are thought to 
“isolate,” separating experience from affect, 
so they may recognize emotional and neutral 
stimuli with equal ease, suggesting that emo- 
tional perceptual separation can be great 
enough to eliminate the effect of emotion on 
the recognition of stimuli. Poorly differenti- 
ated Ss are thought to “blot out” emotionally 
significant experience, so they may show less 
accuracy in responding to emotional stimuli 
than in responding to more neutral stimuli. 
Such a result would suggest that, when emo- 
tional perceptual separation is too slight, emo- 
tion disrupts stimulus recognition. The Ss 
who are average in psychological differentia- 
tio have not been studied enough to 
allow prediction of their perceptual defense. 
According to the separation assumption, they 
have an intermediate separation of perception 
from emotion, but it is not known whether 
this will disrupt or facilitate the process of 
stimulus recognition. 

Description of two studies follows. The first 
used perceptual defense and the three mea- 
sures of differentiation to test hypotheses im- 
plied by the separation assumption, previous 
clinical investigations, and the assumption 
that perceptual and psychological defenses are 
related. The second more directly tested 
the assumed relationship of perceptual and 
psychological defense. The Discussion sec 
tion considers the numerous applications of 
the concept of emotional perceptual sepa 
ration, and theoretical interpretation of its 
occurrence. 


Stupy I 
Method 


The Ss were 30 Brooklyn College males between 18 
and 25 yr. of age. They volunteered for a paid «per- 
ceptual experiment" announced in their psychology 
classes and at the student employment office. Pre- 
viously obtained BAT norms indicated the sample 
included both well-differentiated and poorly differ 
entiated Ss. 

Order of test administration functioned 935 a 
random variable since BAT, EFT, and DAP wer 
administered at times determined, not by 95 cb 
characteristic, but by the availability of reseat 
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assistance and apparatus. Two experimental sessions 
were required for perceptual defense evaluation. Dur- 
ing one session BAT and EFT were given. During 
the other session duration of the two testing periods 
was equalized by administering DAP and the Infor- 
mation and Comprehension subtests of the Wechsler 
Adult Intelligence Scale (WAIS). The Es evaluating 
differentiation had nothing to do with the evaluation 
of perceptual defense and did not know perceptual 
defense scores. 

Perceptual defense. Methods of perceptual defense 
measurement, and their rationale, have previously 
been described (Minard, 1965; Minard et al., 1965). 
During the first of the two sessions, each S was given 
an individually administered word-association test 
(WAT) containing 42 potentially emotional words 
selected with the aid of judges familiar with the 
college population. The judges’ task was to select 
words which were quite familiar yet potentially 
emotion arousing (e.g., adjust, afraid, child, dance, 
date, defect, dependent, dumb, failure, love, school, 
suck, sin, wish, etc.). For each word, there was a 
presumably neutral word matched as closely as pos- 
sible (e.g., sit matched with sin). First letter and 
length were always identical for paired words and 
they were matched in frequency according to the 
Thorndike-Lorge count (1944). Matched words had 
as many corresponding letters as were allowed by 
the Thorndike-Lorge list. Words were typed in capi- 
tals with an electric typewriter. Each was typed in 
the center of an index card (for a subsequent word- 
association test) and on the gray, architect’s tracing 
Paper used to provide a low-contrast background 
when stimuli were tachistoscopically presented. The 
word-association test was administered individually 
with the instructions “say the first thing which comes 
to mind, as soon as it comes to mind, when you see 
the word on the card.” Cards were presented at 
5- to 10-sec. intervals and responses conspicuously 
timed with a stopwatch. Four personally emotional 
words were selected from the list of potentially 
emotion arousing words. Each selected word had 
elicited an associative reaction time within the 
longest 8% of all times for that S, a time longer 
than the average time of other Ss on that word. If a 
word elicited evidence of emotional arousal such as 
blocking, stammering, long explanations, or sweating, 
it was chosen in preference to words similar on the 
basis of other criteria. The four selected words and 
the neutral words matched with them were arranged 
in alphabetical order to form the list used for that S 
in the subsequent perceptual defense experiment. 
Approximately 1 wk. later, E familiarized S with the 
selected words, using a slide projector to present 

em one at a time. Words remained on the screen 
Until identified and S was informed he would be 
asked to recite the list when it was completed. After 
S's recitation, E recited the list rapidly and asked S 
to repeat it. 

The list was then presented as E adjusted the 
tachistoscope to a threshold level; that is, a point 
peek allowed identification of the general shape or 
Tagments of the letters of a word whenever S was 
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especially attentive. As a check on the selected condi- 
tions, the words were presented once again to see if 
S could identify more than one and less than four 
words in a series of eight. Stimuli were repeatedly 
presented at this threshold level during the following 
measurement trials. Accuracy during these trials was 
between 36.7% and 45.7%, using 90% confidence 
limits determined by Student ¢. To permit correc- 
tions, a randomly selected half of the stimulus pairs 
were omitted and smudged blank slides added with- 
out S's knowledge. Each stimulus was presented 10 
times and 16 blanks were presented during the total 
of 56 presentations. Each S made a forced choice 
from stimuli on his list after each presentation. 
Other, less important procedural details have been 
described previously (Minard, 1965; Minard et al, 
1965). 

Accurate responses were used to compute the 
nPa-— ePa difference, the percentual defense score. 
Responses to blank slides were used to compute an 
nPg—ePg difference (the difference in response 
probabilities when responses are guesses), This was 
the verbal response bias score. Response bias and 
other information provided by the experiment al- 
lowed correction of perceptual defense for non- 
perceptual variables and more accurate measurement 
of emotional biasing of the probability of stimulus 
recognition.* 


* Corrections are illustrated with accurate neutral 
responses. In principle, corrections were the same 
for all stimulus-response categories. 

Correction must consider the probability of chance 
accuracy, response bias, and the tendence of an S to 
restrict his responses to a reasonable set of alterna- 
tives. If there is no response bias and S chooses 
freely from the list of eight, the expected number of 
chance accuracies when a neutral word is presented 
is equal to the number of guesses to neutral words 
(n) times the probability of chance accuracy (Pca): 
number of chance accuracies = (n) (Pca). 

However, although S's list contains eight words, 
the probability of chance accuracy is not neces- 
sarily 3. Only four words on S's list are presented and 
only these four can be accurate responses. By re- 
stricting his choices to the presented words, S may 
increase his probability of chance accuracy to 4. 
When S chooses from both the presented and the 
unpresented stimuli, his probability of chance ac- 
curacy is the joint probability of choice from the 
presented group (Ppg) and the probability of chance 
accuracy when such choices are made (4): 


Pca — (4) Pog. [1] 


The probability of choice from the presented group 
is empirically determined from responses to blank 
slides. 

The tendency to either choose or avoid neutral 
responses (response bias) will change the number 
of neutral responses accurate by chance. If neutrals 
are preferred, the number of chance accuracies will 
be increased; if they are avoided, it will be reduced. 
The change will be equal to the number of chance 
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Though the authors believe perceptual defense is 
sometimes due to verbal response bias, analysis indi- 
cated that in this study verbal response bias and 
other nonperceptual variables did not make a signifi- 
cant contribution to the relationship between per- 
ceptual defense and psychological differentiation. 

Differentiation. For EFT, S was shown a simple 
geometric figure, asked to describe it, then to locate 
it in a more complex design. After the first, practice, 
trial he was given 12 measurement trials with dif- 
ferent simple and complex figures. His score was 
the total number of seconds required to locate fig- 
ures. For BAT, S was seated in a chair facing the 
wall at the end of a small room. Wood molding and 
pictures clearly defined the wall’s horizontal and 
vertical aspects. Both room and chair could be tilted 
independently. During the six experimental trials, 
the room was always tilted 35° and the chair 22°. 
Every possible combination of tilts was presented, 


accuracies multiplied by a verbal response bias co- 
efficient (c). This number is added, algebraically, to 
(n) (Pca). 

The number of chance accuracies, corrected for 
response bias is, 


(n) (Pca) + Cc) (n) (Pca). [2] 


Here, c is the difference between the proportion of 
neutral responses when blanks are presented and .50 
—the proportion suggesting no response bias. If the 
difference is negative, algebraic addition will reduce 
the estimated number of chance accuracies. 

Finally, it is false to assume S is only guessing 
throughout the experiment, One must estimate the 
number of guesses to neutral words (7) because S's 
description of his responses, his accuracy, and the 
conditions for presentation assures one that he was 
sometimes accurately recognizing the presented stim- 
ulus and was not always accurate just by chance. 
The number of guesses is not known, but the number 
of wholly inaccurate responses (I) is. This number 
has been reduced by the number of responses accu- 
rate by chance, which is equal to (1) (Pca): 


I-—n-— (n)(Pca). [3] 
It is a matter of algebra to solve for 7, 
n—I/(1— Pca). [4] 


This allows the desired correction of the number of 
accurate neutral responses to obtain a more correct 
estimate of the number of recognitions. The cor- 
rected number of accurate neutral responses (NAc) 
equals the observed number of accurate neutral 
responses (NA) less the expected number of chance 
accuracies, corrected for responses bias by Formulas 
2 and 4: 


NAc — NA — ( (n) (Pea) — (c) (n) (Pca) ), 
simplitying 
NAc=NA—(1—c)(n)(Pca). [51 


Further information on the correction procedure 
has been provided elsewhere (Minard, 1964). 
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half with room and chair opposite in direction of 
tilt; half with tilts the same. On each trial, S in- 
structed E to adjust his chair to a position which 
seemed vertical to S. Vertical was carefully and re- 
peatedly defined as the gravitational vertical. The 
BAT score was the total degrees of error for all 
trials. Further information on EFT and BAT has 
been provided by Witkin et al. (1962) who used 
the two tests together with the rod-and-frame test 
as their "perceptual index" of differentiation. 

For the DAP rating scale, S was asked to “draw a 
person," then to "draw a person of the opposite 
sex." Drawings were used by the DAP rater to arrive 
at a rating based on a number of specified criteria. 
These criteria used observable characteristics of fig- 
ures rather than projective interpretations (e.g. one 
characteristic contributing to a low score is “objec- 
tively interchangeable male and female figures”). 
Criteria falling into the categories of “identity and 
sex differentiation,” “form level,” and “level of 
detailing” were used to assign Ss to defined points 
on a 5-point scale. More complete description of the 
scale and data regarding its reliability and validity 
have been reported previously (Witkin et al, 1962, 
p. 117). Reported interjudge correlations for DAP 
ranged from .83 to .92. Reported correlation with 
the perceptual index of differentiation was .50. In the 
present study, correlation of DAP rank with sum 
rank on BAT and EFT was .63 (p < .005, Pearson 
r). Correlation of BAT rank with EFT rank was 45 
(p < 01). 

In summary, differentiation was measured as in 
previous research. Evidence for the validity of its 
measurement is provided both by that research and 
by the significant intercorrelation of tests in the 
present sample. 


Results 


Results were consistent with the assump- 
tion that greater differentiation is associated 
with greater separation of perception from 
emotion. In evaluating differentiation, Ss Were 
ranked according to scores on BAT, EFT, 
and DAP; ranks were then averaged across 
the three tests for each S. In evaluating 
degree of separation between perception an 
emotion, Ss were ranked on the basis of their 
absolute nPa — ePa difference, using accurate- 
response probabilities algebraically correct 
to reduce the importance of nonperceptua 
variables. For convenience, this score will be 
termed degree of bias in recognition. 

Since high scores on BAT, EFT, and DAF 
indicate lack of differentiation, there show! 
have been a positive correlation betwee? their 
ranking (lack of differentiation) and dest 
of bias in recognition. The Pearson correlation 
of the two variables was in the prec 
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direction and significant (.41, p < .025). A 
Kendall rank-order correlation was significant 
at the same level and was unchanged by 
correcting for tied ranks (7 = .27). 

Next, the analysis considered the correla- 
tion between degree of bias in recognition and 
each measure of differentiation (Table 1). 
The correlation of DAP and degree of bias 
in recognition was also statistically signifi- 
cant as indicated by rank-order correlation 
(r = .38, p < .001). Neither correlational nor 
graphic comparisons of the relationship of 
degree of bias with the three measures of 
differentiation suggested differences between 
tests which were statistically significant or 
theoretically meaningful. 

Could the assumed separation of perception 
from emotion really be due to nonperceptual 
variables? An advantage of the perceptual 
defense task is that it permits objective in- 
vestigation of this question. If reduction of 
bias in recognition were due to verbal intel- 
ligence, a negative correlation between the 
WAIS subtests and degree of bias would have 
been found. Instead, summed WAIS scale 
score had a positive, statistically insignficant 
Pearson correlation with degree of bias 
(126). If the apparent degree of bias in 
Tecognition were really due to verbal bias in 
the responses used to report recognitions, then 
bias in recognition should have had a fairly 
high, positive correlation with degree of verbal 
response bias (Pg — ePg, absolute, based on 
Buesses in response to smudged blank slides). 
Instead, degree of verbal response bias and 
degree of bias in recognition had a correlation 
Which was small and negative (—.12). The 
general level of tachistoscopic accuracy corre- 
lated only —.04 with degree of bias in recog- 
nition, (Correlation of general accuracy with 


TABLE 1 


Positive CORRELATIONS BETWEEN LACK OF 
DIFFERENTIATION "AND DEGREE OF 
Emotionat Bras IN RECOGNITION 


Test 


DAP 
.41* 


BAT 
24 
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lack of differentiation was only —.005.) Un- 
less post hoc hypotheses are to be adopted, it 
must be concluded that the evidence of sepa- 
ration of perception from emotion was not due 
to nonperceptual variables or to the general 
level of accuracy. 

The direction of the uncorrected mPa — ePa 
difference was as expected for the well- 
differentiated and poorly differentiated third 
of the sample. Poorly differentiated Ss accu- 
rately identified fewer emotion-arousing than 
neutral words: of their accurate responses, 
61.2% were neutral and only 38.8% were ` 
emotional. The difference of 22.4% is signifi- 
cant (¢ = 2.08, p< .05). Well-differentiated 
Ss recognized about an equal number of neu- 
tral and emotional words (53.296 neutral and 
46.8% emotional). The 6.4% difference is 
not significant (¢ = .90). Average Ss showed 
a “vigilant” tendency to recognize more 
emotion-arousing than neutral words: 57.096 
of accurate responses were emotional and only 
43.0% were neutral, a difference of —14% 
(t = 2.44, p < .05, two-tailed). Even though 
Ss who were average in differentiation recog- 
nized the highest percentage of emotional 
words, they were average in terms of the 
absolute nPa — ePa difference, as predicted 
by the separation assumption. 

Statistics applied to all three group means 
supported the ¢ tests applied to individual 
means: analysis of variance showed the 
groups to differ significantly in perceptual 
defense (F = 4.99, p < .05). Duncan’s multi- 
ple-range test showed perceptual defense of 
average Ss differed significantly (5 < .05) 
from that of the least differentiated Ss. Dif- 
ferences between group means were not found 
for either general accuracy (F = 2.208) or 
verbal intelligence (F = .698). Verbal re- 
sponse bias did not differ significantly from 
group to group (F = 2.711). 

Algebraic corrections of the Pa — ePa dif- 
ference provide the most critical evidence 
regarding the contribution of nonperceptual 
variables to group differences in perceptual 
defense. These corrections consider more than 
simple verbal response bias and have previ- 
ously eliminated a substantial mPa — ePa dif- 
ference created by verbal conditioning 
(Minard et al., 1965). As the dashed line 
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in Figure 1 indicates, the effect of correcting 
perceptual defense results was very slight. 
Group differences in perceptual defense re- 
mained significant after correction (F = 5.919, 
p<.05). 


Discussion 


Results supported both the separation as- 
sumption and the definition of perceptual de- 
fense. As predicted, the less the differentiation 
the greater the effect of emotion on percep- 
tion, as indicated by an objectively measured 
perceptual defense task. Had this not been 
the case, the authors would have had reason 
to question not only the conception of psy- 
chological differentiation, but also the concep- 
tion that the absolute zPa — ePa difference 
indicates the degree to which emotion affects 
stimulus recognition. Because of the way in 
which emotional perceptual separation was 
measured here, contamination by clinical 
judgment can be rejected as an explanation 
of the results of this experiment and may be 
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questioned as an adequate explanation of the 
results of previous work. 

The zPa — ePa difference was 6.4% in 
well-differentiated Ss, —14% in Ss of average 
differentiation, and 22.4% in poorly differen- 
tiated Ss. These results supported hypotheses 
based on previous clinical studies and on the 
assumption that there is a relation between 
perceptual and psychological defense. If the 
separation assumption is accepted, perceptual 
defense results also suggest the hypothesis 
that a low level of emotional perceptual inter- 
action has little effect on recognition, an inter- 
mediate level of emotional perceptual inter- 
action increases the probability of recogni- 
tion, but too high a level of emotional percep- 
tual interaction disrupts recognition of emo- 
tional stimuli. The substantial Pa — ePa 
difference in poorly differentiated Ss, together 
with previous clinical reports that such Ss 
“blot out" emotional experience, suggested 
the desirability of a more direct test of the 
assumption that perceptual defense is corre- 
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PERCENT NEUTRAL RESPONSE 
(FOR ALL ACCURATE RESPONSES) 


40 
"DIFFERENTIATED" 
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40 


50 


PERCENT EMOTIONAL RESPONSE 
(FOR ALL ACCURATE RESPONSES) 


60 
"UNDIFFERENTIATED" 


Fic. 1. Correction for nonperceptual variables did not significantly change 


results demonstrated by the statistical analysis. (The percentages of accurate 
responses which were neutral or emotional, and their relationship to Ss' differ- 
entiation remains essentially as before.) 
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lated with increased reliance on psychological 
defenses such as denial and repression. 


Stupy IT 


In this test of the assumed relation between 
perceptual and psychological defense, an ap- 
propriately experienced clinician, uninformed 
regarding perceptual defense, evaluated col- 
lege males from whom another E obtained 
perceptual defense scores. The clinician’s pro- 
cedure allowed consideration of the poten- 
tially contaminating influence of Ss’ ability 
to cooperate. The perceptual defense pro- 
cedure allowed explicit consideration of the 
possible contribution of verbal response bias 
and general accuracy. It was predicted that 
perceptual defense would be correlated with 
greater reliance on defenses such as simple 
repression. 


Method 


Using the same procedures as in Study I, 33 
University of Pittsburgh males were obtained: and 
their perceptual defense determined. Each was seen 
twice by a psychoanalyst whose previous psychi- 
atric experience had included many evaluations of 
overtly normal college-age men. His 30-45-min. 
interviews were tape-recorded. All Ss were overtly 
Cooperative, but differed in their ability to respond 
to questions such as “describe yourself.” Immediate 
Impressions and tape recordings were used by the 
Interviewer to rank Ss on lack of cooperation during 
the interview as well as reliance on denial, avoid- 
ance, and simple repression. 


Results and Discussion 


As Table 2 shows, perceptual and psycho- 
logical (“ego”) defense proved correlated as 
Predicted. Psychological defense correlated 
45 with lack of cooperation (p < .005). 
However, lack of cooperation did not corre- 
late with perceptual defense. The statistically 
Insignificant correlation between lack of co- 
Operation and response bias may be large 
enough to warrant further investigation using 
4 perceptual defense procedure in which 
verbal response bias is encouraged. Prediction 
of correlation between psychological defense 
and the Pa — ePa difference proved to re- 
quire consideration of the direction of that 

ference. This finding emphasizes the fact 
that, although degree of emotional bias con- 
Tibutes to perceptual defense, it can not be 
equated with it, 
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TABLE 2 
CORRELATION BETWEEN PSYCHOLOGICAL 
AND PERCEPTUAL DEFENSE 
Lack of co- Psychological 
operation defense 
Perceptual defense 
Uncorrected .09 A1* 
Corrected .08 43* 
Response bias 19 10 
General accuracy —.01 —.06 
Pn-Pe, absolute .08 .03 


*p <01, 


GENERAL DISCUSSION 


No matter how convincing the. demonstra- 
tion that emotional perceptual separation is 
not dependent on contaminated clinical 
evaluations, no matter how relevant separa- 
tion may be to psychological defense, expla- 
nation of emotional perceptual separation 
would have to be sought in assumed artifacts 
if rational scientific explanations were not 
available, or at least potentially available. 

Brown (1961) has described at least six 
theoretical explanations of perceptual defense, 
each applicable to the interpretation of emo- 
tional perceptual separation. For example, 
two interpretations are based on assumed 
mechanisms of psychological defense (pp. 68- 
72). Though applicable, none of these in- 
terpretations is especially suited to data of 
the present experiment. Available interpreta- 
tions fail to distinguish clearly between emo- 
tionally biased recognition and purely verbal 
response bias. They give no explicit attention 
to the concept of psychological differentiation 
and little attention to current concepts of the 
process by which stimuli are represented. 

Progressive changes in the nonverbal rep- 
resentation of stimuli, accomplished during 
psychological differentiation, have been im- 
plied by theorists such as Hebb (1959, 1963) 
and Piaget (e.g., Flavell, 1963, p. 56). The 
authors’ research indicates that greater dif- 
ferentiation is accompanied by an increased 
tendency to form and maintain a stable rep- 
resentation of a naturally fixated stimulus. In 
this work the authors observed that, for un- 
differentiated Ss, natural fixation can some- 
times be enough to make a large, clearly visi- 
ble stimulus seem to disappear or fragment 
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for as much as 80% of a 1- or 2-min. fixation 
period. The typical correlation between a 
differentiation measure and the duration of 
stimulus disappearance is about the same as 
that between differentiation and the degree 
of emotional bias in recognition (e.g., Minard, 
1967; Minard & Batcher, 1967). It is as- 
sumed that lack of differentiation is accompa- 
nied by a lack of access to stable, well-main- 
tained stimulus representations. 

The assumed relationship between differen- 
tiation and access to stable stimulus repre- 
sentations may help explain why more dif- 
ferentiated Ss are better able to indicate the 
gravitational vertical, draw detailed human 
figures, and identify embedded figures under 
circumstances which cause less differentiated 
Ss to perform poorly. Stability of stimulus 
representations should be explored further by 
studies involving fixation of a wide variety of 
stimuli, fixation imposed by special appa- 
ratus, and by exploration of working hy- 
potheses suggested by the present results. 

The authors hypothesize that lack of sta- 
bility of stimulus representations may be one 
factor which increases the degree to which 
emotional arousal can alter the representa- 
tion of emotional stimuli. Emotional arousal 
at any time during the formation, storage, or 
retrieval of stimulus representations may alter 
them enough to affect their contribution to 
the task of recognition. In poorly differenti- 
ated Ss, marked alteration of representations 
of emotional stimuli may interfere with rec- 
ognition. In Ss of average differentiation, more 
moderate alteration of representations may 
actually facilitate recognition (e.g., by reduc- 
ing the importance of trivial differences be- 
tween presented and previously experienced 
stimuli). In well-differentiated Ss, the stabil- 
ity of stimulus representations may be so 
great that emotional arousal has no significant 
effect on them. Further refinement of this 
formulation calls for studies in which emo- 
tional arousal is experimentally manipulated 
while the stability of stimulus representation 
is continuously measured. To summarize, 
emotional perceptual separation is open to 
rational interpretation based on existing 
theory, but may prove better explained after 
theoretically oriented research on the rela- 
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tionship between differentiation and the char- 
acteristics of stimulus representations has 
been carried out. 

Emotional perceptual separation is impor- 
tant in interpreting relationships between 
differentiation and psychopathology. In these 
interpretations, the separation assumption is 
used to account for the strength and type of 
psychological defense. Witkin (1965) has re- 
viewed applications of the concept of differ- 
entiation to over 15 types of psychopathology 
and to the course and outcome of therapy. It 
would be truly unfortunate if such a range of 
applications involved an assumption based on 
contaminated clinical evaluations. In the 
present research, because of the techniques 
used, the restriction of data obtained, and 
restrictions on communication between re- 
searchers measuring different variables, con- 
tamination of evaluation can be rejected as an 
interpretation of this study. It may also be 
questioned as an interpretation of previous 
work. Though not regarded as successful by 
all critics, Witkin's group made serious ef- 
forts to prevent contamination. They may 
well have succeeded. 

Further research is needed with perceptual 
defense in samples involving a wider range of 
psychopathology. However, the present stud- 
ies of fairly healthy samples probably pro- 
vided a rather severe test of the hypothesis 
that perceptual and psychological defense 
can be related. As with measurement of dif- 
ferentiation, measurement of perceptual de- 
fense may contribute to conceptualization of 
psychopathology. For example, the view that 
blotting out of emotional experience involves 
a perceptual process is supported by correla- 
tion between perceptual defense and those 
psychological defenses which serve to elimi- 
nate emotional experience. Instead of being 
attributed to failure of recall, thought, 0T 
report, some “blotting out” might best be 
called, “failure to recognize.” Refinement of 
this conceptualization and better understand- 
ing of its implications may result from a more 
detailed study of the contribution of stimulus 
recognition to the behavior of a poorly dif- 
ferentiated S. r 

As described by researchers such as Condon 
and Ogston (1966) communication in 
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interview involves rapid synchronous presen- 
tation of more than one type of communica- 
tion. There is such rapid and frequent re- 
sponse to postural and gestural communica- 
tion that normal speakers and listeners ap- 
pear to show "precisely shared patterns of 
change." These are not evident to casual ob- 
servation, but become quite evident when 
behavior is observed with the aid of time- 
motion analysis of 48-frame-per-second film.* 
In a poorly differentiated S, perceptual de- 
fense against emotional stimuli might inter- 
fere with accurate recognition of critical as- 
pects of active communication, partly ac- 
counting for the superficial, dependent, anx- 
ious quality of his interactions with others. In 
an S of average differentiation, perceptual 
vigilance may enhance recognition and re- 
sponse to emotionally significant aspects of 
communication. This may help him to be the 
more responsive therapy patient and the more 
actively interacting therapist described by 
Witkin (1965) and by Pollack and Kiev 
(1963). In a well-differentiated S, his capac- 
ity for independence, as well as the organiza- 
tion and analysis shown by his thought, may 
be due in part to a lack of emotional bias in 
his recognition of stimuli critical to the analy- 
sis of experience, including the experience 
provided by social communication.? 

Tn conclusion, the separation assumption is 
open to rational interpretation, to objective 
measurement, to use in conceptualizing rela- 
tions between differentiation and behavior, 
and to support and application without de- 
pendence on potentially contaminated clinical 
evaluation. 


5 The description of communication is based on 
reports by Condon and Ogston (1966) and those 
they cite. It was checked by a discussion with Con- 
don and Ogston and by looking at their high-speed 
films, in which interviewer and S often look like 
two puppets manipulated by the same person. 

5 Other objective support for relationships between 
differentiation and defenses has been provided by 
Witkin et al. (1962) and is not reviewed here. The 
Supplementary relationship of the present study to 
len research has also been discussed by Witkin 

965). 
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A national sample of 181 transvestites replied to the Personality Research 
Form, an inventory which assesses 20 areas of normal personality functioning. 
A control sample of 62 Ss was also tested. In comparison with the controls 
and the test norms, the transvestites presented themselves as more controlled 
in impulse expression, more inhibited in interpersonal relations, less involved 
with other individuals, and more independent. 


Three major types of male individuals exist 
who, with some regularity, dress in clothes 
society considers appropriate to members of 
the opposite sex (Benjamin, 1966; Stoller, 
1968; Walinder, 1967); homosexuals, trans- 
sexuals, and transvestites. The transvestite is 
usually a heterosexual male who enjoys wear- 
ing feminine attire, and for whom cross- 
dressing is usually a habit of long standing. 
This report is the third of a series of papers 
dealing with the psychological characteristics 
of the male transvestite. 

Hardly anything is known scientifically 
about transvestites, as most reports deal- 
ing with these persons suffer from several 
limitations. In the first place, the word 
“transvestite” is frequently used to refer to 
members of any of the three groups described 
above. Second, practically all reports dealing 
with transvestites are single case studies. 
Large-scale studies tend to be summaries of 
cases previously published in the literature, 
rather than reports of original research of the 
investigator, based upon his own first-hand 
clinical or experimental experience with this 
population (Housden, 1965; Pauley, 1965). 
Finally, virtually all reported studies have 
dealt with psychiatrically disabled individu- 
als. The present research, however, investi- 
gated the characteristics of transvestites 
sampled from the community at large. The 
reader may be referred to the work of 
Lukianowicz (1959) for a comprehensive 


1 Requests for reprints should be sent to P. M. 
Bentler, Department of Psychology, University of 
California, Los Angeles, California. 


general summary of knowledge regarding 
transvestism. 

The authors recently published two studies 
dealing with the average personality profile 
which transvestites generated for a projective 
test (Bentler, Shearman, & Prince, 1969) and 
an objective test (Bentler & Prince, 1969). 
The first study reported the responses of 25 
transvestites to the' Holtzman Inkblot Test, 
while the second study was a report of the 
responses of 176 transvestites to a standard 
MMPI type psychopathological inventory. 
Both studies represented investigations of 
transvestites selected from the community 
without regard to their psychiatric status, and 
the results of these studies were consistent in 
noting that transvestites showed no gross 
psychiatric symptomatology. Although a pos- 
sible indication of thought disturbance was 
found in the projective study, these trends 
were not verified on the self-report inventory. 
The main result which was consistent in the 
two studies indicated that the transvestites 
were, on the average, slightly more controll 
in their impulses than a control or normative 
group. 

The previous studies have dealt quite 
specifically with the problem of psychiatric 
symptomatology. A separate but equally im- 
portant question to be answered is whether 
transvestites differ on the average from con- 
trol Ss on nonpsychiatric personality chara 
teristics. The purpose of this investigation 
to compare transvestites with control Ss on 4 
number of personality scales which have no 
particular bearing on abnormal symptomatol- 
ogy, but may well provide insight into thé 
nature of transvestic behavior patterns. 
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METHOD 


Two national samples of transvestites (TV) un- 
selected with respect to psychiatric status were ob- 
tained from the files of a magazine published for 
transvestites by the second author. The first group, 
a 1966 sample, consisted of 200 subscribers to the 
magazine selected from the first portion of the alpha- 
betical subscription list. The second group, the 1967 
sample, consisted of 125 subscribers selected from 
the second portion of that list. The Ss were sent a 
personality inventory by mail and were asked to 
cooperate anonymously in gathering scientific infor- 
mation on transvestism. 

The Personality Research Form (PRF, Jackson, 
1967), Form BB, was sent to all individuals.? This 
test is an inventory of personality scales designed to 
measure areas of normal functioning such as those 
described by Murray (1938). Twenty-two traits are 
assessed by the PRF, as indicated in Table 1. Each 
trait scale is composed of 20 true-false items, so that 
the whole inventory contains 440 items. 

The original plan of this study called for com- 
paring the responses of TV Ss directly with norma- 
tive data available for the inventory, but it became 
apparent that the average age of TV respondents 
(43 yr.) was greater by over 20 yr. than the age 
of the normative sample. Thus, a control group of 
62 participants of a University of California exten- 
sion course was obtained. These persons were 
"middle-manager" individuals sent by their companies 
to participate in sensitivity training with the goal 
of enhancing their business potential. This control 
sample, although not in any sense an ideal random 
sample of males in the community, showed approxi- 
mately the same age and education distribution as 
the transvestite sample. 


RESULTS 


The 115 TV Ss in the 1966 sample, 66 TV 
Ss from the 1967 sample, and all 62 Ss from 
the control sample returned usable data. 
Statistics were computed separately for the 
two TV samples. The means and standard 
deviations on all scales were remarkably simi- 
lar for the two TV samples, so the data were 
therefore combined for purposes of analysis. 
The left-hand column of Table 1 lists the 22 
PRF scales. The first 20 scales describe vari- 
ous aspects of personality functioning, while 
the last two scales—the Infrequency and De- 
sirability scales—are control scales designed 
to assess bias in responding. The columns 
labeled 1, 2, and 3 report the average score 
on the various scales obtained for three groups 


?The authors thank D. N. Jackson for making 
available his copyright test for distribution in the 
manner indicated prior to its publication. 


141 


of Ss. The first column presents the means 
for the 181 TV Ss; the second column reports 
the scale means obtained with normative data 
based on 1,029 Ss (norm); the third column 
represents the mean scores obtained by the 
control group (control). The differences be- 
tween TV and norm sample means on the 
various scales appear in Column 4, with a 
negative sign indicating that the norm mean 
was higher than the TV mean, while the ab- 
sence of a sign indicates the reverse. Col- 
umn 5 presents the differences between the 
TV and control sample means. Finally, the 
standard deviations for the TV and control Ss 
are presented in Columns 6 and 7. 

A discriminant function analysis comparing 
the control and TV samples showed that the 
two sets of means differed significantly. The 
F ratio (df = 22/220) was 5.45, significant 
at the .001 level, indicating the average scores 
obtained by TV Ss differ from those obtained 
by the controls. Similarly significant scale-by- 
scale differences were found in comparing TV 
Ss with the normative sample. 


Discussion 


Both TV and non-TV Ss showed acceptable 
degrees of purposeful responding, as indicated 
by the control scales. The Infrequency scale, 
which assesses implausible or pseudorandom 
rseponding shows that TV Ss respond more 
purposefully than either set of control Ss. The 
Desirability scale, which assesses the extent 
to which Ss present a favorable picture of 
themselves, indicates that TV Ss do not differ 
from controls. Thus, the data are sufficiently 
meaningful to warrant interpretation. 

Various groups of PRF scales consistently 
differentiate TV from non-TV Ss. One set of 
scales show that TV Ss are clearly more con- 
trolled in their impulses than are non-TV Ss. 
Thus, they obtained higher mean scores on 
the Harmavoidance and Order scales, and 
lower scores on the Change and Impulsivity 
scales, Harmavoidance refers to a tendency to 
avoid risks of bodily harm and that of not 
enjoying exciting activities which might in- 
volve danger. High scores on the Order scale 
indicate a concern with keeping personal ef- 
fects and surroundings neat and organized. 
On the other side, the relatively low Change 
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TABLE 1 
STATISTICAL DATA FOR PERSONALITY RESEARCH FORM SCALES 
(4) (5) (7) 
3) (6) 

Scale TU Nota | Contra M RP ies | TV SD | Conka 
1 Abasement 6.18 6.22 5.37 —.04 .81 2.83 243 
2 Achievement 12.51 12.58 14.65 —.07 ^ 3.52 3.01 
3 Afüliation 12.39 14.98 12.98 —2.59 4.46 3.93 
4 Aggression 4.83 7.93 7.10 —3.10 3.39 2.99 
5 Autonomy 10.71 8.62 9.94 2.09 3.26 3.07 
6 Change 10.81 11.74 12.39 —:93 3.08 2.45 
7 Cognitive Structure 10.69 10.90 10.66 =.21 3.41 3.09 
8 Defendence 8.26 8.76 7.69 —.50 2.58 2.54 
9 Dominance 8.81 11.07 13.15 —2.26 4.49 3.85 
10 Endurance 13.12 10.67 13.47 2.45 3,32 3.32 
11 Exhibition 8.01 10.83 10.31 —2.82 4.34 4.31 
12 Harmavoidance 12.35 7.46 9.97 4.89 3.98 ENS 
13 Impulsivity 7.84 9.78 8.34 —244 3.87 3.79 
14 Nurturance 11.60 12.68 12.69 —1.08 3.57 2.98 
15 Order 13.20 10.81 12.44 2.39 443 3.87 
16 Play 9.23 12.13 9.03 —2.90 3.66 3.02 
17 Sentience 14.44 15.24 14.90 —.80 3.03 3.28 
18 Social Recognition 8.89 11.90 10.89 —3.01 3.48 3.21 
19 Succorance 6.97 7.88 7.60 —.91 3.81 2.95 
20 Understanding 12.16 12.46 12.52 —.30 4.03 3.26 
21 Infrequency .35 .69 .66 —.34 .63 1.10 
22 Desirability 15.82 15.45 16.26 37 2.88 2.62 


Note.—TV = transvestite. 


scores indicate a relative dislike for new and 
different experiences, and a preference for 
routine. Low Impulsivity scores are indicative 
of tendencies to act primarily with delibera- 
tion rather than on the spur of the moment, 
and the tendency to withhold feelings and 
wishes rather than giving these free expres- 
sion. These results are consistent with those 
reported previously by the authors (Bentler & 
Prince, 1969; Bentler, Shearman, & Prince, 
1969) and others (e.g., Randell, 1959). 

A second set of differences occurs with 
regard to the degree and nature of interper- 
sonal orientation. Thus, TV Ss tend, on the 
average, to be relatively withdrawn from so- 
cial involvement. Thus, they have lower scores 
on the Affiliation, Exhibition, Nurturance, 
and Social Recognition scales. The relatively 
low Affiliation scores indicate that, in com- 
parison to controls, TV Ss do not as readily 
enjoy being with friends and people in gen- 
eral, that they do not tend to make efforts 
to win friendships and maintain associations 
as readily. The relatively lower scores on the 
Exhibition scale indicates tendencies to intro- 
version, so that TV Ss would prefer not to 
be the center of attention and do not report 


themselves as enjoying behavior such as 
witticism or dramatics which would require 
an audience. The Nurturance scale indicates 
that TV Ss tend to be less interested in 
readily performing favors for others, giving 
assistance, sympathy, and comfort to others 
who may need it. Finally, the lowered Social 
Recognition scores indicate that TV Ss report 
themselves to be less approval seeking in the 
sense of being concerned about what other 
people might think of them; they are not 
quite as concerned as non-TV Ss in desir- 
ing to be held in high esteem by acquain- 
tances, nor in working for the approval and 
recognition of others. These data are Con- 
sistent with those of Morgenstern, Pearce, 
and Rees (1965), who noted that their 1? 
TV Ss were, on the average, one standar 
deviation below the normative mean on the 
Maudsley Personality Inventory Extraversio 
scale. 

Similarly, in the area of interpersonal rela- 
tions, but more particularly dealing with the 
degree of ascendency demonstrated by Ss, +, 
Ss report themselves as relatively lower ™ 
Dominance and Aggression. Thus, they at 
tempt to influence and direct other people 
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express opinions, and enjoy a leadership role 
relatively less than control Ss. Similarly, the 
relatively lowered Aggression scores indicate 
that TV are not as easily annoyed as non-TV 
Ss, do not particularly enjoy combat and 
argument, and are not willing to hurt other 
people in order to get even with those who 
they perceive as having harmed them. 

Finally, in regard to the willingness to 
follow directions from other people, TV Ss 
tend to be relatively more self-reliant and 
rebellious than others. They report themselves 
as relatively higher in Autonomy and lower 
in Succorance. This means, according to 
Jackson, that TV Ss are relatively more inter- 
ested in being free, not tied to people or 
obligations, and that they enjoy breaking 
away from restraints or restrictions. The rela- 
tively lowered Succorance scores indicate that, 
in comparison to non-TV, TV Ss on the aver- 
age do not as readily seek the sympathy, 
protection, love, advice, or reassurance of 
other people. They report themselves as not 
feeling insecure when without support by 
others, and tend not readily to confide their 
difficulties to another receptive person. 

The interested reader can consult the 
Personality Research Form Manual (Jackson, 
1967) if he wishes a more extensive descrip- 
tion of the various scales for further clarifica- 
tion of the remarkable and consistent differ- 
ences observed between TV and non-TV Ss. 
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This paper is a reply to criticisms made by Hirt and Kurtz of the Fisher- 
Cleveland exterior-interior body scheme model. It was shown that these criti- 
cisms are contradicted by the pertinent literature. Questions were also raised 
concerning the rationale of the Hirt and Kurtz study, the characteristics of the 
experimental populations, and the appropriateness of some of the statistical 


procedures, 


Hirt and Kurtz (1969) make a number of 
theoretical criticisms of the Fisher-Cleveland 
(1969) work and also purport to challenge 
the general validity of the exterior-interior 
model on the basis of a single empirical study. 
The present authors suggest that many of 
their statements represent a puzzling lack of 
acquaintance with the pertinent literature. 
Further, possible methodological difficulties 
with their one empirical study are pointed out. 

By way of general criticism, they comment 
that the reliability and validity of the 
Fisher-Cleveland boundary scoring system 
have never been established; that the inde- 
pendence of the Barrier score is in doubt 
because it is positively correlated with the 
Rorschach F+ index; that the relations of 
the boundary scores to measures of psycho- 
pathology have been inadequately explored; 
and further that their correlations with chro- 
nicity and degree of disability associated with 
illness are not known. However, the existing 
literature plainly contradicts these assertions. 

Thus, with regard to the reliability of the 
scoring system, there are now many studies 
which have demonstrated adequate scorer and 
test-retest reliability for the boundary scores. 
This information is available to any inter- 
ested person in the reports of McConnell 
and Daston (1961), Williams and Krasnoff 
(1964), Fisher (1963), Fisher and Cleveland 
(1968), Ramer (1963), Landau (1960), and 
others. 

1Requests for reprints should be sent to Seymour 


Fisher, Department of Psychiatry, Upstate Medical 
Center, 766 Irving Avenue, Syracuse 10, New Vork. 


The statement by Hirt and Kurtz that the 
validity of the boundary scores has not been 
shown is easily challengeable. Apparently, 
they are not aware that there have been 
about 125 studies dealing with this question. 
As Fisher and Cleveland (1968) and also 
Fisher (1963) have shown, the overwhelming 
majority have supported the construct valid- 
ity of the boundary scores. Apropos of this 
point, note that in a recent review of the 
literature concerned with the psychosomatic 
implications of the exterior-interior model, 
Buss (1966) remarked upon its high predictive 
success, “This is an excellent record for any 
theory in the area of psychosomatic disorders 
[p. 421].” 

The Hirt and Kurtz belief that Barrier is 
“clearly related to" F+% is also erroneous. If 
one turns to actual results, one observes that 
Compton (1964) was not able to find such a 
relationship. Further, one of the present 
writers has observed largely chance correla- 
tions between Barrier and F+% in numerous 
samples of Ss (Fisher, in press). 

Hirt and Kurtz state that the relationships 
of the boundary scores to degree of disability 
and chronicity of disability are not known; 
but this represents again a lack of knowledge 
of the literature. Multiple studies have shown 
that the correlations in question are of 4 
chance order (e.g. Allardice & Dole, 1966; 
Fisher & Cleveland, 1968; Landau, 1960; 
Sieracki, 1963; Ware, Fisher, & Cleveland, 
1957). ; 

Another point made by Hirt and Kurtz 15 
that inadequate attention has been directed 


144 


REJOINDER TO HIRT AND Kurtz 


to ascertaining the correlation between the 
boundary scores and ego strength. In actual- 
ity, much energy has been devoted to this 
problem. It is particularly pertinent that 
neither Holtzman, Thorpe, Swartz, and 
Herron (1961) nor Fisher and Cleveland 
(1968) have been able to find consistent cor- 
relations between the boundary scores and 
paper and pencil tests of ego strength, such 
as the MMPI. The correlation between the 
boundary scores and an MMPI index of psy- 
chopathology which Hirt and Kurtz observed 
(but only in a small subsample) stands as 
an isolated observation. Further, the MMPI 
index they employed has its own serious 
validity problems, Goldberg (1965), who de- 
vised the index, stated in a study in which 
he examined its utility, “The validity of even 
the best of the signs reported in the present 
study may provide no reassuring balm for 
MMPI users [p. 24].” But more importantly, 
if Hirt and Kurtz had examined papers by 
Fisher (1964a, 1966) dealing with the bound- 
ary and psychopathology, they would have 
discovered that much thought has been given 
to the problem and that currently it is not 
feasible to assert that the state of the bound- 
ary and degree of psychopathology are con- 
sistently linked. 

Critical questions can be raised concerning 
the statistical procedures used by Hirt and 
Kurtz. The chief defect relates to the fact 
that the assumption of homogeneity of co- 
variance was substantially violated with ref- 
erence to the Penetration comparison they 
reported. The fact that there was a negative 
correlation (—.626) between Penetration and 
the MMPI index of pathology in the interior 
group and a positive one (.315) in the exterior 
group (a difference of considerable magni- 
tude) makes use of covariance in this case 
invalid. Further, one must challenge the ap- 
propriateness of an analysis of variance de- 
sign which grossly combines the two exterior 
groups despite the fact that their means 
obviously indicate that they differ (and prob- 
ably significantly) from each other for both 
Barrier and Penetration. Also, reported mean 
values indicate there might even be significant 
differences for Barrier and Penetration be- 
tween the dermatitis group and each of the 
interior groups. 
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The prime virtue that Hirt and Kurtz 
ascribe to their study is that it is the first 
to compare more than one type of exterior 
and interior symptomatology group simultane- 
ously. This is untrue. The original Fisher and 
Cleveland (1955) monograph demonstrating 
boundary differences between exterior and 
interior symptom groups included patients 
with rheumatoid arthritis, dermatitis, and 
muscle difficulties in the exterior sample and 
patients with stomach ulcers and ulcerative 
colitis in the interior sample. The numbers of 
different categories of patients were greater 
than those presented by Hirt and Kurtz in 
their study. This fact seems to be unknown 
to them. It is also striking that nowhere in 
their paper do they mention the large number 
of studies which have specifically verified the 
Fisher-Cleveland symptomatological model. 
These studies cannot be dismissed simply be- 
cause they usually involve comparing two 
symptom groups at a time. Logically, a test 
of the exterior-interior model may be said to 
occur whenever an exterior and an interior 
symptom group are compared; and the fact 
is that these studies have involved a greater 
diversity of patients than represented in the 
Hirt and Kurtz samples. 

Hirt and Kurtz have also failed to mention 
in their paper that a large amount of litera- 
ture (e.g, Williams & Krasnoff, 1964) has 
accumulated which indicates that the Fisher- 
Cleveland exterior-interior model successfully 
predicts differential physiological reactivity of 
exterior and interior body sites in both normal 
Ss and patients with *psychosomatic" symp- 
toms. These studies are of crucial importance 
because they support the model’s basic state- 
ments about outside versus inside physiologi- 
cal reactivity in a more controlled and spe- 
cific fashion than can studies based on groups 
with gross symptomatological differences. 

In short, the general criticisms made of the 
Fisher-Cleveland work by Hirt and Kurtz are 
contradicted by the real facts. One must ques- 
tion why they have chosen not to mention the 
succession of studies that do buttress the 
validity of the exterior-interior model. They 
have presented their own work as if it existed 
in a vacuum. 

It is ironical that while they criticize the 
previous Fisher-Cleveland work for not ade- 
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quately defining the characteristics of Ss used, 
they present no information at all about 
their Ss, except to indicate their gross diag- 
nostic labels. This could be a serious defect 
because it is known that both sex (Fisher, 
1964b) and achievement motivation (e.g., as 
Tepresented by socioeconomic status, Landau, 
1960) affect Barrier level. How do we know 
that the inability of Hirt and Kurtz to find 
boundary differences between the symptomat- 
ological groups did not reflect the fact that 
these groups varied significantly in sex or 
socioeconomic composition? Even more con- 
fusing is the definition offered of the exterior, 
interior, and mixed Ss who were described by 
Hirt and Kurtz as involved in Experiment II. 
We are told that these Ss were assigned to the 
groups by a "physician experienced in psychi- 
atry and internal medicine.” What do we 
know about the reliability of this physician's 
judgments? Also, they remain relative mean- 
ingless unless we are given details of the 
Specific symptoms that he included in each 
group. 

If one were to assume there were no seri- 
ous errors in the way Hirt and Krutz assigned 
Ss to the diagnostic groups, what would their 
findings mean? Since the exterior sympto- 
matological groups (rheumatoid arthritis and 
dermatitis) employed have already shown 
themselves in past studies to conform well to 
the exterior-interior model and since the two 
interior groups (heart pathology and asthma) 
have not previously shown such conformance, 
it would be logical to conclude that these 
latter groups do not “fit” the exterior-interior 
Schema. But it should be noted that of all of 
the interior symptom groups studied these are 
the only. ones that would apparently not 
conform to expectation. 

This could represent a limit on the general- 
ity of the exterior-interior model. But one 
might wonder why coronary pathology and 
asthma should prove to be such exceptions. 
With respect to coronary pathology, there are 
multiple factors (e.g., smoking, diet, exercise) 
which are suspected of playing a direct con- 
tributory role. If smoking or diet do play such 
roles, it would obviously not be fair to expect 
the exterior-interior model to apply. It would 
be just as reasonable to expect the exterior- 

interior concept to apply in any other situa- 
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tion where an individual is the target of an 
environmental pathogenic or traumatic force, 
If a man swallows a poisonous substance 
and it damages his stomach he could not 
meaningfully be said to have developed an 
"interior" symptom. One must ask whether 
there is any reasonable limit on the range of 
body symptoms to which the exterior-interior 
concept can be applied. If it can be applied 
to coronary pathology, why not to liver 
damage or kidney pathology? Perhaps it is 
a defect of the exterior-interior model that 
it does not define the limits of its applicability 
in this sense, One can only say that this is 
not merely the fault of the model. A large 
share of the difficulty arises from the fact 
that so little is known about the etiology of 
many syndromes like stomach ulcers, colitis, 
and arthritis. However, it does not seem un- 
reasonable to exclude the application of the 
exterior-interior model to syndromes in which 
there is evidence that toxins, infections, and 
trauma arising from “outside” forces play a 
direct causative role. Coronary defects" may 
fall into this inappropriate category. 

An explanation of the negative findings 
with regard to the asthmatic group is not 
apparent. Perhaps they illustrate a failing in 
the exterior-interior model. Perhaps they are 
a function of the fact that asthma may have 
a strong hereditary component, as asserted by 
Leigh, Marley, and Braithwaite (1967) and 
Schwartz (1952). Perhaps they are related to 
the fact that many asthmatics are on 4 
regimen of sympathomimetic drugs which are 
known to produce neuromuscular sensations 
which may, as shown by Fisher and Renik 
(1966), produce a reinforcement of boundary 
sensations and therefore artificially increase 
the Barrier score. This remains to be SER 
Incidentally, if Hirt and Kurtz were intet- 
ested in reassessing previous results dealmg 
with the exterior-interior model, it seems il- 
logical that they should have chosen to repre 
sent the interior category with two symptom 
syndromes that Fisher and Cleveland havé 
never labeled as appropriate for this purpose: 

The attack of Hirt and Kurtz upon the 
Fisher-Cleveland model involves essentially 
two different components. There is, first 0 
all, a series of broad denunciatory statements 
which, as shown, are untenable in the face 9 
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even a moderate knowledge of the pertinent 
literature. Second, there is a single empirical 
study that omits important details concerning 
Ss employed; uses inappropriate statistical 
procedures; and embraces one symptomat- 
ological group (viz., coronary defects) that 
may be quite inappropriate for testing the 
exterior-interior model. Finally, even if this 
empirical study were to turn out to be sound, 
it represents one of a few minority negative 
findings among many more positive findings 
by other investigators. 
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Fifteen schizophrenics were compared with 15 controls of similar mental test 
status and educational background on two classes of elementary problems 
from an extensive series of artifact testing tasks monitored by an automatic 
problem programming machine called the Heuristic Evaluation Problem Pro- 
grammer. In terms of an index that reflects primarily the ability to conduct 
the required inquiry efficiently, the groups did not differ significantly on sta- 
tistical tests that employed between-Ss differences as the error variance. Re- 
peated measures of group differences did, however, show a small, but statis- 
tically significant, superiority for the control Ss, Intercorrelations between 
scores on various problem sets indicated that the tests were reliable. During 
the first three-quarters of the problem series, there was a marked inefficiency 
of performance in a small group of paranoid schizophrenics as compared with 
a group of chronic undifferentiated schizophrenics. 


The schizophrenic’s strange behavior, his 
indefensible beliefs, and the bizarre non se- 
quiturs by which he attempts to rationalize 
them have created and supported a firm con- 
viction that there is something wrong with 
the way schizophrenics think. It has not been 
easy, however, to decide what this deficit is. 
The logical oddities that abound on the ward, 
especially those associated with the fourth 
syllogistic figure, apparently caused von 
Domarus (1946) to assert that schizophrenic 
thinking is guided by a “paralogic.” They 
also led Arieti (1955, 1959), in extending 
this thesis, to describe them as “paleologic,” 
in order to make explicit reference to the 
phylogenetic regression that he felt to be 
characteristic of the schizophrenic’s aberrant 
thought processes. This clinical emphasis on 
logic and syllogisms constituted a definite 
theoretical proposal that could not be left 
unchallenged. Several investigators (Gottes- 
man & Chapman, 1960; Nims, 1959; Wil- 
liams, 1964) have compared the performances 
of normals and schizophrenics on multiple- 
choice tests of the ability to differentiate be- 
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tween valid and invalid syllogisms, but the 
anticipated deficit did not appear. Schizo- 
phrenics recognized logical fallacies at least 
as well as their controls did, and, conse- 
quently, no evidence was generated to sup- 
port the notion that their abberant way of 
thinking might be a result of a rejection of, 
or regression from Aristotelean principles. 
Does this outcome suffice to discredit the 
hypothesis that there is something wrong 
with the logical processes of schizophrenics? 
The tests in question, though favorites of 
some logicians, fail to reflect the important 
fact that syllogisms are useful only when 
viewed as schemata designed to assist in 
evaluating deductions that are intermediate 
products in the daily struggle to reconstruct 
an adequate image of the world. It may have 
been precisely this functional use of logic 
that von Domarus and Arieti meant to im- 
pugn in their patients. Concern with this 
interaction between the mind and reality 1 
prominent in several disciplines that deal 
with the behavior, errant or typical, of hu- 
man beings. The clinician stresses its im- 
portance by the diagnostic value he places 
on “reality testing” (Weiner, 1966). Millet, 
Galanter, and Pribram (1960) emphasize its 
significance for the theoretical psychologist 
by the determining influence they assign Ww 
the image of reality in goal directed thinking 
The philosopher has certainly endowed this 
problem with more devoted contemplation 
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than any other professional intellectual. John 
Dewey (1938) essentially equates the ability 
to think effectively with the ability to con- 
duct competent inquiry into existing states 
of affairs. This includes the ability to extract 
valid information from transactions with 
well-behaved segments of reality as well as 
the ability to organize and direct an inquiry 
toward a selected goal. It is the function of 
logic to supply conceptual paraphernalia pro- 
paedeutic to competent inquiry, but here, 
presumably, its responsibility ends. How the 
man on the street uses the products of their 
effort is not considered a proper topic of 
concern for logicians. Nonetheless, it is in 
this task of application that schizophrenics 
and the authors fear, many others may fre- 
quently fail, in spite of their ability to recog- 
nize invalid syllogisms in multiple-choice 
tests. Some empirical discipline, perhaps psy- 
chology, must undertake to study the human 
use of the principles of inquiry as they are 
applied in attempts to control and compre- 
hend the puzzling events that characterize 
the real world. 

Indeed, initial efforts have already been 
made in this direction (Blatt, 1961; John, 
1957; Rimoldi & Devane, 1961), but the ex- 
perimental tasks that have been developed 
are rather complicated for some large and 
important segments of the population (Jer- 
ome, 1962; Young, 1966). In the interest of 
extending the perspective of research on re- 
ality testing, it is necessary to develop a 
variety of problem situations, especially at 
the lower end of the difficulty scale if one 
wishes to secure information about the heu- 
ristic behavior of people less gifted than col- 
lege graduates. The Heuristic Evaluation 
Problem Programmer (HEPP), the test de- 
vice employed in the present study, was de- 
signed with just this purpose in view; it is 
a well-behaved segment of reality, an arti- 
fact to be sure, that S investigates under the 
constraints of a particular procedure. A priori 
consideration suggests that the problems it 
Monitors have three important properties: 
(a) they constitute a family in the sense of 
requiring for their solution the application of 
a variety of closely related logical forms; 
(5) they are arranged in an approximate 
Order of difficulty based on objective mea- 
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sures of complexity (see following section); 
and (c) they represent a range of difficulty 
that is quite large and includes at least some 
classes accessible to the heuristically ungifted. 

It was the purpose of the present study to 
determine whether the procedure described in 
the following section is capable of eliciting 
from schizophrenics performances that are 
about as good as those of control Ss of com- 
parable mental test status and educational 
background on the elementary problems of the 
HEPP series. It is especially important to 
succeed in this connection, because the more 
difficult problem classes are developed from 
this level by gradual increments of concep- 
tual complexity without introducing proced- 
ural changes or new materials. Success on 
the first two classes of problem, therefore, 
provides assurance that S understands the 
basic requirements of the situation and is 
capable of operating adequately in the test 
environment. 


METHOD 
Subjects 


Fifteen experimental Ss with the following sub- 
classification of schizophrenia were included in the 
study: chronic undifferentiated, nine; chronic para- 
noid, four; schizo-affective, one; schizoid personal- 
ity, one. The duration of illness from the time of 
first hospital admission ranged from 1.5 to 11 yr, 
with a mean of 4.6. According to parental reports, 
corroborated by hospital records, at least seven of 
these patients had serious personality and social dif- 
ficulties necessitating placement in special schools or 
outpatient psychotherapy from the age of 7 on. 
Fifteen normal controls were selected from a group 
of National Institutes of Health volunteers who 
showed no pathology under extensive psychiatric 
examination, In making this selection, it was neces- 
sary to sacrifice age and sex matching in order to 
achieve approximate equality of mental test scores 
and duration of formal education. Indexes referring 
to these extraexperimental variables are shown in 
Table 1 for the two groups. 


Apparatus 


The technical details of the apparatus have been 
described at length elsewhere (Jerome, Young, & 
Umberger, 1964), but the problem situation can be 
explained in terms of S's operation panel shown in 
Figure 1. Arranged in a 4X 4 matrix are 16 letter- 
labeled indicator switches which constitute S's in- 
puts to the system. When pressed, they are illumi- 
nated and close signal circuits used by the control 
unit. On any given trial S may operate any number 
of these buttons, including all of them or none of 
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TABLE 1 
WAIS Scores, EDUCATION, AND AGE OF SUBJECTS 


Schizophrenics Controls 
Item 
M SD M SD 
WAIS 
Verbal 108.2 143 108.5 7.5 
Performance| 100.6 17.6 102.9 8.7 
Full Scale 105.3 15.4 106.4 7.2 
Education 12.1 3.0 11.8 2.8 
Age 24.5 6.3 35.3 15.3 


them. The button labeled “Tell-Me” (TM), just 
below the input matrix, is the only system output 
accessible to S. It is an information signal, which, 
when pressed, is illuminated by a red or a green 
light depending upon the relation between the states 
(on-off) of the inputs and the problem constraints 
effective at that time. A trial consists of a selection 
of inputs terminated by a pressing of the TM but- 
ton. The panel, with S's selection illuminated and 
the information color displayed, remains lit for 3 
sec, after TM is pressed. During the display period, 


Fic. 1, S's panel. 
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codes representing the selection and the color of 
TM are recorded on a punched tape for subsequent 
input into a computer program. The “cancel” button 
is available at all times, and the numeric readout 
(top center) provides a running count of the trial 
number. The three dark buttons (to the left and 
right of TM, and upper right-hand corner) were 
not used in this experiment. The S had ample op- 
portunity to become familiar with these operating 
characteristics during the detailed instructions for 
the first problem class. 


Instructions 


All instructions were given on an cight-light field, 
the two top rows being covered by an opaque 
screen, Immediately after the instructions on a given 
class were completed, S solved five problems on this 
same field. The E made a maximum effort to con- 
vey to S all information relevant to each problem 
class as it was introduced. There were certain op- 
erating options and constraints common to both 
classes. In any problem there was only one input 
that had anything to do with the color of TM. 
Before this input had been identifed in a given 
problem, it was referred to as “HEPP” merely as 
an expository strategem to avoid the awkward repe- 
tition of “the one and only light that has anything 
to do with the color of Tell-Me." The S was always 
free to turn on any number of lights. The procedure 
was completely S paced; S was encouraged to take 
his time to plan. It was always S's task to find 
HEPP with as few TM presses as possible. Within 
any problem class, a change of problem consisted 
in changing the particular input that influenced the 
color of TM. The S was elaborately informed about 
every change of problem. The instructions employed 
to convey this information were highly redundant 
and consisted of a sequence of short, simple state- 
ments, each demonstrated, when possible, by panel 
operations executed by S at the direction of E. The 
S was encouraged to ask questions, and E repeated 
instructions and demonstrations until he was con- 
vinced that the problem was understood as clearly 
as possible under the circumstances that prevailed. 
If necessary, explanations were continued during 55 
first solution efforts. 

With this background the two problem classes 
used are easily defined: In HEPP I problems there 
was one input light that had to be on in order for 
TM to turn green when it was pressed, otherwise 
TM turned red. In HEPP II problems there was 
one input light that had to be off in order for 
to turn green when it was pressed, otherwise 
turned red. There were two reasons for using bó! 
types of problem in spite of their similarity: there 
is some evidence that schizophrenics have specia 
difficulty in controlling set (Shakow, 1963); the in- 
verted relation between HEPP states (on-off) and 
the information signal (red-green) occurs frequently 
in the more complex problems of the series, and 1 
was, therefore, important to obtain an indication P 
the effect of the inversion at this relatively simple 
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Jevel. Transfer from one class to the other would be 
expected to be positive and complete, except for dis- 
turbance associated with the reversal of set. In 
this study, however, it will not be possible to obtain 
a satisfactory measure of this effect, because the 
various problem categories were given to all Ss 
in the same sequence. 


Test Sequence 


Using roman numerals to indicate problem class 
and arabic numerals to indicate the number of 
lights in the display field, problems were presented 
in the following order: I-8, I-16, II-8, II-16. Each 
S was given five problems in each of these cate- 
gories. Except to inform S of the change, there was 
no interruption between problems of a given class. 
The instructions for the Class II problems were 
given immediately after completion of the Class I 
quota, and testing was resumed. In most cases, all 
problems were completed in one session that varied 
from 30 to 50 min. depending upon S's rate of work. 


RESULTS 


It will be recalled that the instructions re- 
quired S to locate the HEPP with as few 
inquiries (TM presses) as possible. He was 
free to adopt whatever strategy appeared to 
him to be appropriate to this end. The op- 
erating characteristics of his plan can be 
displayed quite simply by plotting the num- 
ber of possibilities that remained after each 
trial or selection. Data presented in this man- 
ner in Figure 2 were selected to illustrate 
some fundamental points. The curve for S 
756 shows the operating characteristic of the 
optimal plan. On each trial S tested half of 
the remaining possibilities and, thus, with 
this strategy, would always solve the 16 light 
problems in exactly four trials. The S 712 
tried lights in sets of various sizes until one 
that contained HEPP was encountered and, 
then, he proceeded with an item-by-item test 
on the members of that set until HEPP was 
identified. This procedure, does not show as 
much insight as the optimal strategy, but it 
1s superior to the item-by-item test illus- 
trated by S 627, who was lucky enough to 
encounter HEPP on his fifth trial. Through 
the operation of a random success factor, this 
and the previous procedure can occasionally 
have quite happy outcomes, success in one 
and two trials, respectively, but they also can 
be especially hazardous, if not applied in an 
orderly way. The dangers of disorderly search 
are illustrated in the data from S 773. He 
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used a mixture of the last two plans with no 
apparent pattern in his selections and made 
several redundant choices (Trials 5, 8, 9, and 
10) that suggest serious disorientation, 

These possibility functions provide pictures 
that are useful for reviewing the perform- 
ances of individual patients, but they are not 
appropriate for group comparisons. Some 
kind of data reduction, with its attendant 
loss of detail, is necessary in the interest of 
comprehensibility. Since the amount of work 
done by each S was fixed, the groups might 
be compared with respect to the average 
number of trials required for solution. Asso- 
ciated with this measure, however, is the 
random success factor mentioned in connec- 
tion with S 627's fifth trial. This effect would 
greatly increase the error variance and render 
tests of significance insensitive. Moreover, 
such a score would not reflect the strategy 
employed with a satisfactory degree of ac- 
curacy. The index described below, though 
somewhat complicated, does overcome these 
difficulties. 


Expected Information Value (EIV) 


The HEPP problems were designed to per- 
mit an evaluation of each selection made by 
S in the course of his solution effort. In fact, 
there are two such measures for each trial: 
(a) the amount of information actually 
elicited by the selection and (5) the amount 
of information that it might rationally be ex- 
pected to elicit. The former is subject to the 
random success factor, but the latter is not 
affected by the outcome at all. Expected 
information value (EIV) depends only upon 
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the selection in relation to the uncertainty 
structure in which it is made. Thus, at the 
beginning of any trial, there are N equally 
probable solutions, and the existing uncer- 
tainty is loge N. The display chosen by S 
for testing partitions the possibility set into 
two subsets, one consisting of N1 possibil- 
ities that remain if the display is classified 
in one way, say green, the other consisting 
of N — N, = Ns possibilities that remain if 
it is classified in the other way, say red. The 
corresponding uncertainties are logo N, and 
logo No, representing information gains of 
log; N — loge N, and logs N — log» No, re- 
spectively. The EIV of a selection is the av- 
erage of these two possible gains weighted 
for their separate probabilities of occurrence. 
Since the original JV possibilities are assumed 
equiprobable, the probability of occurrence 
of either gain depends only upon the number 
of possibilities included in the associated sub- 
set; that is, the probability of a gain of the 
first magnitude is N,/N and that of the sec- 
ond is N»/N. The expected information value 
of the selection, thus becomes EIV = V,/N 
(logo N — logo N1) + No/N (logo N — logs 
Nz) which, for convenient reference to exist- 
ing tables, (e.g., Attneave, 1959) can be re- 
written EIV = 1/N (N logs N — N; logs N, 
— Ns log; N2). The EIV ranges from 0 (no 
information obtained) to 1 (half of the exist- 
ing possibilities eliminated). Throughout the 
remainder of this report, indexes referred to 
as "problem scores" will be averages obtained 
by weighting each trial EIV by the number 
of possibilities existing just prior to the se- 
lection to which it refers. In the tables, these 
indexes have been multiplied by 100 to elimi- 
nate decimal points. 


Relative Efficiency 


Figure 3 summarizes the general outcome 
of the experiment and shows the average 
problem score of each group. for the various 
problem classes and sizes. It rather strongly 
suggests that the performances of the schizo- 
phrenics were inferior to those of the controls. 
For each of the four problem categories sepa- 
rately, a two-way analysis of variance with 
repeated measures was performed. The only 
reliable effect was that due to problem order 
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in the I-8 and I-16 categories (Ż < 05). 
Similarly, ¢ tests on the difference between 
means for individual problems failed to re- 
veal any in which the groups differed sig- 
nificantly. In both of these tests, the error 
variance was determined by the between-Ss 
variability confounded with other random 
factors. Since schizophrenics are known to ex- 
hibit abnormally large individual differences 
in a variety of performances (Shakow, 1963), 
such tests are likely to be insensitive and to 
fail to detect real differences between groups. 
However, it will be noticed that the controls 
had the better EIV on 15 of the 20 problems, 
Each pair of problem means provides an esti- 
mate of the relative proficiency of the two 
groups. When the distribution of differences 
generated by these paired comparisons was 
used to estimate an error variance, the un- 
impressive mean difference of 1.7 in favor of 
the controls was found to be statistically sig- 
nificant (p < .02).* 

The general upward trend from early to 
late problems, seen in several segments of 
Figure 3, indicates that both group perform- 
ances improved with experience. The reliabil- 
ity of the observed gains from the first to 
the fifth problem within each category was 
estimated by determining whether the mean 
of the 15 within-Ss differences was signifi- 
cantly greater than zero. On this criterion, 
the controls were found to have made reliable 
gains (p < .03) in all problem categories €x- 
cept II-16 and the schizophrenics improved 
reliably in I-8 (p < .01) and IT-16 (p < 02) 
only, despite large apparent gains in both 
I-16 and II-8. 

Changes in problem category were ass0- 
ciated with two types of cognitive demand. 
Those from I-8 to I-16 and from II-8 to II- 
16 involved increments in uncertainty; that 
from I-16 to II-8 involved a reversal of the 
information cue and a decrease in uncer- 
tainty. As a result of the progressive com 
plexity design of the study, these two latter 
effects, which would be expected to influence 
performance in opposite directions, can not 
be separated or evaluated independently. 
However, since both groups performed at 


3 One-tailed p values have been used consistently 
for the results of £ tests. 
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Fic. 3. Average EIVs for schizophrenic and control 
groups on successive problems. 


least as well on the first problem of II-8 as 
they had on any previous problems, there was 
no direct evidence of disturbance associated 
with the reversal of set. It may be noted, 
however, that the schizophrenics showed a 
large gain, though nonsignificant (p < .16), 
from the last problem of I-16 to the first of 
II-8, whereas the controls had a small, but 
unreliable, loss at this point. Thus, if the 
schizophrenics were disturbed by the rever- 
sal, as their comments frequently suggested 
they were, the reduction in problem size en- 
abled them to more than compensate for 
losses associated with the inversion. If the 
controls derived any benefit from the reduc- 
tion in uncertainty, it was completely masked 
by the effect of information reversal. That 
changes in uncertainty had considerable in- 
fluence was shown by (a) a significant loss 
(p < .04) in both groups associated with 
the shift from I-8 to I-16, an increment in 
uncertainty; (5) the schizophrenics’ gain at 
the switch from I-16 to II-8, when uncer- 
tainty was decreased; and (c) the schizo- 
phrenics’ significant loss at the shift from 
IL-8 to II-16 (p < .03), when complexity was 
increased again. Since the performances of 
the controls were not significantly affected at 
these latter transition points, the evidence 
Suggests that the schizophrenics were more 
sensitive to uncertainty than the controls. 


Subtypes of Schizophrenia 


Average HEPP scores according to prob- 
lem category and type of schizophrenic pa- 
tient included in the experimental group are 
shown in Table 2, which exhibits a marked 
Contrast between the performances of para- 
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TABLE 2 
AVERAGE EIVs BY SUBCLASSIFICATIONS 
OF SCHIZOPHRENIA 
Subclassifications N | L8 | 1-16 | 11-8 | 11-16 
Chronic undifferentiated | 9 | 89 | 84 | 91 | 87 
Chronic paranoid 4 | 69 | 58 | 75 | 85 
Schizo-affective 1 | 57 | 55 | 66 | 48 
Schizoid personality 1 | 83 | 67 | 95 | 88 


noid and chronic undifferentiated schizo- 
phrenics. A more detailed comparison appears 
in Figure 4, which, except for the separation 
of schizophrenic groups, is analogous to Fig- 
ure 3. Again using differences between pairs 
of problem means to estimate the error vari- 
ances, it was found that the difference in 
favor of the undifferentiated schizophrenics 
over the controls was significant at the .01 
level. The differences between chronic undif- 
ferentiated and paranoid schizophrenics, and 
between controls and paranoids were signifi- 
cant at the same level. 

Figure 4 also provides an explanation for 
the failure of the method of gains to detect 
a reliable improvement in I-16 and II-8 for 
the schizophrenics despite a rather large ap- 
parent gain from the first to the fifth problem 
in these categories. It will be noticed that, 
whereas the four paranoids made large gains 
on the average, the undifferentiated schizo- 
phrenics apparently had achieved a rather 
high plateau, and consequently made little or 
no average gain. This biomodality, of course, 
made for a large error variance and an asso- 
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ciated lack of sensitivity. This, in turn, was 
a result of differences in practice effects 
among the two subtypes of patients. 


Correlations 


The HEPP index employed in computing 
the rank-order correlation coefficients pre- 
sented in the following tables was obtained 
by averaging the five problem scores of each 
S within the relevant problem categories. 
Table 3 gives the correlations between per- 
formances on the various types of problems 
with the two S groups combined. All corre- 
lations are positive and significant at less 
than the .01 level. Their magnitudes indicate 
a high interclass reliability of HEPP per- 
formances. Moreover, since the observed 
trend in amount of correlation is in the di- 
rection of a priori similarity, it is suggested 
that the within-class reliability is probably 
even higher. These results also indicate that 
the considerable between-category improve- 
ment of performance with experience, ob- 
servable in Figure 3, was not accompanied 
by a marked rearrangement of the relative 
proficiencies of individual Ss, at least when 
the combined groups are considered. 

Comparable indexes are given for the two 
groups separately in Table 4. It will be ob- 
served that the schizophrenics contributed 
more than the controls to the overall corre- 
lations; for every pair of correlates, their 
coefficient was higher than that of the con- 
trols. Moreover, for the schizophrenics, all 
correlations were significant whereas, for the 
controls, only intraclass correlations were re- 
liable. The difference in correlation pattern 
between the two groups indicates that the 
relative proficiencies of individuals was more 
stable among the schizophrenics than it was 
among the controls. This, in turn, may be 


TABLE 3 


CoRRELATIONS BETWEEN HEPP PERFORMANCES, 
Groups COMBINED 


HEPP 116 I8 11-16 
1-8 .13* .50 E 
I16 54 .67 
Ir-8 .69 


*p < 01. 
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TABLE 4 
CORRELATIONS BETWEEN HEPP PERFORMANCES 


Schizophrenics Controls 
HEPP 
116 | ILS | IL16 | 1-16 | ILS | 1116 
L8 |.82**| .58* f*- | .635** | .28 | 739 
- Je 42 |.62* 
Jt .66** 
* p «.05 
5$ $0 


attributable to the marked separation be- 
tween paranoids and undifferentiated schizo- 
phrenics noted in connection with Figure 4. 

The correlations between HEPP perform- 
ances and the extraexperimental variables re- 
flected by WAIS performance, education, and 
age, are displayed in Table 5 for the com- 
bined group. HEPP II performances clearly 
correlated more frequently with these varia- 
bles than did HEPP I performances. This 
suggests that some experience with HEPP- 
like problems was required before the cog- 
nitive processes associated with the ex- 
traexperimental variables manifested their 
influence significantly. Age was the only 
variable listed that did not correlate signifi- 
cantly with any HEPP performance in this 
combined group. 

In Table 6, comparable indexes for the 
two groups are listed separately. There were 
no significant correlations with HEPP I 
scores among the’ controls, but the schizo- 
phrenics did have a reliable correlation be- 
tween verbal intelligence and HEPP 1-16. 
However, these variables did not manifest 
their general relevance to the schizophrenics 
performance until the HEPP II-16 problems 
were given, whereas it was manifest among 
the controls for the HEPP II-8 problems, 
that is, somewhat earlier. Finally, it should 
be noted that the controls showed a signifi 
cant relation between HEPP II-16 and age. 
The failure of the schizophrenics to do 50 
may be a result of the restricted age rang 
in this group (15-40 yr.) compared with the 
controls (16-65 yr.). Moreover, since all eX- 
cept one of the correlations between HEPP 
performances and age were negative, it aP- 
pears quite possible that there is a real rela- 
tion between performance and age that was 
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not detectable under the conditions of this 
experiment. 


Discussion 


The results of this study indicate that the 
procedure employed elicited from schizo- 
phrenics performances on elementary HEPP 
problems that were at least as efficient as 
those of control Ss. The implications of this 
outcome depend, of course, upon the nature 
of the problem involved and upon the possi- 
bilities for future developments that this 
success entails, A HEPP problem of any de- 
gree of difficulty confronts S with the task 
of planning and executing an inquiry into 
the operation of an artifact that behaves in 
a lawful way. The S may be given much, or 
little, information about an artifact that op- 
erates according to simple, or complex, prin- 
ciples, and, depending on these variables, he 
may be required to adduce, by exploration 
and reasoning, much, or little, about the 
laws that are controlling this test segment of 
reality. The HEPP situation requires that S 
execute a sequence of meaningful, goal-di- 
rected transactions, or interactions, with the 
artifact. Because he controls the information 
flow, his actions may be minimally or maxi- 
mally effective; he may create ambiguities 
and difficulties for himself by a poorly de- 
signed plan of inquiry, or he may minimize 
the cognitive strain of the solution effort by 
sagacious planning. Finally, because he can 
be exposed repeatedly to formally identical 
situations that differ only with respect to the 
particulars of the solution, he can be given 
numerous opportunities to criticize his pro- 
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TABLE 5 


WAIS Scores, EDUCATION, AND AGE CORRELATED 
with HEPP PERFORMANCES, GROUPS COMBINED 


Item 18 1-16 IL-8 II-16 

WAIS 

Verbal ll .34* A1* 43%" 

Performance 33* 29 -40* .34* 

Full scale 20 .33* A9** | 49e 
Education .06 21 .36* .63** 

e —.19 —21 | —30 | —.25 

*p«.05 

** 5 < 01 


cedure, correct its faults, and thus to profit 
by experience. 

The present study indicates that schizo- 
phrenics are capable of performing these 
tasks effectively; hence, that they are capable 
of understanding the artifact-testing assign- 
ment and have intact those cognitive and 
heuristic processes necessary for executing it 
with “normal” efficiency at the elementary 
level. Moreover, since schizophrenics mani- 
fest elation on success, chagrin at failure, and 
improvement with experience, without the use 
of noxious reinforcement, it appears that 
these gamelike problems have, for them, in- 
trinsic motivating components that many 
other tests fail to exhibit. Given the basic 
ability and motivation shown in this study, 
schizophrenics should be able to work on in- 
creasingly difficult HEPP problems until they 
encounter some specific cognitive demand to 
which they are not prepared to respond ef- 
fectively. 

The marked difference between the per- 
formances of paranoid and chronic undiffer- 


TABLE 6 
WAIS Scores, EDUCATION, AND AGE CORRELATED WITH HEPP PERFORMANCES 


Schizophrenics Controls 
Item 

1-8 1-16 IL-8 IL16 18 1-16 IL8 11-16 

WAIS 
Verbal 28 AT* 43 quee —.09 10 40, 2 
Performance .28 :31 33 .A4* 40 21 48. in 
Full scale .20 36 E .64** 40 AL 637 9 i 
Education 22 .29 14 .57* —.12 .00 Jn s EN 
Age —10 | —.09 —3 19 —.27 —37 | —26 —.51 

* 
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entiated schizophrenics, noted in connection 
with Figure 4, is regarded as a far from well- 
founded finding. Although, on the average, 
these groups did not differ markedly with 
respect to duration of hospitalization, 5.5 yr. 
and 6.4 yr., nor WAIS intelligence, 105.5 and 
106, for the undifferentiated and paranoid, 
respectively, the latter were, on the average, 
9 yr. older than the former. Since, however, 
HEPP performances did not correlate sig- 
nificantly with age among the schizophrenics, 
it is unlikely that a large performance dis- 
parity would be associated with this factor. 
Nonetheless, before hypothesizing a specific 
heuristic difference between paranoid and 
chronic undifferentiated schizophrenics, it is 
highly desirable to expand both groups in 
the interest of obtaining possibly more repre- 
sentative samples. 
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CLIENT-COUNSELOR COMPATIBILITY AND THE 
OUTCOME OF COUNSELING? 


GERALD A. MENDELSOHN ? anp NEIL O, RANKIN ë 
University of California, Berkeley 


Client-counselor compatibility as measured by the Fundamental Interpersonal 
Relations Orientation Behavior (FIRO-B) scale, was related to clients’ 
(N = 115) perceptions of the relationship and evaluations of the counselor and 
the usefulness of counseling Compatibility was a poor predictor for male clients 
but an excellent one for females. Generally, compatibility in the control need 
area was related positively to outcome, but surprisingly, compatibility in the 
inclusion and affection need areas was related to negative outcomes. Conse- 
quently, the global compatibility measure used successfully by Sapolsky yielded 
no significant correlations in this study. However, a composite formed a 
posteriori for the females produced a correlation of .54 with the major out- 
come variable, These results point to the necessity of examining potential sex 
differences, particularly in studies of the clinical relationship, and to the 


possibility of empirically based client-counselor matching, 


In a recent paper, Sapolsky (1965) re- 
ported that “the degree of interpersonal com- 
patibility [in terms of Schutz’s, 1958, Funda- 
mental Interpersonal Relations Orientation 
Behavior, FIRO-B, scale] existing between 
patient and doctor was a significant variable 
affecting the outcome of hospital treatment 
[p. 75].” Since patients in high compatibility 
pairs felt more similar to and better under- 


measure of interpersonal compatibility; the 
test is specifically designed for use in studies 
of dyadic and group behavior. Sapolsky’s 
results are encouraging as a validation of a 
potentially useful test, as an empirical con- 
firmation of ideas about the importance of 
the clinical relationship to outcome, and as 
an indication of the possibility of rational 
patient-doctor matching. There are, however, 


B Sed by their doctors, aa Ae ae ee a number of limitations of the study which 
| understood were related to favorable out- T : s lit 
| comes, he suggests that the relationship be- ise serious questions about the generality 
tween compatibility and outcome is medi- nd meaning of his findings. Not only is the 
ated by the effect of compatibility on the sample small in size, but it also consists en- 
patient's perception of his doctor. Although  tirely of females treated in a hospital by 
he notes the small size of his sample and the three first- and second-year residents. Fur- 
need for more data, he concludes that com- ther, there are several methodological ques- 
patibility may be an important variable un- tions involved in the use of compatibility 
derlying the establishment of *good" thera- scores which are overlooked in the study. It 
peutic relationships. is the intent of the present investigation to 
It is surprising that FIRO-B has been so examine the effects of compatibility ina very 
little used in clinical research, for it is virtu- different clinical setting and by a quite dif- 
ally unique in providing a direct operational ferent approach to analysis. If compatibility 
1Thi a A STIRE in terms of FIRO-B is really an “important 
habilitation “Administration Grant RD-1741--66, Underlying personality variable contributing 
The authors wish to express their thanks to the staff to the establishment of good’ therapeutic 
d the Counseling Center of the University of oa relationships [Sapolsky, 1965, B ir 
ornia, Berkeley, and especially to the center's i- ct comparable (althoug not iden- 
BRUN UE A. Kirk, for advice, assistance, and Es Te i5 tie of Sapolsky irrespective 
2 Requests. for reprints should be sent to Gerald of the setting and the approach to analysis. 
7 A. Mendelsohn, Department of Psychology, Univer- It should be noted at the outset that the 


Sity of California, Berkeley, California. 
® Now at Clark University. 


study to be reported is not a replication of 
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Sapolsky’s work but rather is complementary 
to it.* 


METHOD 


Subjects and Data Collection Procedure 


The Ss of this study were 162 clients, 104 males 
and 58 females, who had come to the Counseling 
Center of the University of California, Berkeley, 
seeking assistance with vocational, educational, and 
personal problems. Every client coming to the Coun- 
seling Center for the first time during the data 
collection period was asked to participate in a re- 
search project designed to improve counseling serv- 
ices. Clients were told that this would entail about 
an hour of testing and that the results of the tests 
would not be available to their counselors but would 
be reserved for research purposes only. More than 
90% of the clients asked to participate agreed to 
do so; lack of time was the usual reason given 
for refusal. Use of the Counseling Center’s services 
is voluntary, so that neither the decision to come to 
the Center nor to participate in the study was the 
result of coercion or administrative decision. 

The clients were seen by 11 counselors; 4 of 
them (two males and two females) had 5 yr. or 
more of counseling experience, 4 (two males and 
two females) had between 2 and 5 yr. of experience, 
and the remaining two males and one female were 
graduate students in clinical psychology who had 
less than a year of counseling experience. The coun- 
selors in each experience level saw an approximately 
equal number of clients. Three-quarters of the clients 
were assigned to a counselor of their own sex. The 
counseling was of short duration, eight interviews 
being the maximum in this sample. The orientation 
of counseling is toward viewing the client’s prob- 
lems within a developmental and personality frame- 
work as well as in terms of the current environ- 
ment, The client-counselor relationship is explicitly 
emphasized as a critical factor in the counseling. 
Consequently, although the setting and the Ss of 
the present research are different from those studied 
by Sapolsky, the clinical relationship cannot be con- 
sidered any less important to the counseling process 
than to other clinical processes, 

Prior to the beginning of the data collection, all 
participating counselors took FIRO-B and two other 
self-report inventories not germane to the present 
paper. Each client took the same set of tests before 
the first interview. After the conclusion of the series 
of interviews, an evaluation questionnaire was 
mailed to each client in the sample. A series was 
considered to be concluded when 3 mo, had elapsed 
after the last recorded interview. If a client did not 
return the questionnaire within 2 wk., a follow-up 
letter with another questionnaire enclosed was sent. 
A total of 115 clients, 73 males and 42 females, 71% 
of the initial sample, returned usable questionnaires. 


*' The data for the study were collected prior to 
the appearance of Sapolsky’s paper. 
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Independent Variables 


A full description of the development and theo- 
retical rationale of FIRO-B and the procedure for 
deriving compatibility scores from it are given in 
Schutz (1958). The test provides scores in three 
need areas, inclusion (I), control (C), and affection 
(A) which Schutz argues constitute a sufficient set 
of dimensions to predict interpersonal behavior, 
The test attempts to measure both the extent to 
which the S expresses behavior toward others in 
each area and the extent to which he wants others 
to express the behavior toward him. Thus each $ 
received six scores: expressed inclusion (Ie), wanted 
inclusion (Iw), etc. Schutz then delineates three 
kinds of compatibility, reciprocal compatibility 
(rK4), originator compatibility (oK), and inter- 
change compatibility (xK:;); the subscript i refers 
to one member of a dyad and the subscript j to 
the other member. The meaning and specific formula 
for each is given below. 

Reciprocal compatibility “reflects the degree to 
which members of a dyad reciprocally satisfy each 
others behavior preferences [Schutz, 1958, p. 108].” 
The formula is: rKy = |e; — w| + Je; — wi. A 
zero score reflects maximum compatibility and the 
larger the score, the less the compatibility. 

Originator compatibility is concerned with the 
balance of initiating and receiving behavior in the 
dyad. Compatibility is present when the two mem- 
bers have complementary patterns. 


Conflict arises when there is disagreement regard- 
ing preference of who shall originate relations and 
who shall receive them. For each need area (I, C 
A) there are two types of conflict: between two 
originators, competitive originator incompatibility, 
and between two receivers, apathetic originator 
incompatibility [Schutz, 1958, p. 109]. 


The formula is: oK«; = (e, — wi) + (ej — wj). Posi- 
tive scores reflect competitive incompatibility and 
negative scores apathetic incompatibility. Again & 
score of zero indicates maximum compatibility. 

Interchange compatibility “refers to the mutual 
expression of the ‘commodity’ of a given need area 
[Schutz, 1958, p. 110]." It can be seen from the 
formula that it is, in effect, a measure of the simi 
larity or dissimilarity in behavior preferences be 
tween dyad members: Ku = |(e«-- ws) — (4 
1,)|. As with reciprocal compatibility, the larger 
the sum the less the compatibility. It is true m 
cases that zero indicates maximum compatibility. 

Since there are three test dimensions and thre 
forms of compatibility, there are nine compatibility 
scores which will be referred to as “individual com 
patibility scores.” In addition, Schutz suggests á 
number of composite scores, of which only one 7 
of present concern: the sum of all nine E 
designated K. It is this measure which was used e 
Sapolsky. The authors will refer to it in this PaP 
as the "global compatibility score." 

There are several problems inherent in the H 
of matching scores of any kind in counseling " 


| 
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search (see Mendelsohn, 1966) and particularly in 
the use of composite measures of matching. The 
most important problem is that matching scores are 
necessarily a function of the separate scores from 
which they are derived and may thus, be correlated 
with them to a greater or lesser degree. Conse- 
quently, what one attempts to explain by matching 
may, in fact, be the result of client or counselor 
scores alone. If one examines the formula for origi- 
nator compatibility, for example, it is apparent that 
it is nothing more than an unweighted arithmetic 
combination of two client and two counselor scores; 
any results attributed to originator compatibility 
can equally well be attributed to the client and 
counselor scores. This relationship between the com- 
ponent scores and the compatibility scores does not 
hold for the measures of reciprocal or interchange 
compatibility, however, because both those measures 
involve absolute difference scores. While they may 
be related to client and counselor scores, unlike 
the measure of originator compatibility, they include 
an independent matching component. Since Sapolsky 
does not examine the separate effects of the client 
and counselor scores, it is difficult to reach any con- 
clusions about the independent effects of compati- 
bility in his study.5 

Two additional problems arise in the use of the 
unweighted sum of the nine compatibility scores, 
that is, K, as a measure of global compatibility. 
Both reciprocal and interchange compatibility are 
absolute scores, but originator compatibility can 
have both positive and negative values. Thus when 
the nine scores are summed, K will be larger if 
there is “competitive originator incompatibility” and 
smaller if there is “apathetic originator incompati- 
bility.” This leads to a situation in which as one 
form of incompatibility increases, overall compati- 
bility, as operationally defined, also increases. The 
problem could be handled if originator compatibil- 
ity scores were treated as absolute values, but 
neither Schutz nor Sapolsky makes any such sug- 
gestion. Again this leads to difficulty in interpreting 


iere findings in terms of compatibility. The 


econd problem in the use of a global measure like 
K is that its components may contribute differen- 
tially to the prediction of a criterion. A single com- 
ponent or a subset of components may predict as 
well or better than K and it is even possible that 
two components may predict the criterion in op- 
posite directions, In the latter case, a simple sum 
in which all components are weighted equally can 
lead to a cancellation of effects. Under any circum- 
Stances, a global measure is more difücult to inter- 
Pret than its individual components and a differ- 
entially weighted composite is likely to predict better 
than a composite formed of equally weighted com- 
Ponents, 


eS gn 

5Sapolsky is not, it seems, unaware of this prob- 
lem but the attempt to control for it (p. 73) does 
not Provide information on separate effects of pa- 
tient or doctor scores. 
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The authors have gone into some detail in the 
discussion of FIRO-B and the measures of compati- 
bility derived from it both to point out certain 
questions about Sapolsky’s findings and to indicate 
the methodological problems involved in the use 
of matching scores. The authors will necessarily 
return to these problems in the data analyses to be 
reported. 


Outcome Questionnaire 


The outcome questionnaire used in this study is 
the same as that described in Mendelsohn and Geller 
(1965). It consists of 27 items, each in the form of 
a 7-point Likert-type scale. Clients are asked to 
evaluate a number of aspects of their counseling ex- 
perience and to indicate their reactions to the 
counselor and the process, Cluster analyses of client 
responses in four separate samples have produced 
a relatively invariate structure which applies equally 
well to males and females. It consists of five clus- 
ters whose alpha reliabilities are .85, .80, .77, .59, 
and .78, respectively: Cluster 1 represents degree of 
general satisfaction with counseling and evaluation 
of the extent to which objectives were achieved; 
Cluster 2 taps evaluation of the counselor’s skill and 
perceptiveness; Cluster 3 reflects the client’s feel- 
ings of comfort and of being understood in counsel- 
ing; and the last two clusters are concerned with 
specific ways in which counseling was of benefit, 
through the therapeutic means of insight and cathar- 
sis (Cluster 4) and by acquiring concrete informa- 
tion and reaching specific decisions about future 
actions (Cluster 5). Though the five obtained clus- 
ters consist of different sets of items and refer to 
conceptually different aspects of the counseling, their 
intercorrelations are sufficiently high to suggest that 
there is a single dimension of evaluation of the 
counselor and the counseling which underlies much 
of the variance of the questionnaire responses. Con- 
sequently, the first centroid was extracted (Thur- 
stone, 1947) and the 11 items with the highest 
factor loadings on it were selected to define the 
centroid. This set of items had an alpha reliability 
of .94 and accounts for 61% of the initial item 
communality. There are thus six dependent measures 
available, but since the first centroid has such gen- 
erality, it will be used as the major outcome vari- 
able in the presentation of results. It will be referred 
to as “general evaluation.” 

Scores were obtained for the clusters and the 
centroid by summing a client’s ratings on the de- 
fining items and converting the resulting distribu- 
tions to T scores. Items were scored in such a way 
that the higher the T scores, the more favorable 
the client's response. 

The outcome measures of the present study are 
different from those used by Sapolsky. In his study, 
evaluation of outcome was by the supervising psy- 
chiatrist and measures of relationship and rapport 
were inferred from the profile similarity of patient 
and doctor Semantic Differential protocols. In con- 
trast, the client is the source of the outcome vari- 
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ables in the present study and the measures of 
relationship were obtained by direct question. Al- 
though self-report has its limitations as a source of 
evaluation of success, it remains the least equivocal 
method of assessing the client’s feelings about and 
perceptions of the counselor and the process of 
counseling. 

It is, then, only in the use of the same predictor, 
FIRO-B, that the two studies are the same, but 
as noted before, if compatibility is a central factor 
in fostering “good” relationships, results of the two 
studies should be comparable. 


RESULTS 


The correlations between the 10 compati- 
bility scores and the measure of general 
evaluation (first centroid) are shown for 
males and females in Table 1. 

Perhaps the most striking aspect of these 
results is the difference between the corre- 
lations for males and females. For the latter, 
5 of the compatibility scores correlate sig- 
nificantly (p < .05) with general evaluation, 
but for the former none of the correlations 
even approaches significance. This basic find- 
ing holds for the correlations between the 
compatibility scores and the five other clus- 
ters scores as well. There is at least one 
significant predictor for each cluster score 
for the females. In contrast, only two sig- 
nificant (p < .05) correlations were obtained 
for males and both are relatively low (r = 
.25). Since 60 correlation coefficients were 


TABLE 1 


Propuct-MoMENT CORRELATIONS BETWEEN COMBATI- 
BILITY SCORES AND GENERAL EVALUATION 


Need area . Males Females* 

Inclusion 

Reciprocal (rK}) 09 35° 

Originator (oK1) 10 —.17 

Interchange (xK!) 05 .30* 
Control 

Reciprocal (rK°) — 12 09 

Originator (oK9) —2 —.35* 

Interchange (xK9) —41 —.32* 
Affection 

Reciprocal (xK^) +12 34* 

Originator (oK^) 01 —.06 

Interchange (xK^) —.1 «25 
Global compatibility (K)* —.05 A3 


^ Global Compatibility is the sum of the nine com; t 
SEU scores listed above it. Duces 
n = 73. 


on = 42. 
*p <.05. 
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computed for each sex, the conclusion that 
the results for males are due to chance can- 
not be rejected. Thus evidence of a relation- 
ship between FIRO-B compatibility and out- 
come was obtained for females only.* 

Before reaching any conclusions about the 
relationship of compatibility scores to the 
criteria in the female sample, the possibility 
that client or counselor scores alone can pre- | 
dict as well as or even better than the com- 
patibility scores must be examined. The find- 
ings for the general evaluation score are 
typical and will serve to illustrate thc pat- 
tern of results. Two client scores, Wanted 
Control (Cw) and Expressed Control (C;) 
have moderate correlations with the criterion 
(r = .28 and —.27, respectively), but neither 
is significant at the .05 level and neither is 
as high a correlation as any of the five sig- 
nificant ones between compatibility scores 
and general evaluation (see Table 1). Com- 
patibility, then, does seem to have an im- 
portant predictive function, independent of 
client or counselor scores, for the females. 

The second finding of note in these data 
is the failure of the global compatibility 
measure (K), used by Sapolsky, to predict 
any of the outcome measures for either sex 
significantly. For females, this failure occurs - 
despite the presence of a substantial number 
of significant correlations between the out- 
come measures and the individual compati- 
bility scores which are the components of K. 
Inspection of Table 1 reveals the reason for 
this failure quite clearly. As noted before, t 
global compatibility measure is the simpl 
sum of the individual compatibility Score. 
but these nine scores do not all correlate with 
the criterion in the same direction. In fach 
for each sex, just about half the correlation 
coefficients are positive and half are negative 
Consequently, the effects of the individu 
compatibility scores cancel each other ott 
when these scores are combined into the glo 
bal compatibility measure. However, these 


®The same analyses were performed for go 
composed of same-sex and opposite-sex a 
counselor pairs. The results were generally qu 
similar for these two groups and the small jue | 
ences between them which were found were att? | 
table to the differences, reported above, between ; 
male and female clients. 1 
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variations in direction are not random—every 
significant correlation between a compatibil- 
ity score in the I and A need areas and an 
outcome measure is positive in sign and all 
the significant correlations involving the C 
need area are negative in sign. Since maxi- 
mum compatibility is defined as a score of 
zero and the higher the outcome score, the 
more favorable the client's evaluation, these 
results indicate that compatibility in I and 
A need areas is associated with less client 
satisfaction, lower evaluation of the coun- 
selor, and less comfort and rapport on the 
part of the client. The expected relationship 
between compatibility and outcome was 
found only in the C need area. 

Although K, the global compatibility 
measure suggested by Schutz, .proved of no 
value as a predictor in this study, it remains 
possible that some other composite of the 
nine individual compatibility scores could 
produce higher correlations with the criteria 
than does any single compatibility term. 
The authors have noted two difficulties with 
the K measure, one methodological and the 
other empirical, which should be avoided in 
the formation of such a composite. The meth- 
odological problem concerns the measure of 
originator compatibility. Unlike the other 
cempatibility measures, it can take on both 
negative and positive values and is not really 
a compatibility measure at all. Thus origi- 
nator compatibility terms should be excluded 
in the formation of a composite. The second 
difficulty is that in the formation of K all 

ms are weighted equally, that is, they are 
all given a weight of +1. However, it was 
found that some compatibility scores corre- 
late positively and some negatively with the 
criteria and thus tend to cancel each other 
when combined. In order to avoid this prob- 
lem, the terms entering the composite should 
be weighted in the direction of their correla- 
tion with the criteria. If a measure of overall 
compatibility has any utility at all, a com- 
posite formed in this way should yield ap- 
preciably higher correlations than those based 


on the individual compatibility terms. The - 


prediction of general evaluation scores in the 
female sample was selected as a test case. 
f The composite score was obtained in the 
| following manner. First all individual com- 
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TABLE 2 


PRODUCT-MOMENT AND MULTIPLE CORRELATION CO- 
EFFICIENTS BASED ON FIRO-B COMPOSITE SCORES 
(FEMALES FOR GENERAL EVALUATION) 


Predictor Correlation 
Compatibility composite Son 
Client-counselor composite .39* 
Multiple regression 54 

*5 «.05. 
=> < 01, 
> < 001. 


patibility scores were standardized. Then 
each compatibility score which was signifi- 
cantly (p < .05) correlated with the criterion 
was given a weight of +1 when the corre- 
lation was positive and —1 when the corre- 
lation was negative. Thus, the composite con- 
sisted of rK! + rK^ + xK! — xK? and a sin- 
gle composite score was derived for each 
client by summing his weighted standard 
scores. If, for example, a client’s standard 
scores on the four measures were 45, 55, 40, 
and 35, respectively, his composite score 
would be 105. For purposes of comparison, 
a composite based on the significant (p < 
.10) client and counselor score predictors was 
formed in an identical fashion.” These two 
composites were then correlated with the gen- 
eral evaluation score and, in addition, a multi- 
ple-regression coefficient using both com- 
posites as predictors was obtained. The 
results are shown in Table 2. 

Perhaps the clearest way to evaluate the 
utility of the compatibility composite is to 
compare the proportion of variance explained 
by the composite to the proportion of vari- 
ance explained by the best individual com- 
patibility score predictor, that is, by a com- 
parison of 7°. The best single predictor is 
reciprocal compatibility in the I need area— 
it accounts for 12% of the variance of the 
criterion. In comparison, the compatibility 


7 Composites can also be found on the basis of 
multiple-regression analysis, but the Procedure re- 
ported is (a) simpler, (b) more similar to the 
manner in which K, to which it is compared, is 
formed, and (c) likely to be more stable than a 
multiple-regression composite. Nevertheless multiple 
correlation coefficients were also calculated for these 
data; they are virtually identical to those reported 


in Table 2. 
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score composite accounts for 25% of the 
variance. Clearly the use of the composite 
does result in an appreciable improvement 
in prediction, doubling the amount of ex- 
plained variance. This stands in sharp con- 
trast to the results obtained for the K com- 
posite, but this finding should be interpreted 
cautiously for the following reasons. The 
composite was formed a posteriori in an at- 
tempt to maximize prediction. Thus, while 
the data show that a composite can improve 
prediction, cross-validation is necessary to 
evaluate the stability of this particular com- 
posite. Further, some terms enter the com- 
posite with positive and some with negative 
weights; consequently, despite its practical 
value, it is difficult to ascribe a meaning, 
particularly in terms of compatibility, to the 
composite. 

The authors noted before that there is a 
problem in making the inference that results 
are attributable to client-counselor matching 
(compatibility) rather than to the character- 
istics of the client or counselor alone. The 
data of Table 2 also provide information on 
this point. If compatibility does have a pre- 
dictive function independent of the client and 
counselor scores, the multiple-regression pre- 
diction based on both composites should be 
significantly better than the prediction based 
on the client-counselor composite alone. A 
comparison of the two coefficients (.39 versus 
.54) indicates that this is the case. In con- 
trast, there is little difference between the 
multiple-regression correlation and the cor- 
relation based on the compatibility score 
composite alone (.50 versus .54). Further- 
more, the compatibility score composite re- 
mains a significant predictor of outcome after 
the effect of the client-counselor score com- 
posite is partialed out (712.3 = .41, p = .01). 
Thus, the evidence indicates that FIRO-B 
compatibility scores are important indepen- 
dent predictors of outcome for the female 
client. 

Discussion 


This study was undertaken to examine the 
hypothesis that client-counselor compatibil- 
ity, as measured by FIRO-B, would be re- 
lated to the outcome of counseling. Such a 
finding would, of course, add strength to 


GERALD A. MENDELSOHN AND Nett O. RANKIN 


Sapolsky’s suggestion that compatibility may 
contribute to the establishment of a good 
therapeutic relationship. Although the au- 
thors’ results, like those of Sapolsky, indicate 
that some scores derived from FIRO-B are 
effective predictors of outcome, the difference 
between the results of the two studies are 
considerable. Perhaps the most clear-cut dif- 
ference is that the global compatibility meas- 
ure, K, which works quite well in Sapolsky’s 
study, is a failure in the present study. The 
effects of compatibility in different need 
areas, it seems, can be directly opposite, a 
possibility obscured by the use of the K 
score. While it should be kept in mind that | 
the two studies differ radically in the sam- | 
ples and processes studied, the evidence casts 
serious doubt on the generality of the positive 
relationship between "compatibility" and 
favorable outcomes found by Sapolsky. 
Further doubt on the generality of this re- 
lationship arises from the differences in re- 
sults for males and females. Since Sapolsky 
studied only females, no comparisons are 
possible, but it is clear that for the present 
sample, compatibility, as operationalized by 
Schutz, is a variable of consequence only for 
females. This may reflect the well-substanti- 
ated finding that, compared to males, females 
are more sensitive and responsive to inter- 
personal behavior and are more dependent 
on others (Tyler, 1965). Consequently, coms 
patibility is more likely to affect their re- 
sponse to counseling than it is to affect the 
response of the males. But whether or E 
this interpretation of the obtained sex di 
ference is correct, it is clear that generalizing 
about the clinical relationship or the clinical 
process on the basis of results for one sex I$ 
a dangerous procedure, Yet the number of | 
studies in this area which make any attempt 
to explore potential sex differences is quite 
small. Given the consistency with which dif- 
ferences in social behavior between males an 
females have been found, this seems an W 
fortunate omission which may as easily 
to the rejection of valid but limited hypoth 
eses as to the overgeneralization of finding: 
The significant correlations between or 
patibility scores and outcome variables 0d 
low a consistent, but surprising patt i 
compatibility in the I and A need areas 
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related to unfavorable outcomes and com- 
patibility on the C dimension is related to 
favorable outcomes. If the FIRO-B compati- 
bility indexes really measure what they are 
supposed to measure, the results for C are 
expectable and support the notion that the 
direction of the counseling process should be 
shared by the two participants. The results 
for the I and A need areas, the two dimen- 
Sions most concerned with the affective as- 
pects of relationships, on the other hand, 
appear paradoxical However, findings of 
other studies of the effects of client-counselor 
matching (Carson & Heine, 1962; Mendel- 
sohn, 1966; Mendelsohn & Geller, 1965, 
1967) suggest that conditions which encour- 
age closeness can have a deletorious effect on 
the clinical process. Carson and Heine, for 
example, suggest that high similarity of per- 
sonality can lead on the part of the therapist 
to an overidentification with the patient and 
his problems. It seems a tenable hypothesis 
that factors which foster strong emotional 
attachments in typical social relations can, 
unless carefully handled, lead to an excessive 
personalization of the special relationship 
which characterizes counseling and psycho- 
therapy. 

Without comparable data from other clini- 
cal settings, it is impossible to know whether 
the specific results found in the present study 
will be found elsewhere. There are, moreover, 
Other, simpler, compatibility scores possible 
which could be as effective, or more effective, 
than those suggested by Schutz. Despite these 
reservations and despite the differences be- 
tween the present study and that of Sapolsky, 
one central point emerges—FIRO-B scores 
can generate some remarkably good predic- 
tions of outcome, at least for females. Con- 
sidering that (a) the tests are administered 
before the beginning of counseling and 
heither participant knows his own or his part- 
ner’s test results, (b) the results attributed 
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to compatibility scores cannot be explained 
in terms of client or counselors scores alone, 
(c) there are differences among counselors in 
background and style and among clients in 
objectives and maturity, (d) there is a 
myriad of important events which take place 
in the counseling itself, and (e) there is a 
considerable time lapse between the testing 
and evaluation of outcomes, a correlation of 
the magnitude of .54 can be described with- 
out exaggeration as remarkable. That such 
a result was obtained suggests the potential 
importance and utility of systematic, em- 
pirically based, matching of client to coun- 
selor. Effective matching alone can hardly 
guarantee success, but the results of this 
study and of Sapolsky's strongly suggest that 
it is a feasible and practicable way to facili- 
tate favorable clinical outcomes. 
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PSYCHOTHERAPEUTIC CHANGE AND SOCIAL ADJUSTMENT: 


A REPORT OF A NATIONAL SURVEY OF PSYCHOTHERAPISTS 


RUTH K. GOLDMAN ax» GERALD A. MENDELSOHN ? 
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A representative national sample of psychotherapists (psychiatric social work- 
ers, psychologists, and psychiatrists) was asked to describe by means of 
Gough’s Adjective Check List (a) themselves, (b) the type of patient with 
whom they work best (preferred), (c) a patient who has received maximum 
benefit from treatment (cured), and (d) a person with satisfactory adaptation 
to himself and his environment (normal). Despite the heterogeneity of the 
sample, there is a remarkably high level of agreement among therapists in 
their descriptions. Therapists feel they work best with a patient who exhibits 
little pathology and the successfully treated patient is described almost entirely 
in terms of positive social functioning. Therapists do not see the preferred, 
cured, or normal as similar to themselves. These data suggest an implicit 
consensus about the goals of psychotherapy and point to the salience of a 


social adjustment criterion in the evaluation of outcome. 


During the past two decades, there has 
been a growing recognition of the contribution 
of the psychotherapist to the treatment proc- 
ess. Strupp (1960) describes this trend as a 
“ ‘rediscovery’ of the therapist as a person, 
whose attitudes, personality and values pre- 
sumably play an exceedingly important, if 
not crucial, part in the therapeutic process 
[pp. 66-67].” Undoubtedly, one important 
influence on the values and attitudes, if not 
the personality, of the individual therapist is 
the set of beliefs held by his fellow practicion- 
ers, first as they are his teachers and then as 
they are his peers. Yet, one knows very little 
about what therapists as a group are like 
or what they believe about psychotherapy. 
Usually studies of therapists have used small 
samples, comprised of Ss drawn from a single 
clinic or a limited geographical area and con- 
fined to only one profession, most often psy- 
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chology. The authors know of only one study 
to date in which a national sample of thera- 
pists of all relevant professions was used 
(Chance & Arnold, 1960), but in that study, 
all the participants came from six major 
cities, 

The present research was undertaken to 
obtain normative information which could 
make possible some generalization about 
therapists’ self-perceptions, and their values 
and attitudes toward the patients they treat. 
The approach was to ask a representative, 
nationwide, sample of therapists to describe 
themselves, the patient at the outset and at 
the termination of therapy, and the “normal” 
person. These descriptions were then used to 
draw inferences about some of the concep- 
tualizations therapists share about who can 
profit from therapy and the nature of psy- 
chological well-being. 


METHOD 
Procedure 


Twelve hundred psychotherapists were randomly 
selected from the national directories of the Ameri- 
can Psychiatric Association (1961-62), the Clinical 
Division of the American Psychological Association 
(1962), and the Psychiatric Social Work section of 
the National Association of Social Worker’s Direi 
tory (1960). Materials consisting of four adjective 
checklists, a Personal History Questionnaire 
(PHQ), detailed instructions, and a cover letter 
were sent to this group of therapists. At the en 
of a 2-mo. period, a second set of materials he 
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sent to all nonrespondents. Three months after the 
mailing of the second letter, all returns were an- 
alyzed; materials which arrived after this date were 
not included in this analysis. 


Instruments 


Therapists were asked to describe the following 
four concepts by means of Gough’s Adjective Check 
List: (a) “check those adjectives which describe 
you” (self), (b) “check those adjectives which de- 
scribe the kind of adult male patient that you work 
with best” (preferred), (c) “check those adjectives 
which describe an adult male patient that has termi- 
nated therapy successfully” (cured), and (d) “check 
those adjectives which describe an adult male who 
has a satisfactory adaptation to himself and his 
environment" (normal). Order of presentation was 
balanced in order to avoid response set due to 
sequence. 

Gough's Adjective Check List (ACL) is a stan- 
dardized list of 300 words which provides a par- 
ticularly efficient way to obtain a wide range of 
behavioral descriptions of people. A detailed presen- 
tation of the rationale, development, and standardi- 
zation of the checklist is available in Gough and 
Heilbrun (1965). For the authors’ purposes, it has 
the following particular merits: It utilizes no special 
language—in particular it is free of psychiatric and 
theoretical terminology which might act to restrict 
the range of behavior described; it is not oriented 
toward pathology; it is easily understood; it can be 
Scored objectively; and it is consistent with every- 
day modes of describing people. The value of the 
ACL as a descriptive instrument has been discussed 
by Gough (1960) and is well illustrated by Mac- 
Kinnon's (1963) study of creative architects. 

The Personal History Questionnaire was devised 
in order to obtain information about each therapist's 
background and training. They were asked to indi- 
cate the following: profession, sex, religion, parents’ 
socioeconomic status, years of personal psychother- 
apy, years of experience as a psychotherapist, and 
theoretical persuasion. These data were used in the 
Present study to evaluate the representativeness of 
the sample of respondents. In subsequent papers, the 
effect of these variables on the descriptions of the 
concepts will be reported. 


Response Rate 


A pilot study was conducted in order to determine 
an expected response rate. The results of this study 
indicated that the return rate of psychologists and 
Psychiatric social workers was twice that of psychia- 
trists. Therefore, in the final study, twice as many 
Psychiatrists as social workers or psychologists were 
contacted in order to obtain approximately equal 
numbers of respondents in each profession. 

Of 1,200 therapists to whom the materials were 
Sent, 186 could not be reached, 443 responded, and 
of this number, 421 provided data suitable for an- 
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alysis. By profession 29% (n — 144) of psychiatrists, 
61% (n=143) of clinical psychologists, and 60% 
(n= 134) of psychiatric social workers responded. 
These figures represent about 5% of the total popu- 
lation in each profession. The response rate of psy- 
chiatrists is consistent with that generally obtained 
for mail surveys, but for the other two professions, 
it is considerably higher than one would expect 
(Moser, 1958; Sundland & Barker, 1962; Wallach 
& Strupp, 1960). 


Sample Characteristics 


The characteristics of the sample appear in Table 
1. 

Since the intent of this study was to obtain nor- 
mative data about psychotherapists, it is important 
to ascertain whether respondents are a representative 
sample of the population from which they were 
drawn. Two surveys of mental health specialists are 
available for evaluation of this question: the De- 
partment of Health, Education, and Welfare (HEW) 
studies on mental health manpower (1963) and the 
American Psychiatric Association manpower study 
(1963a, 1963b). A minimum of 80% of the mem- 
bers of the national psychiatric, psychological, and 
social work organizations participated in the HEW 


TABLE 1 
SAMPLE CHARACTERISTICS 


Characteristic Percentage 

Sex 

Male 71 

Female 29 
Religion 

Protestant 45 

Catholic 12 

Jewish 40 

None 3 
Socioeconomic status 

Upper 32 

Upper middle 14 

Middle middle 14 

Lower middle 20 

Lower 20 
Personal Psychotherapy 

None 42 

Under 3 yr. 24 

Over 3 yr. 34 
Experience 

nder 5 yr. 16 

5-10 yr. 46 

Over 10 yr. 38 
Profession 34 

Psychiatrists 

Psychologists 34 

Social workers — 32 
Theoretical Persuasion 

Psychoanalytic 63 

Interpersonal 25 

Eclectic 12 

Note.—N = 421. 
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survey and the American Psychiatric Association 
provides data on its total membership.* 

The samples of psychiatrists and psychiatric so- 
cial workers are virtually identical in age, sex, and 
type of clinical employment to the populations from 
which they were drawn. In addition, the data on 
years of experience and religious affiliation of psy- 
chiatrists in the present sample were strikingly simi- 
lar to those reported in the HEW study. Thus, these 
data and those of the Department of Health, Edu- 
cation, and Welfare are quite similar for psychia- 
trists and social workers, although there is the least 
amount of comparison data available for the latter 
group. 

The sample of clinical psychologists is somewhat 
less representative of the national population sur- 
veyed by HEW. However, the discrepancies between 
the data for this sample and those for the HEW 
group are small; the differences, when they appear, 
are of the order of 1096. The present sample of 
psychologists contains more females, self-employed 
therapists, and residents of California than the 
HEW group does. The data on age and years of 
experience, on the other hand, are very similar. 
Considering these comparisons for all three pro- 
fessions, there is reason to have confidence in gen- 
eralizing from the results obtained for the authors’ 
sample to the population of psychotherapists as a 
whole. 


RESULTS AND DISCUSSION 


The results will be presented in two parts: 
in the first, the adjectives most frequently 
chosen by the therapists to describe each con- 
cept will be considered and in the second, the 
focus will be on the adjectives which differ- 
entiate between concepts.* 


Self-Concept 


The 25 adjectives most frequently checked 
to describe the self are: dependable (90%), 
capable (86%), conscientious (86%), intelli- 
gent (86%), friendly (85%), honest (85%), 
adaptable (83%), responsible (82%), rea- 
sonable (81%), reliable (81%), considerate 
(80%), cooperative (80%), understanding 


8 For clinical psychologists, there were no signifi- 
cant differences between respondents and nonrespon- 
dents in the Health, Education, and Welfare study. 

4 Gough and Heilbrun provide a set of 24 scales 
for scoring the ACL, but these scales were stan- 
dardized for self-description only. Consequently, 
their applicability to the descriptions of the “pre- 
ferred,” “cured,” and “normal” is questionable. It 
seemed to the authors, too, that item analyses would 
provide the clearest, most direct, and most detailed 
picture of the therapists’ descriptions, 
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(80%), active (79%), sincere (79%), alert 
(18%), realistic (78%), fair-minded (76%), 
healthy (75%), interests-wide (75%), sen- 
sitive (74%), curious (73%), helpful 
(73%), tolerant (73%), and warm (72%). 

It is apparent from inspection of this list 
that the proportion of therapists endorsing 
these adjectives is very high. This finding is 
not limited to the self-description, but is 
characteristic of the results for all four con- 
cepts. This may not seem surprising since 
these adjectives were selected for presenta- 
tion because they were the most frequently 
checked. But it is important to keep in mind 
that there are 300 adjectives to choose from 
and that on the average, a therapist checked 
100 of them; that is, the chance probability 
of any given adjective being checked is .33. 
Furthermore, the sample is quite heterogene- 
ous with respect to profession, experience, 
training, sex, socioeconomic origins, and 
theoretical inclination. Consequently, these 
percentages are much higher than expected; 
there is reason to have confidence in charac- 
terizing the attitudes of psychotherapists in 
general from these data. | 

The self-portrait which emerges is of a 
person who is steady, competent and intel- ' 
lectually alert, and concerned with the wel- 
fare and feelings of others. Adjectives ass0- 
ciated with originality, maturity, enthusiasm, 
and spontaneity are checked less frequently 
however. This contrast is even more striking | 
when the responses of the psychotherapists 
are compared to those of architects (Mac | 
Kinnon, 1963). At least 80% of both groups 
describe themselves as capable, conscientious; — 
cooperative, friendly, honest, intelligent, a 
reasonable. The therapists describe them 
selves more frequently than the architects 
as adaptable, considerate, dependable, relia- 
ble, responsible, and understanding. The at- 
chitects describe themselves with considerably 
greater frequency than the therapists as iM 
aginative, idealistic, civilized, and serious 
Both groups, then, consider themselves Wor 
thy of trust, skilled, and sensitive to othe 
people. The therapists place greater empht 
sis on consistency and dependability- 
most striking difference between the W 
groups though is the architects’ greater con 
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cern for originality and their greater sense of 
personal involvement. 

This pattern of results leads one to con- 
clude that the therapists were describing 
themselves, in part, in the role of therapist 
and, in part, as individuals; to the extent 
that this is true, the high level of agreement 
among therapists points to a considerable 
consensus on how the therapist ought to be- 
have toward patients. Stability, objectivity, 
and a willingness to help are stressed but, as 
could be expected, indications of affect are 
considerably less emphasized. 


Preferred Patient 


The 25 adjectives most frequently checked 
to describe the preferred patient are: intelli- 
gent (75%), alert (74%), adaptable (70%), 
frank (68%), honest (68%), cooperative 
(67%), anxious (65%), sincere (65%), sen- 
sitive (64%), curious (62%), active (61%), 
interests-wide (61%), conscientious (60%), 
imaginative (58%), friendly (57%), insight- 
ful (56%), dependable (55%), capable 
(54%), reasonable (54%), reflective (54%), 
clear thinking (53%), thoughtful (53%), 
warm (53%), appreciative (51%), and re- 
sponsible (50%). 

There is less agreement among therapists 
in their description of the preferred patient 
than there is about any of the other concepts. 
Nevertheless, the agreement is sufficiently 
high to provide a consistent description of the 
kind of patient with whom the therapists 
work best. He is an imaginative, sensitive, 
Curious, well-motivated, but anxious person. 
Aside from the anxiety, there is little indi- 
cation of pathology or a “cry for help” on 
his part. Adjectives such as despondent, high- 
strung, hostile, impulsive, suspicious, timid, 
unstable, and withdrawn, to cite several ex- 
amples, are checked by fewer than one- 
quarter of the therapists. In fact, if anxious 
were removed from the list of the 25 most 
descriptive adjectives, the preferred patient 
would appear to be an unusually productive 
and creative person. 

There are several possible interpretations 
of what at first seems a puzzling finding. 
The obtained pattern suggests that therapists 
have found in practice trends which parallel 
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those of a number of empirical studies, 
namely, that patients who benefit most from 
psychotherapy are those with the least 
amount of manifest pathology (Barron, 1953; 
Sullivan, Miller, & Smelser, 1958; Taulbee, 
1958). This finding raises serious issues when 
one considers that the patient described by 
therapists as most capable of profiting from 
treatment is not typical of the majority of 
those who feel in need of psychological help 
(Gurin, Veroff, & Feld, 1960). 


Cured Patient 


The 25 adjectives most frequently checked 
to describe the cured patient are: adaptable 
(86%), dependable (79%), honest (77%), 
alert (76%), reasonable (76%), confident 
(75%), friendly (75%), realistic (75%), 
responsible (75%), active (72%), clear 
thinking (72%), capable (71%), self-con- 
fident (71%), cooperative (70%), consider- 
ate (69%), sincere (69%), insightful (68%), 
tolerant (67%), reliable (66%), mature 
(65%), rational (65%), stable (65%), nat- 
ural (64%), understanding (64%), and ap- 
preciative (63%). 

Agreement among therapists in describing 
a patient who has terminated therapy suc- 
cessfully is again quite high. The cured pa- 
tient, as described by therapists, is a person 
who is very well adapted to his environment. 
There are no indications of psychological 
disturbance; instead he has acquired self- 
confidence, contentment, and a measure of 
tolerance and stability. He is, above all, a 
good citizen, capable of taking a responsible 
role in society. But except for the adjective 
natural there is little indication that he is 
seen as a freer, more creative, more unique 
individual. The criterion of successful termi- 
nation seems to be based on social effective- 
ness rather than on self-actualization. 


Normal Person 


The 25 adjectives most frequently checked 
to describe the normal are: adaptable (94%), 
dependable (90%), capable (87%), con- 
fident (86%), friendly (86%), honest 
(85%), realistic (85%), clear-thinking 
(84%), alert (83%), cooperative (83%), 
reasonable (83%), responsible (83%), active 
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(82%), considerate (81%), mature (80%), 
sincere (79%), tolerant (79%), affectionate 
(78%), appreciative (78%), reliable (78%), 
self-confident (78%), warm (78%), con- 
scientious (77%), stable (76%), and tact- 
ful (76%). 

The highest level of agreement among ther- 
apists is found in their description of the 
normal person. There is a marked bimodality 
in the distribution of the percentages for the 
adjectives—either they are checked by more 
than a majority of therapists or they are 
checked very infrequently, that is, by less 
than 10% of the Ss. 

An “adult male who has a satisfactory 
adaptation to himself and his environment” 
is seen as possessing all the positive, but none 
of the negative characteristics of the other 
three concepts. He is reality oriented, so- 
cially effective, self-accepting, free of inner 
conflict, and concerned with the welfare of 
others. But in contrast to the self, preferred, 
and cured, the normal is described as more 
affectively involved in his relationships. It 
is the one description which gives a sense of 
balance between emotion and control. 


Comparisons between Concepts 


In the discussion which follows, the em- 
phasis will be on those adjectives which dif- 
ferentiate between concepts to the greatest 
extent. 


Self-Concept Compared with Preferred 
Patient Concept 


Two-thirds of the adjectives differentiate 
between the two concepts at the .05 level of 
confidence or beyond. In order to highlight 
the differences between the self and the pre- 
ferred, the 12 items with the greatest dis- 
crimination in favor of the self and the 12 
with the greatest discrimination in favor of 
the preferred patient will be presented. In 
this and in subsequent comparisons of con- 


5Since the same therapists provide data on all 
four concepts, it is necessary to take this potential 
source of dependency in the data into account in 
performing statistical tests for differences between 
concepts. This was done by using the chi-square 
test for a repeated measurements situation suggested 
by McNemar (1949, pp. 204-207). It is based on who 
changes response from one concept to the next. 
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cepts, only those adjectives for which a differ- 
ence of 10% or more between concepts was 
obtained will be included. 

The 12 adjectives which differentiate most 
in favor of the self (versus preferred) are: 
helpful (73% versus 27%), understanding 
(80% versus 38%), healthy (75% versus 
34%), tactful (71% versus 3096), considerate 
(80% versus 41% ), sympathetic (72% versus 
33%), loyal (65% versus 28%), discreet 
(56% versus 20%), forgiving (65% versus 
29%), reliable (81% versus 45%), dependa- 
ble (90% versus 55%), and independent 
(65% versus 30%). 

The 12 adjectives which differentiate most 
in favor of the preferred (versus self) are: dis- 
satisfied (45% versus 16%), confused (32% 
versus 3%), fearful (37% versus 9%), an- 
xious (65% versus 38%), immature (31% 
versus 10%), despondent (23% versus 5%), 
nervous (33% versus 16%), awkward (22% 
versus 8%), self-punishing (26% versus 
13%), unstable (14% versus 1%), with- 
drawn (19% versus 7%), and distrustful 
(17% versus 6%). 

There are two primary dimensions on 
which the concepts are differentiated. The 
first concerns interpersonal relations and the 
second psychopathology. Compared to the 
preferred patient, the therapists describe 
themselves as compassionate and responsive 
to the needs and feelings of other people 
The preferred patient is described expecta 
bly as more disturbed, self-absorbed, and 
subject to inner tension than the self. The 
discrimination is considerably greater on the 
first dimension than on the second, however 
consider, for example, the adjective anxious, 
the pathological trait most characteristic © 
the preferred. It is checked by 65% of the 
apists for the preferred and by 387608 
therapists for themselves. The percentage dif- 
ference of 27% is less than that for any ° 
the dozen adjectives discriminating in fave! 
of the self. Thus, it seems that the self 5 
distinguished from the preferred more Ll 
positive qualities than by the negative one 
associated with pathology. It will be a 
that this is not characteristic of the ot 
two comparisons in which the preferred ap 
pears. 


| 
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Self-Concept Compared with Cured Person 
Concept 


More than two-thirds of the adjectives dif- 
ferentiate between the two concepts at the .05 
level of confidence or beyond. 

The 12 adjectives which differentiate most 
in favor of the self (versus cured) are: intelli- 
gent (86% versus 43%), sentimental (49% 
versus 13%), serious (69% versus 33%), 
idealistic (57% versus 25%), reserved (42% 
versus 14%), soft-hearted (34% versus 6%), 
quiet (40% versus 13%), conscientious 
(86% versus 60%), curious (73% versus 
48%), cautious (49% versus 24%), impa- 
tient (28% versus 4%), and anxious (38% 
versus 15%). 

There are only five adjectives which dif- 
ferentiate in favor of the cured (versus self) 
and meet the additional criterion of a 10% 
difference in percentage checked. They are 
self-confident (72% versus 50%), confident 
(75% versus 55%), relaxed (56% versus 
37%), assertive (63% versus 51%), and 
contented (43% versus 32%). 

Perhaps the most striking difference be- 
tween the two concepts is the greater self- 
assurance and confidence ascribed to the 
cured. In contrast, the self is described as 
more introverted and less at ease with the 
world. But the therapists also see themselves 
as more intellectually open, thoughtful, and 
affectively responsive than the cured. There 
is a sense of vitality in the self-description 
which is lacking in the description of the 
cured. 


Self-Concept Compared with the Normal 
Person Concept 


Three-fifths of the adjectives differentiate 
the self-concept from that of the normal 
person at the .05 level of confidence or be- 
yond. 

The 12 adjectives which differentiate most 
in favor of the self (versus normal) are: 
sentimental (49% versus 20%), anxious 
(38% versus 10%), serious (69% versus 
43%), reserved (42% versus 16%), impa- 
tient (28% versus 2%), cautious (49% ver- 
sus 24%), tense (27% versus 3%), compli- 
cated (37% versus 15%), argumentative 


(26% versus 4%), emotional (38% versus 
16%), inhibited (24% versus 4%), and 
stubborn (24% versus 4%). 

The 12 adjectives which differentiate most 
in favor of the normal (versus self) are: con- 
fident (86% versus 55%), self-confident 
(78% versus 50%), relaxed (64% versus 
37%), wholesome (60% versus 37%), cou- 
rageous (59% versus 36%), contented (53% 
versus 32%), calm (70% versus 49%), ma- 
ture (80% versus 64%), natural (72% ver- 
sus 56%), spontaneous (62% versus 46%), 
resourceful (71% versus 56%), and asser- 
tive (66% versus 51%). 

The tendency of the therapists to describe 
themselves as prone to conflict and lacking 
in self-assurance is most evident in the com- 
parison of the self and normal. This tendency 
to include a moderate level of negative char- 
acteristics in the self-description appeared 
also, but to a lesser degree, in the comparison 
of the self and cured. However, the greater 
sense of vitality which differentiates the self 
from the cured does not differentiate the self 
from the normal. 


Preferred Patient Concept Compared with 
Cured Patient Concept 


Three-quarters of the adjectives differen- 
tiate the preferred from the cured patient at 
the .05 level of confidence or beyond. 

The 12 adjectives which differentiate most 
in favor of the preferred (versus cured) are: 
anxious (65% versus 15%), dissatisfied 
(45% versus 6%), intelligent (75% versus 
43%), fearful (37% versus 6%), confused 
(32% versus 3%), talkative (40% versus 
12%), nervous (33% versus 6%), defensive 
(33% versus 7%), emotional (47% versus 
21%), immature (31% versus 5%), in- 
hibited (32% versus 6%), and outspoken 
(42% versus 16%). 

The 12 adjectives which differentiate most 
in favor of the cured (versus preferred) are: 
confident (75% versus 29%), self-confident 
(71% versus 30%), stable (65% versus 
31%), contented (43% versus 9%), mature 
(65% versus 33%), independent (62% ver- 
sus 30%), realistic (76% versus 45%), re- 
laxed (56% versus 25%), self-controlled 
(53% versus 23%), affectionate (62% ver- 
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sus 34%), tactful (57% versus 29%), and 
healthy (60% versus 33%). 

As one would expect, the adjectives which 
discriminate between the preferred and cured 
patient concepts reflect psychopathology on 
the one hand and psychological well-being 
on the other. The three adjectives which de- 
viate from this generalization are talkative, 
intelligent, and outspoken. The significance 
of the first needs no comment but the pres- 
ence of the latter two is of considerable in- 
terest. The authors noted in the discussion 
of the adjectives most characteristic of the 
preferred patient, the appearance of an im- 
portant dimension of intellectual vigor, sen- 
sitivity, interest, openness, indeed of crea- 
tivity. If one examines differences between 
the preferred and cured patient concepts on 
this dimension, one finds the preferred de- 
scribed more often than the cured as artistic 
(p< 001), clever (p< .001), complicated 
(p < .001), spunky (p < .001), curious (p < 
.01), idealistic (p < .02), original (p < .05), 
imaginative (p = .05), and individualistic (p 
= .05). These data suggest that while the 
cured person is described as having achieved 
maturity, self-control, stability, and content- 
ment, he does not appear self-actualized or 
creative, or a genital character, etc. Anxiety, 
defensiveness, and socially maladaptive be- 
havior are absent compared to the preferred, 
but so too are originality and zest. 


Preferred Patient Concept Compared with 
Normal Person Concept 


Four-fifths of the adjectives significantly 
differentiate the preferred and normal per- 
son concepts at the .05 level of confidence or 
beyond. 

The 12 adjectives which differentiate most 
in favor of the preferred (versus normal) are: 
anxious (65% versus 9%), dissatisfied (45% 
versus 3%), fearful (37% versus 1%), con- 
fused (32% versus 1%), nervous (33% ver- 
sus 2%), emotional (47% versus 17%), im- 
mature (31% versus 1%), tense (32% versus 
3%), talkative (40% versus 12%), inhibited 
(32% versus 4%), defensive (33% versus 
5%), and worrying (29% versus 3%). 

The 12 adjectives which differentiate most 
in favor of the normal (versus preferred) are: 
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confident (86% versus 29%), self-confident 
(78% versus 30%), mature (80% versus 
33%), tactful (76% versus 30%), calm 
(71% versus 25%), stable (76% versus 
31%), contented (53% versus 9%), affec- 
tionate (78% versus 35%), independent 
(71% versus 30%), helpful (68% versus 
27%), realistic (85% versus 45%), and 
considerate (81% versus 41%). 

The adjectives which discriminate most be- 
tween the preferred and the normal are 
largely the same as those which discriminate 
most between the preferred and cured. How- 
ever, those adjectives descriptive of creativ- 
ity and intellectual vitality, for example, 
curious, imaginative, original, etc., are no less 
characteristic of the normal than of the 
preferred. The differentiation of the two con- 
cepts, then, is purely in terms of the con- 
trast between psychopathology and social 
psychological well-being. 


Cured Patient Concept Compared with 
Normal Person Concept 


The cured and the normal person concepts 
are the most similar of all the pairs, but still 
one-half of the adjectives significantly dil 
ferentiate the two. The only adjectives which 
discriminate significantly in favor of thé 
cured are endorsed on the average by only 
6 or 7% of the therapists and consequently 
do not fulfill the criterion for inclusion M 
the comparison, that is, a 10% difference 
between the two concepts in percentag? 
checked. They refer exclusively to small rest 
dues of pathological traits in the cured, for 
example, fearful, immature, etc. Since this is 
the case; the authors will present the 20 
adjectives which differentiate most in favo! 
of the normal. They are intelligent (66 
versus 43%), good-natured (57% er 
35%), generous (61% versus 4196), 10 
(57% versus 37%), wholesome (60% ka 
40%), courageous (59% versus 4076); " 
morous (61% versus 43%), tactful (7 
versus 58%), ambitious (70% versus 52 zn 
civilized (65% versus 47%), cathe 
(63% versus 4595), progressive (55% Vi 
sus 37%), thorough (49% versus sie | 
venturous (50% versus 32%), wise (4 
versus 24%), conscientious (777% Y 
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60%), discreet (56% versus 39%), versatile 
(51% versus 34%), original (45% versus 
28%), and resourceful (71% versus 55%). 

In examining this list, it is as important 
to note which adjectives do not appear as to 
note those which do. There is no important 
difference between the descriptions of the 
two concepts with regard to stability, com- 
petence, and reliability. Both the cured and 
the normal are seen as well adjusted in a 
socially defined sense. But the normal is also 
seen as possessing to a greater degree char- 
acteristics which transcend adjustment. Many 
theorists speak of psychotherapy as an ex- 
perience which frees human potentialities and 
fosters the growth of a unique identity. It is 
surprising, then, to find that it is a person 
who has “a satisfactory adaptation to him- 
self and his environment” who more nearly 
approaches this ideal than a person who “has 
terminated therapy successfully.” 


CONCLUSIONS 


The conclusions which emerge from this 
study are, first, that there is a marked con- 
sensus among therapists about, at least, some 
important aspects of psychotherapy. The au- 
thors do not mean to assert that all therapists 
have the same conceptions about all aspects 
of therapy, but rather that there are some 
attitudes and conceptions which can be de- 
Scribed as characteristic of psychotherapists 
in general. This conclusion is in contrast to 
the impression gained from such similar 
studies as those of Strupp (1958) and Sund- 
land and Barker (1962). It likewise con- 
trasts with the opinion of Colby (1964) that, 
"In the domain of psychotherapy . . . there 
is no simple shared paradigm commanding 
consensus." 

It seems to the authors that whether one 
concludes that diversity prevails or that a 
Consensus prevails is dependent upon what 
question is asked. If the concern is with tech- 
nique and the questions are phrased in lan- 
guage which is drawn from particular theo- 
retical orientations, it is likely that a picture 
of differences among therapists will emerge 
(Strupp, 1958; Sundland & Barker, 1962). 
However, if, as in the present case, a theo- 
retically neutral language is used and the 
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emphasis is on goals, rather than means to 
goals, a picture of consensus emerges. The 
extent of agreement about the description 
of a person who has a satisfactory adaptation 
to himself and to his environment moreover, 
suggests that it is possible to derive an em- 
pirical criterion of “positive mental health," 
at least, as this elusive concept is conceived 
by specialists in the field. 

Another point which emerges with con- 
siderable clarity concerns the relationship of 
the therapist to the patient he treats. There 
is little evidence either that the therapist 
feels that he works best with those patients 
who most resemble him, or that he sees the 
patient who has obtained maximum benefit 
from treatment as closely resembling himself. 
For that matter he does not see himself 
as normal (as he has defined it). This sug- 
gests that the therapist does not use himself 
as the point of reference in establishing treat- 
ment goals, but rather adheres to an external 
criterion defined, it seems, by the society in 
which he and his patients reside. 

The final point to be noted is the modesty 
of the expectations therapists hold for psy- 
chotherapy. This can be seen with respect to 
both the patient when he enters therapy, and 
when he has successfully terminated therapy. 
As pointed out earlier, the patient with whom 
therapists say they work best is intelligent, 
alert, well motivated, reliable, and lacking 
in serious psychopathology; anxiety is the 
only major indication of psychological dis- 
tress. In parallel fashion, the description of 
the successfully treated patient stresses his 
self-control and competence, his adaptation 
to social expectations. 

There is little of the self-actualized person 
in this description; rather it is adjustment 
that seems to be the main emphasis. While 
those who criticize psychotherapy for foster- 
ing conformity may find support for their 
view in these data, the authors would argue 
that they reflect what therapists believe to 
be practicable, rather than ideal, and point 
to a realistic appraisal of what therapy can 
achieve. Osgood and Luria (1954) in their 
study of a case of multiple personality pose 
the same question raised by these data, “In 
striving to achieve the goals of societal ac- 
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ceptability and individual happiness, does the 
therapist have to sacrifice the richness, indi- 
viduality and subtler adjustiveness of the 
patient?” Neither their study nor this one 
can provide an answer, but it is a question 
which is too serious to ignore. 
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CONSISTENCY OF SELF-DESCRIPTIONS UNDER DIFFERENT 
ROLE SETS IN NEUROTIC AND NORMAL 
ADOLESCENTS AND ADULTS: 


DAVID G. MARTIN? 
University of Iowa 


Fifteen male Ss in each of four groups (neurotic adolescents, normal adoles- 
cents, neurotic adults, and normal adults) were asked to describe themselves 
with a Q sort as they saw themselves in three different situations (self in 
general self with family, and self with friends). The tendency to describe 
themselves inconsistently was characteristic of neurotic adolescents specifically 
and not of adolescents or of neurotics in general. These results were discussed in 
relation with adolescence as a time of "identity crisis" and with previously 
reported findings that neurotics tend to have inconsistent self-concepts. 


The importance of self-consistency to per- 
sonality has probably been stressed most 
strongly by Lecky (1945) although many 
others have embodied the notion in their 
writing. Lecky stated as a motive the ten- 
dency to create a unified personality which 
the individual defines consistently for him- 
self and others. Other theorists who have 
stressed the development of a consistent 
“self” include Kelly (1955), Allport (1955), 
and Rogers (1951). 

As Wiley (1961) has pointed out in her 
summary of the literature on the self-concept, 
self-consistency has been postulated as de- 
scriptive of personality but little attention 
has been paid to the antecedents which lead 
to this consistency. In addition, most self- 
theorists have treated the self as a construct 
or an entity. In contrast with this, it is the 
author's position that the tendency toward 
self-consistency (defined here as consistency 
of self-description) is an acquired behavior 
and therefore partly a developmental prob- 
lem. The present investigation was based on 
two lines of previous research: the relation- 
ship between self-concept consistency and the 
psychology of adolescence and between this 
consistency and emotional disturbance. 


1This study was supported by the Institute for 
Juvenile Research, Chicago, Illinois, and is based 
on part of a study done as partial fulfillment of the 
requirements for the degree of Doctor of Philosophy 
at the University of Chicago. 

2 Requests for reprints should be sent to the 
author, Department of Psychology, University of 
Towa, Iowa City, Iowa 52240. 


Erickson (1956) has defined three criteria 
of “ego identity”: (a) the adolescent’s per- 
ception of himself as essentially the same 
person over time and interpersonal situations, 
(b) others’ perceptions of the adolescent as 
essentially the same over time and situations, 
and (c) an accrued confidence on the part 
of the adolescent that his perception of him- 
self corresponds to others' perceptions of him. 
This is a more inclusive definition than has 
been used in previous research, which has 
been focused on the first of these criteria. It 
is a widely held notion that adolescence is 
a time of "identity crisis" for all young 
people (Bronson, 1959; Erikson, 1956; Heil- 
brun, 1964; Hess & Goldblatt, 1957; Hood, 
1962; Howard, 1960; Musgrove, 1964; 
Rosenberg, 1965). However, none of the 
literature cited has been based on a com- 
parison between adolescents and adults. Fur- 
ther, Engle (1959) reported “relative stabil- 
ity of the self-concept” among adolescents 
tested over a 2-yr. period, supporting her 
position that “crystalization of the self-con- 
cept is achieved earlier in development.” Di- 
viding her data into disturbed and normal 
groups, Engle also found that the disturbed 
group had significantly less consistency in 
their self-concepts than had the normal group. 

A second issue is the relationship between 
emotional disturbance and selí-consistency. 
The available evidence seems to indicate that 
emotionally disturbed people have greater 
flux in their self-concepts than do normal 
people (Block, 1961; Brownfain, 1952; But- 
ler, 1956; Cartwright, 1961; Engle, 1959; 
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Gough & Heilbrun, 1965; Hickman, 1959; 

Phillipsson, 1955; Tolor, 1955; Woldstad, 

1961). However, with the exception of one 

(Gough & Heilbrun, 1965),° all of the studies 

reviewed in connection with this problem 

used undergraduate Ss and were actually ex- 

aminations of the relationship between self- 

concept inconsistency and emotional disturb- 

ance in late adolescence. 

The above observations suggested that the 
tendency to describe oneself inconsistently 
may be a prominent concomitant of emo- 
tional disturbance in adolescence, but that 
it is less frequently a characteristic of the 
disturbed adult; the presence of emotional 
disturbance may delay the successful clari- 
fication of a stable self-description. It was 
predicted that (a) neurotic adolescent boys 
would describe themselves less consistently 
under different role sets than would normal 
adolescent boys and (5) neurotic boys would 
differ from normal boys on this measure to 
a greater extent than neurotic men would 
differ from normal men. 


METHOD 


Instruments, Data Collection, and Scoring 


The instrument used was the Butler-Haigh Q sort 
(1954), modified to ensure readability of the items 
for a young group. Each S completed the Q sort 
three times. First Ss were asked to describe them- 
selves the way they were “in general,” arranging the 
cards in a normal distribution of nine piles. The Ss 
were then asked to describe “yourself the way you 
are with your friends” and “yourself the way you 
are when you are with your family.” The second 
and third sortings were reversed in order of pres- 
entation for half of the Ss. 

The operational definition of consistency of self- 
description was the mean squared correlation of the 
three possible combinations of Q sorts (ris, fıs, ra). 

Judgments of the "intensity of emotional disturb- 
ance" were made for each of the neurotic Ss on a 
5-point scale (1—none, 2—mild, 3—moderate, 4— 
marked, 5—very intense). A formal psychiatric diag- 
nosis was also made by staff conferences at an out- 
patient clinic to which each of the disturbed Ss had 
applied for services. 


3 Gough and Heilbrun, moreover, did not report 
results related specifically to emotional disturbance. 
Their data showed that men with lower test-retest 
reliability on the ACL were described by observers 
as having more unpleasant qualities, 
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Subjects 


A factorial design was used, and four groups of 
15 Ss each were tested: neurotic adolescent boys, 
normal boys, neurotic adult males, and normal adult 
males. All Ss were Caucasian and, among the adol- 
escents, had both parents living at home. The neu- 
rotic adolescents had an average IQ of 106. Previous 
large-scale testing programs done at the school 
from which the normal adolescent Ss were selected 
had resulted in IQ averages of 105-112 for different 
classes. All Ss were from middle-class homes, fol- 
lowing Warner (1960). The adolescents ranged from 
13 to 16 yr. old. The disturbed adolescent sample 
included all boys who fit the criteria and who had 
made their first contact with the clinic in the pre- 
ceding 4 mo. The disturbed adult sample was ran- 
domly drawn from a list of fathers who had a 
child being seen at the clinic and who themselves 
had gone through the clinic’s diagnostic procedure. 
Each neurotic S had been, within the preceding 4 
mo. judged to have a neurotic disturbance and to 
be at 3 or 4 on the 5-point scale of “intensity of 
emotional disturbance” discussed above. None of 
the disturbed Ss had received formal psychotherapy. 
The Ss in both of the normal groups were randomly 
chosen from the class lists of schools in a middle- 
class area. The normal adult Ss were selected from 
among parents of school children to make them 
similar to the disturbed adult sample in that all of 
the adult Ss had at least one young child. The mean 
age of the normal adult group was 37.2; of the 
disturbed adults, 39.7. This difference was not sig- 
nificant, according to a ¢ test (.20 < p < .30), After 
the selection of normal Ss, school guidance coun- 
selors were asked to eliminate from the list potential 
Ss whose families had a history of emotional dis- 
turbance which had come to the attention of the 
school, These judgments resulted in the elimination 
of 2 adolescent Ss and 1 adult S. 

In the process of soliciting the cooperation of Ss 
there were two refusals of cooperation by disturbed 
boys and no refusals by normal boys or their 
parents, Among the disturbed adults, one man re- 
fused to cooperate, and among the normal adult Ss 
seven declined. All of this latter group stated that 
their reasons were based upon conflicts and pressures 
of time, rather than on reluctance to participate. 


RESULTS 


Analysis of variance of the Q-sort consis- 
tency scores (Table 1) revealed both row 
and column effects and a significant interac: 
tion (? « .05), indicating that the instru 
ment used had differentially discriminat 
the groups. Following Lindquist (1953), / 
tests were employed to test the simple effects 
of interest. Table 2 presents the mean Q-sort 
consistency scores and /-test comparisons O 
the two adolescent groups and the two adult 
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TABLE 1 


ANALYSIS OF VARIANCE OF Q-SORT 
CONSISTENCY SCORES 


Source of variation af MS F 
Age (A) 1 | 166 | 557* 
Emotional health (B) 1 134 4,49** 
AXB 1 134 4.49** 
Within 56 0298 
Total 59 

*p < 025. 
** p <.05. 


groups. The mean consistency score of the 
neurotic adolescent group (.192) was sig- 
nificantly lower than the normal adolescents’ 
mean of .318 (p< .03). The mean consis- 
tency score of the neurotic adult group (.307) 
was not significantly different from the nor- 
mal adults’ score (.392), demonstrating sup- 
port for the hypotheses that neurotic adoles- 
cents describe themselves less consistently 
over situations and that this inconsistency is 
specifically a characteristic of neurotic dis- 
turbance in adolescence, rather than of neu- 
rosis in general. 

A further comparison was made between 
the two normal groups. The normal adoles- 
cent group’s mean consistency score of .318 
was not significantly different from the nor- 
mal adult group’s mean of .392. 


Discussion 


The results support the position that neu- 
rotic adolescents tend to describe themselves 
less consistently over situations than do adol- 
escents or neurotics in general. The finding 
of such an interaction suggests that generali- 
zations from previous work relating neurosis 
to self-concept flux are unwarranted, at least 
when describing neurotic adult males. 


TABLE 2 
t-Test Comparisons OF Q-Sort CONSISTENCY SCORES 


Mean r? t df 
Neurotic adolescents .192 2.00 28* 
Normal adolescents .318 
Neurotic adults .307 
Normal adults .392 1.25 28 


*5« .03. 


In addition, these findings have some rele- 
vance to the issue of identity crisis as a part 
of normal adolescence. Consistent with En- 
gle’s finding but not with Erikson’s theory, 
normal adolescents in the present study did 
not tend to describe themselves less consis- 
tently than did normal adults. The results 
reported apply only to the first of Erikson’s 
three criteria, however, and it is quite possi- 
ble that a more inclusive approach to identity 
would have resulted in different findings. (In 
previous research, however, it has been a 
nearly universal practice to define identity as 
the perception of oneself as the same person 
over time and interpersonal situations.) Per- 
haps the present study is most relevant to 
the identity issue in that it points up the 
necessity for further adolescent-adult com- 
parisons in future studies. 

Future research should also examine the 
developmental aspects of the crystalization 
of a consistent self-description. The present 
findings suggest that normal children go 
through such a crystalization process before 
adolescence and that the presence of neurotic 
problems delays this process until after 
adolescence. 
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RELIGIOUS AFFILIATION AND PSYCHOPATHOLOGY 
IN A MEXICAN-AMERICAN POPULATION * 


ARNOLD MEADOW anp LOUISE BRONSON ? 
University of Arizona 


Protestant and Catholic Mexican-American Ss of similar levels of acculturation, 
education, and socioeconomic background were studied with a variety of instru- 
ments, Evaluations of psychopathology, reported here, were derived from the 
L-R sections of the Cornell Medical Index and behavioral observations. The 
lower rate of pathological responses by Protestant Ss was attributed to the 
social support offered by the small, intimate congregations with their strong, 
paternal leadership and the Protestant doctrines of asceticism and individual 
responsibility which contribute to impulse control. 


Despite the popularity of concepts such 
as “the Protestant Ethic” and the Protestant 
*need-achievement" culture, studies attempt- 
ing to contrast the effects of Protestant or 
Catholic affiliation in personality structure 
have been in large part unsuccessful. In fact, 
one recent work, Religion in American Cul- 
ture, (Schroeder & Obenhaus, 1964) indi- 
cates that not only are there no observable 
differences in personality organization but 
that followers of these faiths when interviewed 
give vague, almost identical, answers to ques- 
tions involving the basic tenents of their 
faiths. The possibility, however, remains that 
real differences previously observed may have 
been blunted in the melting pot of American 
culture. Might these differences appear if 
“old-fashioned” Catholics could be con- 
trasted with fundamentalist, sect-oriented 
Protestants? 


METHOD 


A small group of recently converted, but relatively 
unacculturated, Mexican-American Protestants and 
their Catholic counterparts, provided the authors 
with Ss meeting the above specifications. These Ss 
were studied with various instruments: a structured 
interview, a medical questionnaire, projective de- 
vices, value schedules, and behavior indexes. This 
Paper reports the results obtained from the medical 
questionnaire, the Cornell Medical Index (CMI). 


1This research was supported by United States 
Public Health Service Grant 5-R11-544-2, Arnold 
Meadow, director. i 

? Requests for reprints should be sent to Louise 
Bronson, Student Counseling Bureau, University of 
Arizona, Tucson, Arizona 85721. 


The Cornell Medical Index (CMI) 


The CMI was developed as a screening device for 
both medical and psychiatric problems, and has 
been used for this purpose in hospitals and clinics. 
Moreover, the CMI has been used in recent studies 
in epidemiology: in an area of rapid culture change 
in North Carolina (Cassel & Tryroler, 1961), in the 
rural and urban districts of Zulu (Scotch & Geiger, 
1964), among the Eskimo (Chance, 1965), in Indian 
boarding schools (Nelson, Dawes, Hipple, Jetmalani, 
& Tadlock, 1964), and among inductees of various 
ethnic origins (Croog, 1961). The psychiatric sec- 
tion here used reports seven areas of symptomatol- 
ogy. Its questions, which are direct, brief, and non- 
offensive in nature, use simple “yes” and “no” an- 
swers, and Spanish and English language versions 
have been published. 


Subjects 


The investigation was conducted in a southwes- 
tern city consisting of approximately a quarter of 
a million inhabitants, 17% of whom are Mexican- 
American, Fifty-four Protestant Ss were drawn from 
11 churches, representing the principal Mexican- 
American Protestant denominations in the city. 
These were matched with a Catholic group on the 
variable of age, country of nativity, education, and 
socioeconomic status. The members of both groups 
were located through the religious leaders of their 
church. All were male and 18 yr. or older, 


RESULTS 


The principle finding of the study was that 
the Catholics produced a greater number of 
pathological responses than the non-Catholic 
group. The mean M-R CMI score for the 
Mexican-American Catholics was 9.4; for the 
Protestants 4.6. A two-tailed ¢ test shows this 
difference to be significant at the .001 level. 

Figure 1 shows the contrast between 
Catholic and non-Catholic groups on the 
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symptom syndromes tapped by the CMI. The 
t tests show these differences to be significant 
at the following levels: habits, .001; inade- 
quacy, 75; depression, ms; anxiety, .001; sen- 
sitivity, .01; anger, .01; and tension, .01. 
The markedly high scores in the areas of 
"inadequacy" and "sensitivity" should be 
noted. Typical questions in Section M (in- 
adequacy) are, “Do you get nervous and 
shakey when approached by a superior?" and 
“Do you wish you always had someone at 
your side to advise you?" The high scores 
on this section obtained by the Mexican- 
American Ss substantiate clinical and field 
Observations. In general, this appears to be 
a group with strong feelings of inadequacy 
produced in part by race prejudice and in 
part by the difficulty experienced in meeting 
the demands of a technologically oriented 
world. Section P, (sensitivity) has such ques- 
tions as, *Are your feelings easily hurt?" and 
“Does criticism always upset you?" These 
questions possibly tap a reaction more fre- 
quent in a culture employing shame rather 
than guilt as a mechanism of social control. 
The low scores in the category of “de- 
pression" are striking. This finding is sup- 
ported by clinical evidence. Depression, of 
the type clinically diagnosed in the general 
population, is seldom seen in the Mexican- 
American population. Depressed states do 
occur in Mexican-American patients, but 
their lack of severity and short duration has 
led to the coining of the term “Mexican de- 
pression" (Meadow & Stoker, 1965). 


O caruoLic 
EJ protestant 


PERCENT PATHOLOGICAL 
RESPONSES 


SECTIONS OF CMI 


. Catholic and Protestant responses to L-R 
sections of the CMI. 
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Although there is a trend for older Ss to 
give more pathological responses, this ten- 
dency fails to reach statistical significance 
for either Protestant or Catholic respondents. 

Examination of the mean scores of Ss of 
various educational levels reveals a U-shaped 
relationship (see Figure 2). In separate sign 
tests it is found that Ss with 0-5 yr. of edu- 
cation have significantly more pathology than 
those with 6-11 yr. of education (significant 
at the .01 level), and those Ss with 12 yr. 
or more of education have more symptoma- 
tology than those with 6-11 yr. of education 
(significant at the .05 level). This relation- 
ship may still be observed when age is held 
constant. It is possible that individuals with 
limited education simply do not obtain skills 
requisite for minimal adjustment in an urban 
American society and consequently develop 
more psychopathology. Second, it would seem 
that the most highly educated group is sub- 
jected to greater acculturation stress. Repre- 
senting marginal man caught between the 
presses of Mexican values stressed by his 
parents and the new American values learned 
in school, this group also develops greater 
psychopathology. Those with average educa- 
tion who have secured sufficient skills for 
adequate functioning in the American society, 
but who have not yet encountered to its 
fullest extent the clash of contrasting cul- 
tures, appear to develop the most healthy 
psychological development. 

In drawing conclusions such as those 
above, the possibility of overgeneralization 
must be kept in mind. It may not be as- 
sumed that the differences observed here are 
also true of Anglo-American Catholics and 
Protestants. Nor, may these conclusions be 
applied—without further exploration—to ac 
culturated Mexican-Americans. What happens 
to these groups as they climb the socioeco- 
nomic ladder is an area inviting investigation. 

In addition to overgeneralization, the ques 
tion of response bias must also be considered. 
Despite the close cultural similarities of the 
two groups being compared, it is possible 
that one group might be more motivated to 
give responses of a certain type. For example, 
the low scores of the Protestants could be a 
result of a general tendency to give negative 
responses or a need to appear in a positive 
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light. For the following reasons, however, it 
is felt that the major portion of the differ- 
ences obtained reflect true differences in 
symptomatology. 

First, if a bias does exist it might be ex- 
pected to be displayed consistently through- 
out the CMI responses. There is no signifi- 
cant difference between the responses of 
Catholics and Protestants on Subsections M 
(inadequacy) and N (depression) while all 
other subsections show differences at the .01 
or .001 level of significance. Even if one were 
to assume that the gap between Catholic and 
Protestant scores on these two subsections 
represents a base rate of bias, and lowered 
the differences on the other subscales by this 
amount, large differences, not accountable to 
bias, would still remain (see Figure 1). 

Second, a few items which ask about fam- 
ily background can be used as a sort of lie 
scale. For example, to such questions as, “Did 
anyone in your family ever have a nervous 
breakdown?" and “Does worrying run in 
your family?" Catholics and converts to 
Protestantism should, if no response bias 
exists, give similar answers. And this is the 
case. Again, converted Protestants give 
slightly fewer pathologically oriented re- 
sponses, but the difference between these 
responses and those of the Catholics is not 
significant, even at the .25 level, as reported 
by a sign test. 


o- -o CATHOLIC 
e—e PROTESTANT 


MEAN CMI SCORE 


O3 46 79 1012 l3 
YEARS OF EDUCATION COMPLETED 


Fic. 2. Catholic and Protestant CMI (M-R) scores 
and years of education completed. 
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Discussion 


The major finding—lower pathology re- 
ported by Protestants—deserves further ex- 
ploration. What factors have served to thus 
change this group of converts? Two major 
influences are proposed as having effected this 
trend. The first is that the small social groups 
typical of the Protestant Mexican-American 
churches provide a greater amount of social 
support than the larger, more amorphous, 
Catholic congregations. The second is that 
the content of the Protestant doctrine itself 
produces a less pathological character struc- 
ture than Catholic religious content—at least 
with respect to the specific problems of ad- 
justment with which the Mexican-American 
is confronted in an urban American society. 

'The social support factor may best be 
evaluated against the background of the So- 
noran folk culture from which the Mexican- 
Americans of this study emigrated. Central 
to this culture were the supportive institu- 
tions of the extended family, the parallel 
fictional family the compadrazgo system, and 
the economic, political, and emotional sup- 
port provided by the patron and/or the 
priest. When the Mexican moves to the 
United States these traditional institutions 
are weakened, and no substitutes for these 
institutional lacunae are readily available to 
most of the people. 

But the Mexican-American Protestant 
church appears to fill many of these gaps in 
the weakened traditional culture. Its con- 
gregation provides a close knit social system; 
and, in contrast to the Catholic congregation, 
it is small, and the members are in continu- 
ous social contact. These relationships are 
further modified in the sect-oriented churches 
by the shared sense of sin and guilt and the 
frequent, emotionally fraught, public confes- 
sions. Interviews with the 11 ministers and 
the parishioners of the sampled Protestant 
churches reveal that the leader of the con- 
gregations, himself a Mexican-American, in- 
timately knows the problems of each of his 
members, and is the patron-priest in a very 
real sense. In the community chosen for this 
study the Catholic priests have large parishes 
and, with one exception, no knowledge of 
Spanish. They cannot provide the individual 
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with the multiple social supports which mem- 
bers of a low-status acculturating group so 
greatly need. 

Furthermore, there is the doctrine of the 
religion itself, the second of the major influ- 
ences mentioned above. It must be remem- 
bered that the Protestants studied here are 
members of old-fashioned, sect-type churches. 
Asceticism and individual responsibility are 
greatly emphasized in these groups and it is 
these aspects of their doctrine that would ap- 
pear to be most effective in curbing the types 
of personality disorganization observed in 
the Mexican-American community. Basically 
these problems involve difficulties in control 
of impulsivity. Crimes of violence, drinking, 
fighting, impulsive sexual behavior, sudden 
desertions of family, and the leaving of 
schools and jobs are among the most fre- 
quently observed “problem” behaviors of 
male Mexican-Americans. In the city under 
study, the diagnosis of sociopathic personal- 
ity was the most frequently reported clinical 
syndrome for the Mexican-American clinic 
population. 

The above hypothesis, linking Potestant- 
ism and impulse control, was strongly sup- 
ported by an interesting behavior observation 
made during the course of data collection. 
When the data were tallied on missed ap- 
pointments, it was found that the Catholic 
group had failed to meet 57 appointments, 
while the corresponding Protestant group 
missed but 6 appointments. A chi-square test 
shows that this difference is significant be- 
yond the .01 level. Conversations with re- 
spondent’s wives and assessment of the em- 
barrassed excuses of the Ss themselves led 
the research team to conclude that the major 


variable involved in the appointment cancel- 
lations was an impulsive flight from a situa- 
tion which was perceived as mildly stressful, 
In some cases, however, the S had quite sim- 
ply accepted an invitation to a ball game or 
the corner bar, assuming that the interviewer 
would understand. The difference between 
Protestants and Catholics in this respect is 
not only statistically significant but striking 
in magnitude. The Protestant Ss, in response 
to CMI items (particularly in Section Q, 
anger), indicate that they have increased im- 
pulse control; the quantitative data on missed 
appointments presents objective behavioral 
data which support their statements. 
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The present experiment investigated certain assumptions derived from Ellis’ 
theory relating implicit verbalization to emotional arousal. Emotional responses 
(galvanic skin response and respiration changes) of Ss instructed to silently 
read sequences of affectively loaded sentences were compared with those of Ss 
who read neutral sentences. The relationship between relevance to S of the 
content of the sentences and level of emotional response was also investigated. 
In addition, the effect of sentence type (observation, inference, or evaluative 
conclusion) on level of emotional response was investigated. Results were that 
Ss showed a significantly greater response to affectively loaded sentences than 
to neutral sentences on two out of three response measures. Neither relevance 
nor sentence type were found to have any significant effect. These results 


were interpreted as offering only partial support for the theory. 


Behavior theorists agree that neurotic be- 
havioral reactions are intimately related to 
the individual's learning history; however, 
there is disagreement as to the mechanism 
which allows past experience to interact with 
the current stimulus situation resulting in 
neurotic behavior. Typically some sort of 
mediating response is postulated. In the case 
of phobic reactions, certain theorists (e.g., 
Mowrer, 1947; Turner & Solomon, 1962) 
have stressed the role of fear as a mediating 
response. Mowrer's two-factor theory assumes 
that by classical conditioning fear becomes 
attached to a particular stimulus and that 
avoiding the conditioned stimulus occasions 
fear reduction which reinforces the avoidance 
response. While the present authors are in 
essential agreement with the proponents of 
two-factor theory, it is not difficult to con- 
ceive of instances of avoidance learning which 
the theory cannot deal with adequately. For 
example, snake phobias are relatively com- 
mon in our culture (Lang & Lazovik, 1963) 
and yet it is rare that the snake phobic can 
point to a direct aversive encounter (e.g. 
having been bitten by a snake). Once the 
fear response has become attached to snake 
stimulus, it is reasonable to suppose that fear 
reduction maintains the instrumental avoid- 
ance response. But the critical question of 
how fear became attached to the phobic stim- 
ulus in the first place remains unanswered. 


1 Requests for reprints should be sent to David C. 
Rimm, Department of Psychology, Arizona State 
University, Tempe, Arizona. 
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A reasonable possibility is that verbal stimuli 
act as mediators. One can envision a child 
experiencing physical punishment paired with 
verbal (auditory) stimuli such as “bad” or 
“dangerous” such that these cues come to 
elicit a fear response. Staats, Staats, and 
Crawford (1962) have shown how neutral 
words may come to elicit an emotional re- 
sponse as a result of having been paired with 
aversive stimulation. If at a later time these 
same verbal cues are paired with visual 
“snake” cues, one would expect that by 
higher order conditioning the snake would 
come to elicit fear. The ease with which 
evaluative words may function as uncondi- 
tioned stimuli in a higher order conditioning 
paradigm has been demonstrated by Staats 
and Staats (1958). 

One theory of neurotic behavior which has 
relied almost exclusively on verbal media- 
tional constructs is that of Ellis (1962, 1964). 
Ellis’ main theoretical emphasis is on the 
way in which neurotic behaviors are main- 
tained. It is assumed that such behaviors are 
maintained by self-verbalizations which serve 
to elicit negative effect (e.g., fear) and guide 
instrumental behavior. Presumably emotional 
responses become attached to objects by 
higher order conditioning, although this is 
not discussed in detail. Individuals acquire 
patterns of self-verbalization by imitating 
overt verbal behaviors of significant others. 
Neurotic or self-defeating behaviors result 
when external stimuli give rise to self-verbali- 
zations which are irrational in nature. By 
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irrational Ellis means verbalizations which 
are either based on arbitrary or unscientific 
premises or do not logically follow from their 
premises. Sequences of self-verbalizations 
may be divided into three categories, in order 
of occurrence. The first is a simple observa- 
tion of an environmental happening, for ex- 
ample, “There is a dog in my path.” The 
second is an inferential statement with regard 
to the observation, “That dog is likely to 
bite me.” The third is an evaluative conclu- 
sion, “That would be horrible.” Ellis himself 
classifies implicit verbalizations into two cate- 
gories, observation and inference and evalua- 
tion. The authors chose to divide a sequence 
of sentences into three categories for the pur- 
pose of making a more precise analysis. 

Irrational thinking (where thinking is 
equated with self-verbalization) is seen in the 
inference (dogs don’t usually bite people) 
and in the conclusion (being bitten by a dog, 
while somewhat painful, is not horrible). 
Ellis assumes that the anxiety response asso- 
ciated with the preceding sequence originates 
not with the observation, but with the infer- 
ence, and the evaluative conclusion. In other 
words, if one were merely to observe a dog, 
and not meditate on the possibility of being 
bitten, and come to no evaluative conclusion, 
he would experience no marked increase in 
anxiety. 

Using a technique (rational-emotive psy- 
chotherapy) which purports to modify self- 
verbalizations on the part of the patient, 
Ellis (1962, p. 38) claims an impressively 
high improvement rate (90% for patients 
treated for 10 or more sessions). Although 
Ellis’ theory has become increasingly popu- 
lar over the past several years, and on intui- 
tive grounds is most appealing, it has yet to 
undergo a rigorous experimental test. The 
present experiment, while not a comprehen- 
sive test of the theory, did examine certain 
of the more basic assumptions. The most 
basic assumption is that self-verbalizations 
can elicit emotional responses. To test this, 
experimental Ss were instructed to read to 
themselves triads of sentences, culminating 
in negative affective conclusions. While Ss 
engaged in this activity, galvanic skin re- 
sponse and respiration rate and depth were 
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continuously monitored. Control Ss were re- 
quired to read to themselves triads of sen- 
tences which were affectively neutral and 
which did not culminate in evaluative con- 
clusions. The theory predicts a greater emo- 
tional response on the part of the experi- 
mental Ss. The second assumption tested was 
that there would be marked differences in 
emotional responding among the three com- 
ponents of a self-verbalized sequence. A test 
of this involved comparing the mean level 
of emotional response for each of the three 
sentences presented to the experimental Ss 
on a given trial. The theory predicts an ap- 
preciably greater response to the third sen- 
tence (the evaluative conclusion) and to the 
second (inference) than to the first (observa- 
tion). 

An additional question, with which the 
theory does not explicitly deal relates to the 
relevance of the content of implicit sentences. 
By relevance is meant the extent to which 
a S indicates concern over a particular class 
of experiences. For a S who gives himself a 
high rating on a fear-of-dogs scale, the dog 
biting triad would be relevant. To test the 
role of relevance, prior to the experiment 
proper, Ss were required to rate themselves 
on a number of possible problem areas. On 
the basis of these prior ratings Ss in the ex- 
perimental group were presented with an 
equal number of high-relevance and low- 
relevance triads. On intuitive grounds one 
would expect a higher level of emotional 
arousal to high-relevance triads than to low- 
relevance triads. 


METHOD 
Subjects 


The 27 Ss, both male and female were introduc- 
tory psychology students at Arizona State Univer- 
sity, whose participation fulfilled a course require- 
ment. Nine Ss were assigned to each of two experi- 
mental orders and 9 to a control condition. 


Apparatus and Materials 


Stimulus items were presented via a Kodak Ca- 
rousel 800 slide projector, projected onto the W 
approximately 6 ft, in from of S. The S’s respiration 
rate and depth and galvanic skin response (GSR) 
were continuously monitored using a Keeler Poly- 
graph, model 6303. 
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Procedure 


Preliminary to performing the experiment, stu- 
dents in an introductory psychology class were 
asked to complete a rating sheet, containing items 
possibly related to areas of personal concern. Stu- 
dents rated themselves on a total of 35 scales, with 
a O rating indicating no concern at all, and a rating 
of 7 indicating that a particular item represented an 
area of extreme concern or worry. The 35 scales 
related to possible problem areas including grades, 
professional achievement, personal appearance, social 
behavior, sexual behavior or relationships with par- 
ents, finances, world situations, and illness and 
death. When the 95 rating sheets were returned each 
of the 35 items was ranked according to scale 
value, and the top and bottom 10 items were se- 
lected for use in the experiment proper. None of the 
Ss used in obtaining the preliminary ratings partici- 
pated in the experiment. 

Upon entering the experimental room, E instructed 
S to fill out a rating sheet containing the 20 items 
(the 10 rated highest and 10 rated lowest in the pre- 
liminary study). Seven-point rating scales were again 
employed. After completing this, the GSR electrodes 
were placed on S's palm and the pneumograph at- 
tachment wrapped around S's chest. The S was in- 
structed to get as comfortable as possible. To facili- 
tate relaxation, low intensity semiclassic background 
music was presented to S, via tape recorder. This 
preparatory period of approximately 10 min. was 
instituted in order to allow the response measures to 
stabilize, Five minutes into the preparatory period S 
was asked if he felt comfortable; this uniformly 
ilicited a GSR deflection, which E immediately ad- 
justed to a magnitude of 5 mm. for all Ss. 

The two items with the highest ratings and two 
items with the lowest ratings determined which triads 
of stimuli were to be used with each S. For example, 
suppose an S gave the highest rating to concern over 
grades, the second highest rating to fear of personal 
injury, the lowest rating to not being noticed by 
Others, and the second lowest rating to a concern 
with making mistakes. This S would then be exposed 
to the following four triads of sentences: 


My grades may not be good enough this semes- 
ter . . . I might fail out of school . . . that would 


be awful. 

I might get injured or crippled . . . this could 
make me ugly and incapacitated . . . what an aw- 
ful thought. 


. maybe there 


People don't notice me enough . . 
. how 


is something unattractive about me . . 
terrible. 

Im always making mistakes , . . Ill never do 
anything right . . . what a worthless nothing that 
makes me. 


For half of the experimental Ss (nine) the order 
of presentation was as follows: triad associated 
with the highest rating presented first, lowest triad 
presented next, second highest triad presented next, 
and second lowest triad presented last. For the re- 


maining half of the experimental Ss, order of pres- 
entation was lowest triad first, highest second, sec- 
ond lowest third, and second highest triad last. Two 
different orders of presentation were used in order 
to take into account any possible response habitua- 
tion effects. 


The nine control Ss were also presented with four 
triads of sentences but the triads consisted of syllo- 
gisms of an impersonal nonaffective nature, such 
as: "Inventors are imaginative . . . Edison was an 
inventor . . . therefore he was imaginative." 

Prior to presentation of the first triad, S was told 
that he would be presented with sets of three sen- 
tences, with each member of a given set pertaining 
to the same general idea. To illustrate, S was pre- 
sented with a neutral syllogism similar to the type 
presented above. Each S was then instructed to read 
each sentence to himself as it was flashed on the 
wall in front of him; he was told to concentrate on 
that sentence and nothing else. The first sentence 
of the first triad was then presented. Each sentence 
was presented for 5 sec. with a 10-sec. interval be- 
tween successive members of a given triad. Succes- 
sive triads were separated by a 15-sec. interval. This 
procedure was continued until S had been presented 
with all four sentence triads. 


RESULTS 

Preliminary Ratings 

Of the 35 items rated, the top 10 items, 
for example, associated with the most con- 
cern, received a mean rating of 2.92. The 
item with the highest mean rating (3.96) was 
“your grades won’t be good enough.” The 
lowest of the top 10 items, “You will be em- 
barassed in public,” received an average rat- 
ing of 2.28. The bottom 10 items received a 
mean rating of 1.30. The highest of the bot- 
tom 10, “You are insincere,” had an average 
rating of 1.60, and the lowest “Your sexual 
behavior is immoral,” was rated at .83. 


Habituation 


In order to ascertain the general effect on 
responsitivity, of repeated exposure to stimu- 
lus items for each response measure, the sum 
of the response to each of the first six sen- 
tences (i.e, first two triads) was compared 
with the sum of the response to each of the 
second six sentences (second two triads). For 
GSR, defined as the maximum deflection in 
millimeters during the 10-sec. period follow- 
ing the presentation of a given sentence, for 
the first experimental order (high relevance- 
low relevance-high relevance-low relevance) 
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the mean of the difference scores was 1.96 
(t= .27, ns). For the second experimental 
order (low-high-low-high), the mean differ- 
ence score was .78 (¢ = .31, ms). 

Similar analyses were performed on the 
respiration measures. For rate, the response 
measure was the absolute value of the differ- 
ence between mean respiration time estab- 
lished over a 10-sec. base-line period immedi- 
ately prior to the presentation of the first 
sentence and mean respiration time over the 
10-sec. period following presentation of a 
given sentence. For the first experimental 
order, the mean of the difference scores (ex- 
pressed in millimeters rather than seconds) 
was 1.37 (f= 1.21, ms) and for the second 
order, .33 (t= .69, ns). For depth, the re- 
sponse measure was the absolute value of the 
difference in millimeters between mean depth 
during base line and mean depth over the 
10-sec. period following any given sentence. 
For the first order, the mean of the difference 
scores was .52 (¢ = .67, ms) and for the sec- 
ond 1.26 (¢ = .91, ms). For purposes of com- 
paring experimental and control condition, 
it was necessary to take the absolute (rather 
than algebraic) value of respiration changes. 
While Ss tended to be consistent with respect 
to the direction of change, some showed de- 
creases (in rate or depth) and others, in- 
creases. Algebraic values would tend to can- 
cel one another, negating any real differences 
between experimental and control conditions. 
While the experimental Ss gave no evidence 
of habituation on any measure, significant 
GSR habituation was observed for the con- 
trols. The mean of the difference scores was 
4.52 (t = 3.12, p < .02). For respiration rate 
and depth the means of the difference scores 
were, respectively, .19 (£= .58, ms) and 48 
(t = .91, ns). 


Relevance 


To determine the relationship between 
rated relevance of the content of a particular 
triad and physiological response to that triad, 
high- and low-relevance triads were com- 
pared. This was accomplished by summing 
each of S’s responses to each sentence asso- 
ciated with both high-relevance triads, and 
comparing this with the corresponding sum 
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based upon both low-relevance triads. Since 
neither experimental group showed habitua- 
tion effects, (and the groups would not be 
expected to differ in any other way) they 
were combined in order to increase the num- 
ber of degrees of freedom. 

For GSR, the mean of the differences was 
3.35 (t= 1.62, ns). For respiration rate, 
again using the absolute value of the base- 
line postsentence difference, the mean of the 
difference was .30 (4 = .69, ms). For respira- 
tion depth the mean of the differences was 
.78 (t= 1.22, ns). Thus for all response 
measures, degree of relevance appears to have 
had little relationship to magnitude of re- 
sponse to a particular triad. 


Response to Different Sentences of a Triad 


To determine whether experimental Ss re- 
sponded differentially to the different sen- 
tences of triads, analyses of variance were 
performed for each response measure. Each 
of S's responses to the first sentence (ob- 
servation) of each triad was averaged over 
all four triads, and this was compared with 
the corresponding average for the second sen- 
tence (inference) and the third sentence 
(evaluative conclusion). 

For GSR, the means, based upon all 18 ex- 
perimental Ss were for objective, inference, 
and evaluative conclusion sentences, respec- 
tively, 10.03, 6.78, and 5.92. While the 
greater magnitude of response to objective 
sentence is counter to what the theory pre- 
dicts, the effect is not significant (F — 2.77, 
ns). For respiration rate, the comparable 
means were 1.79, 1.99, and 1.92 (F — 44, 
ns). For respiration depth, the means were 
2.04, 2.49, and 2.25 (F — 1.09, ns). Thus, 
for all response measures, there is no evi- 
dence for differential responding to the dif- 
ferent types of sentences. In particular, the 
hypothesis that there is a greater emotional 
response to the inference and evaluative con- 
clusion is not supported. 

For the controls, for GSR the means for 
the major premise, minor premise, and con- 
clusion were, respectively, 5.17, 3.56, and 
3.94 (F = 1.96, ns). For respiration rate the 
means were .81, .92, and 1.16 (F = 1.68, ns) 
and for depth, 1.17, .89, and .83 (F = 2.93, 


| 


l 


3 


SELF-VERBALIZATION AND EMOTIONAL AROUSAL 


ns). While this analysis with respect to GSR 
is somewhat obscured by the significant habit- 
uation effect previously noted, in general 
one can conclude that for the controls as 
well, responding was not significantly related 
to sentence type. 


Experimentals versus Controls 


The final hypothesis to be examined relates 
to whether the experimental Ss, having been 
exposed to sentences “affective” in content, 
showed a greater response than Ss exposed 
to “neutral” sentences. 

To test this a mean score (for each re- 
sponse measure) was calculated, based upon 
each experimental S’s response to each of the 
12 sentences presented to him. The combin- 
ing of data across conditions was permissible 
since neither habituation, relevance, nor sen- 
tence type was found to have any significant 
effect. Similarly, mean response scores (for 
the respiration measures) for the control 
group were calculated, and experimentals and 
controls were compared. For respiration rate 
the means for experimentals and controls 
were, respectively, 1.42 and .69. This differ- 
ence is significant (t = 2.20, P< 05). For 
respiration depth the respective means were 
1.69 and .72, representing a highly significant 
difference (t = 4.19, p < .001). Such a com- 
parison for GSR is complicated by the 
habituation observed in the control group. 
Taking this into account, mean GSRs for the 
experimental Ss (based upon all 12 sen- 
tences) were compared separately with con- 
trol means based upon the first two triads 
(frst 6 sentences presented to S) and the 
second two triads (second 6 sentences pre- 
sented to S). For the first comparison, re- 
spective means were 5.68 and 5.55 (t — .08, 
ns). For the second comparison, the differ- 
ence is in the predicted direction but does 
not attain significance, with means of 5.68 
and 3.13 for experimental and controls, re- 
spectively (¢ = 1.61, ns). 

Thus for the respiration measures, experi- 
mental and controls Ss could be clearly dif- 
ferentiated. For GSR, experimental Ss also 
showed somewhat greater reactivity though 
differences were not significant. 
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Reliability of Measurement 


For both respiration rate and depth, the 
response measure was the absolute value of 
the difference between base-line response and 
response following the presentation of a given 
sentence. The use of absolute values assumed 
that Ss responded consistently with respect 
to the direction of this difference. To test 
this, each of S’s responses to the first six 
sentences were combined algebraically. The 
S’s responses to the second six sentences were 
treated in the same manner. For respiration 
rate, the number of experimental Ss showing 
agreement in sign (i.e., plus to plus or minus. ` 
to minus) going from the first six sentences 
to the second six sentences was 14. The 
probability of obtaining by chance 14 
matches in sign, out of a possible 18 is 011. 
For respiration rate, again 14 out of 18 Ss 
showed an agreement in sign. For the con- 
trols, for rate, 8 out of 9 Ss showed agree- 
ment in sign (p= 017), and for depth, 7 
out of 9 showed agreement (p = .070). Thus, 
with respect to an increase or decrease in 
response magnitude relative to base line, Ss 
tended to behave quite consistently. 

The analyses of variance, previously dis- 
cussed in connection with sentence type, pro- 
vide an additional check on intra-S reliability. 
For the experimental Ss, for GSR, respira- 
tion rate and respiration depth, respectively, 
the Fs based upon individual differences, 
were 3.40 (p< .05), 4.99 (p < .001), and 
404 (p <.05). For the controls the Fs 
(df = 8/16) were 5.41 (p € .05), 24.54 (p < 
.001), and 21.48 (p< .001). Thus, Ss tended 
to be consistent with respect to magnitude 
as well as direction of response. 

Respiration rate and depth measures would 
not be expected to be independent, and in 
fact for the experimental Ss the product- 
moment correlation coefficient was .60 (z = 
2.67, p < 01). For the controls, the corre- 
lation shrank to .13 (z = 37, n5). The lack 
of significant correlation for the controls can 
be attributed to the very limited range of 
scores, related of course to low magnitude of 
response encountered in this group. 
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DISCUSSION 


The purpose of the present experiment was 
to examine certain of Ellis’ basic assump- 
tions relating to the effect of self-verbaliza- 
tion on emotional arousal. In fact, self-ver- 
balization was neither directly manipulated 
nor independently observed. The relevant 
stimuli directly under E’s control were vis- 
ually presented sentences, although it was 
assumed that S did follow instructions and 
read each sentence to himself. In this con- 
text self-verbalization is a construct having 
a role quite analogous to that of the “visual 
image" in Wolpian desensitization (Wolpe, 
1958). 

Of the assumptions tested, the one most 
fundamental to the theory received support; 
Ss tended to show a greater emotional re- 
sponse to sentences of an affective nature 
than to neutral sentences. Since by definition 
affective stimuli illicit more emotion than 
neutral stimuli, the result is as much a vali- 
dation of methodology as a support for the 
particular hypothesis. The failure to obtain 
differential effects associated with relevance 
and sentence type is somewhat more sur- 
prising. With respect to relevance, it is un- 
questionably true that for a given S, certain 
words or sentences have more power as con- 
ditioned stimuli than others. It may be that 
the content areas chosen were such that the 
average S would be expected to have had 
some negative conditioning associated with 
each. For the preliminary ratings, the means 
for the top 10 and bottom 10 differed by only 
1.32 scale units, suggesting only fair dif- 
ferentiation. Nevertheless, taking this find- 
ing at its face value, it does suggest that in- 
ducing an individual to engage in negative 
self-referential statements produces an emo- 
tional response independent of whether he 
verbalizes the belief that the statement ap- 
plies to him or does not. 

The finding that responses to objective, 
inferential, and evaluative sentences did not 
differ significantly is directly counter to what 
the theory predicts. It might be argued that 
since Ss were free to engage in self-verbaliza- 
tion over the 10-sec. intersentence interval 
they might well have come to their own 
evaluative conclusions during this period. 
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Thus, the response to an objective sentence 
would have tended to be as great as it was 
when an inferential or evaluative sentence 
was presented. Were such the case one would 
have expected the change in responding to 
begin late in the intersentence interval, In 
fact, in most instances the change occurred 
during or immediately after the presentation 
of the sentence, suggesting that the response 
was to the sentence itself. However, as an 
additional control one might have Ss engage 
in some sort of competing activity, such as 
counting backwards, (Peterson & Peterson, 
1959) over this interval. 

It is possible that for young children, 
highly evaluative sentences containing words 
such as “awful” or “terrible” function as 
more powerful conditioned stimuli than sen- 
tences of a more objective nature. For the 
highly verbal college student, however, by a 
process of higher order conditioning and ad- 
ditional diréct conditioning, objective state- 
ments could well become just as effect in- 
ducing. 

In conclusion, the present experiments lend 
support to Ellis’ general contention that 
self-verbalization has a direct effect on emo- 
tional arousal but does not support the spe- 
cific hypothesis relating to type of sentence. 
Certain critical questions, which the authors 
have not dealt with, are worthy of mention. 
First, what is the relationship between self- 
verbalization and instrumental behavior? It 
is reasonable to suppose that once a verbal 
mediator comes to elicit negative effect or 
drive, this will motivate instrumental re- 
sponding. The second question has to do 
with the maintenance of sequences of implicit 
verbalizations. One might predict that when 
such a sequence elicited negative effect, this 
would function as punishment suppressing 
the particular sequence. Suppose, howevel, 
that the negative effect did drive an instru- 
mental response, culminating in drive Te- 
duction. Drive reduction might be expect 
to reinforce not only the instrumental be 
havior but also, the verbal mediational te 
sponse preceding it. This would account for 
the maintenance of the implicit verbal sè 
quence. Also, by virtue of its own stimulus | 
properties the verbal mediator may become 
a critical cue in a chain controlling the /* 
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strumental response (viz., Osgood, 1953). 
This is within the realm of speculation of 
course, and the nature of the relationship 
between implicit verbalization and instru- 
mental behavior is in need of experimental 
investigation. 
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OPERANT RESPONDING FOR VISUAL STIMULI 
DURING SENSORY DEPRIVATION: 


EFFECT OF MEANINGFULNESS! 


A. MICHAEL ROSSI? PETER E. NATHAN, ROBERT H. HARRISON 
AND PHILIP SOLOMON 
Harvard Medical School 


Slide series differing in amount of meaningfulness were used as reinforcers for 
conjugately programmed operant responding during 2-hr. sessions of sensory 
deprivation and control conditions. Ss were 36 student nurses between the ages 
of 17 and 22. Every S had the opportunity to respond for the slide series under 
both sets of conditions. Rates of responding under both conditions were higher 
for the more meaningful slide series, but rates of responding for the least 
meaningful reinforcers were higher under sensory deprivation conditions than 
under control conditions. The results are interpreted as indicating a need for 
nonmeaningful stimulation as well as for meaningful stimulation during 


sensory deprivation. 


A perusal of the literature on sensory de- 
privation (SD) research suggests that the 
major effects of SD are well documented, but 
the processes underlying these effects still are 
matters of conjecture. Theories invoked in 
formulating these conjectures range from 
neurophysiological to psychoanalytical, but 
almost all of them assume that various di- 
mensions of normally present exteroceptive 
stimuli (e.g., variety, complexity, intensity, 
etc.) are needed by human organisms to 
function effectively, and that the absence of 
these dimensions produces a “stimulus hun- 
ger” which, if sufficiently intense, gives rise 
to deleterious effects. This general assump- 
tion is supported by diverse studies on stimu- 
lus-seeking behavior during SD employing 
animal Ss (Fox, 1962; Kish, 1955; Moon 
& Lodahl, 1956; Premack, Collier, & Roberts, 
1957; Sackett, Keith-Lee, & Treat, 1963; 
Wendt, Lindsley, & Adey, 1963) and human 
Ss (Bexton, 1953; Goldstein, 1965; Jones, 
1961, 1964; Jones, Wilkinson, & Braden, 
1961; Myers, Murphy, Smith, & Goffard, 


1 This research was accomplished with the sup- 
port of the Office of Naval Research, ONR Con- 
tract Number 1866(52); ONR Contract Authority 
NR 142-115, 

2 Requests for reprints should be sent to A. 
Michael Rossi, Psychiatry Service, Boston City 
Hospital, South Department—BB, 745 Massachusetts 
Avenue, Boston, Massachusetts 02118. 
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1966; Rossi & Solomon, 1964a, 1964b, 1964c, 
1964d; Vernon & McGill, 1960; Zuckerman 
& Haber, 1965). 

An unresolved issue is whether the stimu- 
lus hunger is for meaningful stimuli only or 
for nonmeaningful stimuli as well (i.e. for 
stimulation, per se). The purpose of the pres- 
ent study was to provide experimental data 
bearing on this issue by examining the extent 
to which meaningfulness differentially affects 
the reinforcement value of visual stimuli 
during SD and the extent to which the rein- 
forcement value of meaningfulness differs un- 
der control and SD conditions. 


METHOD 
Subjects 


The Ss were 36 paid, volunteer, student nurses 
in training at the Boston City Hospital. They were 
recruited by a posted notice soliciting Ss “for 4 
psychological study . . . rate of pay, $1.50 per hour.” 
Their average age was 18 with a range of 17-22. 


Procedure 


The SD conditions were created in a 7X7X8 
ft. darkened, sound-attenuated, air-conditioned cubi- 
cle used in previous SD research (Rossi & Solomon, 
1964a). White noise was conveyed into the room 
via a wall loudspeaker at a level of loudness that 
excluded ambient sounds. 

The S sat in a reclining lounge chair facing 4 
Plexiglas covered 2 X 2 ft. wall portal whose center 
was on a horizontal plane with S's eyes. On the 
other side of the portal was a television monitor 
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Fic. 1. Schematic diagram of apparatus. 


enclosed in a light-proof box and below the portal 
was a row of 5 colored, 12-v. bulbs. During SD, 
everything in the room was invisible to S except 
when visual reinforcers were made available. The S 
held a hand switch which controlled (via a con- 
jugately programmed potentiometer) the brightness 
of the television screen when visual reinforcers were 
available. During these periods the television screen 
remained dark if S failed to respond, obtained maxi- 
mum brightness if S responded at about two re- 
sponses (hand-switch closures) per second, and 
reached intermediate brightness levels with inter- 
mediate response rates. The S’s responses were re- 
corded on a cumulative recorder. A schematic dia- 
gram of the apparatus is presented in Figure 1. 
Visual reinforcers were frames of six commercial 
film strips which portray the home life of typical 
families in various regions of the world. The film 
strips were composed of sequential photographs 
containing brief, printed subtitles. Forty frames 
from each film strip were made into slides to con- 
stitute a slide series. This allowed each slide series 
to be projected in three different sequences to repre- 
sent the following three degrees of meaningfulness: 
(a) most meaning was assumed when the slides 
were projected in their normal sequence; (b) less 
meaning was assumed when the slides were pro- 


3' The filmstrips were produced by Louis de 
Rochement Associates for United World Films, Inc., 
1445 Park Avenue, New York, N. Y. 10029. The 
filmstrips used in the present study were: “Eskimo 
Sea Hunters” (Northwestern Alaska), “Desert No- 
mads” (French Morocco), “Nomads of the Jungle” 
(Malaya), “Mountain Farmers" (Switzerland), 
"Farmer-Fisherman" (Norway), and "Farmers of 
India" (Middle Ganges Valley). 


jected in a sequence determined by a table of ran- 
dom numbers so that scenes and subtitles did not 
follow logical orders; and (c) least meaning was 
assumed when the slides were projected in a random 
order, upside down, and backwards so that the 
scenes were barely discernible and the subtitles were 
illegible. 

The six slide series were separated into two groups 
of three (triads). Every S had the opportunity to 
view both triads; one during a SD session and the 
other during a control session, In projecting each 
triad, one slide series was presented in its most 
meaningful sequence, one in its less meaningful se- 
quence, and one in its least meaningful sequence. 
Every slide series was projected equally often in 
each of the three sequences and the design also 
counterbalanced orders of sessions, orders of mean- 
ingfulness within a session, and combinations of 
triads and sessions. 

Following a brief screening interview, S was 
handed a copy of written instructions which de- 
scribed the experimental procedures and emphasized 
to S that Es “have no preconceived notion of what 
you will or should do under these circumstances, so 
please follow your natural inclinations.” 

After reading the instructions, S was required to 
operate the hand switch to view a practice slide 
series of 39 slides in order to become familiar with 
the equipment and procedure. The practice series 
contained an equal number of slides in the most, 
less, and least meaningful sequences. Following the 
practice period, the experimental sessions began. If 
S was scheduled for the SD control order of condi- 
tions, she was placed in SD for 2 hr., and the visual 
reinforcers were made available during the last half 
hour of the session. After release from SD, S left 
the laboratory for 1 hr., 25 min, On her return, S 
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sat in the experimental room with the door open, 
the white noise off, and normal sounds audible. 
After 5 min. for S to relax and readjust to the 
room, the visual reinforcers were made available. 
The Ss scheduled for the control SD order of con- 
ditions spent identical amounts of time in SD and 
out of the laboratory but in reverse sequence. Con- 
trol and SD sessions were always contiguous. 
Identical procedures were used when visual re- 
inforcers were made available during control and 
SD conditions. The onset of a white warning light 
signaled to S that the visual reinforcers were about 
to be made available, and S had been instructed to 
operate the hand switch three times to signal E 
that S saw the signal light and was not asleep. 
The white warning light remained on for 10 sec., 
then one of the colored lights came on and remained 
on for 5 min., 20 sec. While a colored light was on, 
a slide series was projected onto a screen in front 
of a television camera connected to S's television 
monitor (see schematic diagram in Figure 1). Each 
slide was projected for 8 sec. The interval between 
the termination of a colored light and the onset of 
the next white signal light within a session was 
6 min, 45 sec. With three slide series and two in- 
tervals between slide series, the total time for the 
presentation of one triad was exactly 3 hr. 


RESULTS 


The rate of Ss’ responses for the visual 
reinforcers provided the primary data of the 
study. These data were subjected to a com- 
plex analysis of variance with 36 identified 
sources of variance. The results indicated 
that only the variance attributed to the dif- 
ferent levels of meaningfulness had achieved 
statistical significance (p < .001). A perusal 
of the mean rates of responding presented in 
Table 1 indicates that this was attributable 
to the lower responding for the least mean- 
ingful slide series than for the other two 
series; there were no apparent differences in 
responding for the most meaningful and less 
meaningful series. 


TABLE 1 


MEAN RESPONSE RATES DURING SENSORY DEPRIVATION 
AND CONTROL SESSIONS FOR VISUAL REINFORCERS 
WITH THREE DEGREES OF MEANINGFULNESS 


Meaningfulness in visual reinforcer 

Session LU MED 

Most | Less | Least | Total 
Sensory deprivation 694 | 689 | 473 | 619 
Control 691 | 672 | 373 | 579 
Difference 3 17 | 100 40 


Note.—N = 36. 
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The results of the analysis of variance also 
indicated that the variance attributed to 
conditions (SD and control) approached, but 
did not reach, standard levels of statistical 
significance (p < .12). A graph of the mean 
response rates for visual reinforcers during 
both conditions, presented in Figure 2, sug- 
gests that the variance attributed to con- 
ditions was associated almost solely with 
differences in responding for the least mean- 
ingful reinforcers. The application of Coch- 
ran's approximate ¢ test confirmed this sug- 
gestion with the findings of no differences in 
responding for either the most or less mean- 
ingful reinforcers between SD and control 
conditions and significantly more responding 
for the least meaningful reinforcers under SD 
conditions when compared to control condi- 
tions (p < .01). 

A visual examination of individual cumula- 
tive records indicated that more than a third 
of the Ss showed high variability in rates 
of response during the presentation of the 
least meaningful reinforcers, but brief bursts 
of “stimulus testing” responses (a few re- 
sponses now and then apparently to deter- 
mine the nature of the visual reinforcer) 
were more frequent and sustained longer dur- 
ing SD than during control conditions. 


Discussion 


The results of this study suggest that dif- 
ferent degrees of meaningfulness in visual 
reinforcers produce different rates of operant 
responding by Ss both in and out of SD. The 
results also demonstrate that with the re- 
sponse costs (number of responses required 
to light up the television screen) used in the 
present study, the reinforcement value of 
more meaningful visual stimuli did not differ 
between control and SD conditions, but the 
reinforcement value of visual stimuli with 
least meaning was slightly greater during 
than during the control conditions. 

The increased reinforcement value of the 
least meaningful stimuli during SD suggest 
that a need for nonmeaningful stimulation 
(ie. stimulation, per se) is involved in the 


stimulus hunger created by SD. It remm is 


for future research to determine whether 
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Fic. 2. Button-press responses under control and sensory deprivation conditions for visual stimuli (slide 
series) with three degrees of meaningfulness. (Each slide in a series was projected for 8 sec.) 


differential reinforcement value of meaning- 
fulness can be increased or decreased with 
higher or lower response costs than were used 
in the present study and to determine the 
differential reinforcement value of meaning- 
fulness after more extended periods of SD. 

A tangential, but nonetheless important, 
aspect of this study was the demonstration 
of a method that provides sensitive, continu- 
ous, and automatic recordings of behavior 
during SD. The versatility of the method 
makes possible the investigation of the rein- 
forcement value of a wide variety of stimuli 
ranging from simple stimulus intensity to so- 
cial interaction via television under controlled 
and comparable conditions. This method may 
be able to generate data which will contribute 
to an objective description of the hypothe- 
sized stimulus hunger and the identification 
of stimulus dimensions which affect its in- 
tensity. Hopefully, this will lead to a greater 
understanding of the processes underlying 
the deleterious effects observed to occur dur- 
ing SD. 
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PARENTAL DOMINANCE, CONFLICT, AND DISCIPLINARY 


COERCIVENESS IN FAMILIES OF FEMALE 
SCHIZOPHRENICS* 


JOSEPH BECKER ? ann HERBERT SIEFKES 


University of Washington 


Relations between the premorbid adjustment of female schizophrenics and 
family interactions were investigated. Fifteen good premorbid (GPM), 15 poor 
premorbid (PPM), and 15 nonpsychiatric control (N-PC) daughters and 
parents were compared on measures from a structured situational test, parental 
self-reports, and daughters’ retrospective ascriptions of parental attitudes. 
Predictions of group differences in interparental dominance, and conflict level 
were weakly supported. N-PC parents attributed more disciplinary coerciveness 
to themselves (p< .025) than parents of schizophrenics, Greatest discrepancy 
between parents’ and daughters’ coerciveness ratings occurred in PPM families 
(p<.01), and discrepancy related inversely to quality of premorbid adjust- 
ment (p<.01). N-PC mothers reported the greatest range of coerciveness 
(p < 005), whereas fathers did not report nor daughters ascribe such differ- 
ences. PPM daughters ascribed stronger domineering or controlling tendencies 
to their mothers (p<.001) and fathers (p< .001) toward themselves than 


GPMs or N-PCs. 


Few studies of schizophrenic family inter- 
action have used female probands, and vir- 
tually none include any variant of a good- 
poor premorbid S analysis (Garmezy & Rod- 
nick, 1959). Schematically, the rationale of 
the present study was that parents of schizo- 
phrenics provide their children with poor 
models for internalizing integrated identities, 
or for acquiring effective extrafamilial inter- 
personal instrumental behaviors. Role rela- 
tions between parents of schizophrenics are 
characterized by power imbalance, and much 
conflict. Communication within families of 
schizophrenics is constricted and fragmented 
resulting in discrepant perceptions and ex- 
pectations among family members. More spe- 


1 This investigation was supported by grants from 
the Psychiatric Training and Research Fund of the 
Illinois Department of Mental Health, and the Uni- 
versity of Washington Graduate School Research 
Fund Initiative 171 Section. Appreciation is ex- 
pressed to Ethel Shaeffer, of the Biostatistics Di- 
vision of the University of Washington Medical 
School, and to Larry Menninga, of the Computer 
Center for statistical and programming consultation. 
Part of the data in this study were included in 
an unpublished master’s thesis submitted to the 
University of Washington School of Social Work 
by the second author. 

2 Requests for reprints should be sent to Joseph 
Becker, Department of Psychiatry, University of 
Washington, Seattle, Washington 98105. 


cifically, this study examined selected aspects 
of interparental dominance and conflict, and 
of parent-child coerciveness and dominance 
as a function of their schizophrenic daugh- 
ters’ premorbid adjustment. 

The first hypothesis was that parents of 
female schizophrenics have an opposite inter- 
parental dominance configuration. from par- 
ents of male schizophrenics (Farina, 1960). 
That is, families of female good premorbid 
(GPM) schizophrenics tend to be maternally 
dominated, those of poor premorbids (PPM) 
paternally dominated, and those of nonpsy- 
chiatric controls (N-PC) relatively equali- 
tarian. The prediction for GPMs derived 
from reports of female GPMs during con- 
joint therapy sessions (Becker, 1963, p. 49); 
the prediction for PPMs from the Fleck, 
Lidz, and Cornelison (1963) study compar- 
ing parental relations of male and female 
schizophrenics. Fleck et al. stressed the in- 
adequate role models that mothers of inferred 
PPM females provide; an effect reinforced 
by their husbands’ contemptuous derogation. 
These mothers were described as low in self- 
esteem, vague, anxious, depressed, and insen- 
sitive, in contrast with their husbands’ arro- 
gance, self-investment, and attention seek- 
ing. A Bales type analysis of female schizo- 
phrenic parental interaction by Cheek (1964) 
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gave inferential support to this formulation. 
The prediction of equalitarian relations be- 
tween N-PC parents stemmed from Farina’s 
findings. 

The second hypothesis was that greatest 
conflict occurs between parents of PPM 
schizophrenics and the least between parents 
of N-PCs. This assumption followed from 
Farina’s (1960) experimental findings, and 
Fleck et al.’s (1963) description of families 
of female schizophrenics as characterized by 
mutual parental undercutting and much overt 
conflict. 

The third hypothesis dealt with parental 
disciplinary: coerciveness or punitiveness. De- 
spite their schismatic relations, many parents 
of schizophrenics seem preoccupied with ap- 
parent compatibility or “pseudomutuality” 
(Wynne, Ryckoff, Day, & Hirsch, 1958). It 
was predicted, therefore, (a) that PPM par- 
ents report themselves as less coercive dis- 
ciplinarians than N-PC parents, whereas (b) 
PPM daughters ascribe more coerciveness to 
their parents than N-PCs. The greater par- 
ental coerciveness ascribed by PPM daugh- 
ters is construed as reflecting a rupture in 
familial pseudomutuality (Wynne et al, 
1958) which permits more veridical percep- 
tion. It was also predicted (c) that N-PC 
parents endorse the broadest range of coer- 
cive measures in coping with infractions, and 
PPM parents the narrowest range. That is, 
N-PC parents are more likely to discipline 
by simultaneously restraining, explaining, 
suggesting, and seeking causes than by harsh- 
ness, constriction, or distortion as do PPM 
parents (Lu, 1962; Wynne et al., 1958). Fur- 
ther, the greatest discrepancy between par- 
ents’ and daughters’ perceptions of parental 
coercive severity was predicted (d) within 
PPM families, and the least within N-PC 
families. Investigators of schizophrenic fam- 
ily dynamics have repeatedly stressed the 
prevalence of faulty communication and of 
shifting, poorly shared and internalized values 
(e.g., Ferreira & Winters, 1965). 

The fourth hypothesis dealt with each par- 
ents’ dominance or control in relation to the 
daughter as retrospectively reported by the 
latter. It was predicted that PPM daughters 
ascribe the strongest domineering tendencies 
toward their parents, and N-PCs the weakest 
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(Garmezy, Clarke, & Stockner, 1961; Heil- 
brun, 1960). 


METHOD 
Subjects 


The patient S criteria required adequately co- 
opertive, hospitalized females diagnosed as schizo- 
phrenic, aged 17-40, Caucasian, neither retarded nor 
brain damaged, and with their native born natural 
parents available. Parents were below age 68 and 
not incapacitated physically. Nonpsychiatric control 
families met the same criteria as the patients, but 
had no history of severe emotional disturbances. 
Controls were obtained through their ministers, and 
were paid $10.00. Most families in both groups came 
from large urban areas (Chicago or Seattle), and 
were of British, German, or Scandinavian back- 
ground as judged by surnames, 

Schizophrenic patients were rated on the modified 
Phillips Premorbid Scale (Farina, Garmezy, Zalusky, 
& Becker, 1962) and the Social Competence Scale 
(Phillips & Zigler, 1961). The Ss below the total 
patient median of 14 on the Phillips Premorbid 
Scale, and above the median of .83 on the Social 
Competence Scale were assigned to the GPM group, 
while those above the median on the former and 
below on the latter were placed in the PPM group. 
Families of three Ss whose two scores did not place 
them in either category, and of three more whose 
daughters were uncooperative were eliminated. Both 
scales were used to insure a maximally accurate 
social maturity dichotomy within a sample of 
limited size. 

Median length of hospitalization for GPMs and 
PPMs was 2 mo. The study sample included 15 
sets of parents with PPM schizophrenic daughters, 
15 with GPM daughters, and 15 with N-PC daugh- 
ters. The group differed significantly on the sup- 
plementary variables of age, education, and esti- 
mated IQ, but not on socioeconomic class (Hollings- 
head, 1957).8 Members of the PPM families tended 
to have less formal education and lower estimated 
verbal IQs than their GPM and N-PC counterparts. 


Procedure 


Parents: Private phase. Parents were seen Sepa- 
rately, but simultaneously. They filled out a basic 
data form, the CPI-Fe Scale, Franck Figure Draw- 
ings, and WAIS Vocabulary Subtest; parental speech 
samples were tape-recorded ;* and finally, the Jack- 
son Problem Questionnaire (JPQ) (Jackson, 1956) 
was orally administered following Farina’s procedure 
(1960). The 12 JPQ items require the respondent to 
indicate how the parent would cope with their 
child's aggression toward themselves or peers. 


8 Hollingshead, A. B. Two factor index of „social 
position. 1957; detailed data are available in a 
extended report from the senior author. a 

*Data from these measures are not directly T i 
vant to the concerns of this paper, and therefort 
will be reported elsewhere. 
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Parents: Joint phase. The parents were brought 
together and requested to discuss and reach agree- 
ment on each of the JPQ items, Items were read 
aloud by the E who tape-recorded the session. 

Daughters. The daughters were seen privately. 
Sequentially they were administered the WAIS Vo- 
cabulary Subtest, Garmezy-Shoben Attitude Scale 
(G-SAS) (Garmezy et al., 1961), CPI-Fe Scale, 
speech sampling, Franck Figure Drawings, and the 
JPQ. On the latter, daughters described what their 
parents joint reactions would have been. The G- 
SAS requires S to think back to his early teens and 
to indicate whether at that time his parents would 
have agreed or not with 75 statements regarding 
child-rearing attitudes and practices in the areas of 
Dominance, Overprotection, and Ignoring. Mother- 
father items of this scale were presented in counter- 
balanced order. 

Indexes of dominance and conflict. The indexes 
derived from the JPQ parental interaction were 
similar to those described in detail by Farina (1960). 
Scoring of the JPQs was done by an experienced 
rater with previously well-established interrater 
reliability.5 In addition, the relatively untrained 
junior author rated four protocols blindly. Interrater 
agreements were somewhat below those reported by 
Farina (1960), Spearman rho values for interrater 
agreement follow the indexes. Dominance indexes 
were speaks first (1.00), speaks last (1.00), speaks 
first and last (1.00), yielding maximum (.94), yield- 
ing minimum (.88), yielding average (.91), passive 
acceptance of solution (.95), total time spoken: 
father minus mother (.93); interruptions: father 
minus mother (.73), and disagreements and aggres- 
sions: father minus mother (.77). Conflict indexes 
were frequency of simultaneous speech (.92), total 
duration of simultaneous speech (.95), average dura- 
tion of simultaneous speech (.93), interruptions- 
father (.75), interruptions-mother (.74), total inter- 
ruptions, disagreements and aggressions-father (.71), 
disagreements and aggressions-mother (.79), total 
disagreements and aggressions, failure to agree 
(1.00), verbal activity (.95), and for coerciveness: 
the maximum (.85), minimum (.55), average (70), 
and range (.81) of the private and joint coercive- 
ness scores on the JPQs of the mothers, fathers 
and ascribed scores of the daughters. Each discrete 
component of a JPQ response is rated for its coer- 
civeness (Jackson, 1956). Parental dominance scores 
were those attributed by the daughters to their 
mothers and fathers on the G-SAS. 


RESULTS 


Since the groups differed significantly on 
many of the supplementary variables, corre- 


5Eilen Iwakami was the experienced rater who 
corresponded as necessary with A. Farina over ques- 
tionable scoring points. Her interrater reliability is 
discussed in an unpublished paper (Becker & Iwa- 
kami, 1967) available from the senior author. 
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lations between these and the dependent 
variables involved in testing each hypothesis 
were examined to determine whether covari- 
ance adjustments were needed.* 

Hypothesis 1. Dominance indexes and sup- 
plementary variables were essentially unre- 
lated, therefore each unadjusted index was 
subjected to a one-way analysis of variance 
multiple-range test (Dixon, 1965, pp. 572- 
606). These results are presented in Table 1. 
The findings afforded quite limited support 
for the prediction of greater paternal domi- 
nance in the PPMs, and greater maternal 
dominance in the GPMs. On 9 of the 10 
interdependent variables, the hypothesized 
trend of PPM paternal dominance occurred, 
but the predicted reversal in dominance pat- 
terns for PPMs and GPMs attained statis- 
tical significance (p < .05) for speaks first 
and last and total time spoken only. The 
GPM group was the most maternally domi- 
nated on all indexes except the yielding ones. 
Nonpsychiatric controls tended to be most 
equalitarian wth a slight inclination toward 
maternal dominance. 

Hypothesis 2. Conflict scores were adjusted 
for group differences in socioeconomic class 
and total time spoken. The prediction that 
most conflict would occur between the par- 
ents of PPMs and least between the N-PC 
parents received very modest support from 
the multiple covariate analyses of variance. 
On 10 of the 11 indexes the PPMs had the 
greatest amount of conflict, and on 9 of the 
11 the N-PCs had the least. But none of the 
group differences were statistically significant 
(p < .05). Further analysis of these data by 
Jonckheere’s nonparametric test for predicted 
order of outcome yielded similar results. 

Hypothesis 3a. Maximal parental coercive- 
ness was unrelated to the supplementary vari- 
ables. A 3 X 3 nested factorial analysis of 
variance (Dixon, 1965, pp. 503—504) of the 
groups’ (N-PC, GPM, PPM) JPQ maxi- 
mum coerciveness scores for mothers’ and 
fathers’ private and joint conditions yielded 
a significant group difference only (F = 4.70, 
df = 2/84, p< 025). Parents of N-PCs re- 
ported the strongest coerciveness (overall X 


6Tables of intercorrelations are available from 
the senior author on request. 


196 


JOSEPH BECKER AND HERBERT SIEFKES 


TABLE 1 
Means, STANDARD DEVIATIONS, AND MULTIPLE-RANGE ANALYSES OF VARIANCE OF DOMINANCE INDEXES 


Index* 


Speaks first 
x 


SD 
Speaks last 
X 


SD 
Speaks first & last 
x 


SD 
Yield maximum 


SD 
Yield minimum 
P4 


SD 
Yield average 
x 


SD i 
6 ON acceptance of solution 


SD 
Tol time speaking 


SD 
Interruptions: father minus mother 


SD 
Disagreements & aggressions: father minus mother 


SD 


Group? 
F Multiple-range 
comparisons 
N-PC GPM PPM 
—.47 | —253 1.33 | 1.55 
442 6.12 7.20 
—20| —173 1.60 | 1.27 
4.63 5.28 7.02 
—67| —427 2.93 | 2.63 PPM» GPM* 
7.24 7.00 | 10.98 
—.34 —.26 32 | .85 
1.58 1.28 1.67 
—.68 —.03 48 | 2.82 PPM» N-PC* 
99 1.14 1.76 
—.55 —.15 41 | 2.45 PPM>N-PC* 
87 1.10 1.50 
20 —.13 27 | .26 
1.15 1.41 245 
20.09 | —19.61 | 162.61 | 2.77 PPM>GPM* 
175.06 | 243.73 | 243.05 
113| —227 1.33 | 1.40 
3.76 9.65 4.97 
—07| —187| —140| 47 
5.13 5.32 5.40 


? Positive index scores reflect paternal dominance; negative ones maternal dominance. 


bN = 15 for each group. 
* p «.05. 


for N-PCs =7.23; GPMs = 6.25; PPMs = 
6.33) as predicted. 

Hypothesis 3b. Daughters’ ascribed joint 
parental coerciveness ratings on the JPQ were 
unrelated to the supplementary variables. 
They were therefore subjected to a multiple- 
range analysis of variance test. Parents of 
PPMs were rated (X — 7.83) more coercive 
(p < .05) than those of GPMs (X = 6.57), 
but parents of N-PCs were rated (X — 7.03) 
intermediately, and not different from the 
other groups. This hypothesis was not sup- 
ported. 

Hypothesis 3c. Mothers private self-re- 
ported JPQ coerciveness range scores were 
unrelated to the supplementary variables. A 
multiple-range analysis of variance of these 
scores yielded a highly significant F of 6.17 
($ < .005). As predicted, N-PC mothers had 


the broadest coercive range (X = 2.81), 
PPMs the narrowest (X = 1.75), and GPMs 
were intermediate (X — 1.91). Nonpsychi- 
atric controls differed significantly from 
GPMs and PPMs (p < .01), while the latter 
two did not differ. Neither fathers’ self-re- 
ported range scores, nor the parents’ joint 
JPQ range scores, nor the daughters’ ascribed 
JPQ range scores differed significantly. This 
hypothesis was partly supported. 

Hypothesis 3d. Discrepancy scores between 
parents’ joint JPQ and daughters’ ascribed 
JPQ maximum, minimum, and average coer- 
civeness were unrelated to the supplementaty 
variables. As shown in Table 2, the predicted 
discrepancy between parents’ and daughters 
perceptions of parental coerciveness was cols 
sistently greatest within the PPM families 
and least within the N-PC families. The 
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TABLE 2 


MurLrIPLE-RANGE ANALYSES OF VARIANCE OF DISCREPANCIES BETWEEN PARENT'S JOINT MAXIMUM, 
MINIMUM, AND AVERAGE JPQ COERCIVENESS AND DAUGHTERS’ ASCRIBED JPQ COERCIVENESS 


SD Fs Multiple-range comparisons 
3.99* 
1.52 PPM>GPM & N-PC* 
2.14 à 
1.82 
4.69** 
1.77 PPM>NPC*** 
1.74 
1.78 
525te* 
141 PPM» GPM* 
+ PPM>N-PC*** 


Source Xx 
Maximum coerciveness 
N-PC daughters minus parents —.08 
GPM daughters minus parents 24 
PPM daughters minus parents 1.63 
Minimum coerciveness 
N-PC daughters minus parents AL 
GPM daughters minus parents 1.12 
PPM daughters minus parents 2.08 
Average coerciveness 
N-PC daughters minus parents —.06 
GPM daughters minus parents 61 
PPM daughters minus parents 1.81 
a df =2/42. 
* p «.05. 
** p «.025. 
Mp <.01 


schizophrenic groups differed between them- 
selves only on the average coerciveness mea- 
sure. Discrepancy scores for patient's families 
correlated .46 (df = 28, p « .01) with poor- 
ness of daughters’ premorbid adjustment. 

Hypothesis 4. Daughters’ ascribed G-SAS 
dominance ratings for fathers and mothers 
were adjusted for positive correlations with 
fathers’ age (p < .05), and subjected to mul- 
tiple-range analyses of variance. The overall 
F (df= 2/42) for fathers was 8.53 (p< 
001); fathers were rated most domineering 
by PPMs (X= 11.41), least so by N-PCs 
(X = 5.34), and intermediately so by GPMs 
(X = 7.12). This same pattern occurred in 
the dominance ratings of mothers (overall 
F —1081, df = 2/42, p < .001; PPM X= 
13.07, N-PC X — 6.14, GPM X — 7.79). The 
GPM and N-PC groups did not differ be- 
tween themselves for either father or mother, 
while PPMs differed from both groups for 
both parents (p < .01). There was no dif- 
ference within any group in strength of domi- 
nance ascribed to either parent. The pre- 
diction that PPM daughters perceive both 
parents as having been most controlling was 
strongly supported. 


Discussion 


The gains from examining interaction with- 
in families of female schizophrenics as a 
function of the patients’ premorbid adjust- 
ment were variable. Factors which may have 
attenuated the findings include the limited 


range of premorbid adjustment within the 
sample, the sensitivity of the premorbid scale 
to developmental maturity in females, and 
the questionable validity of the interparental 
interaction measures (Fontana, 1966). Use 
of more extreme segments on the continuum 
of premorbid level and a larger sample size 
might well bolster the supportive trends on 
interparental dominance and conflict to sta- 
tistically reliable levels. However, the dangers 
of “probability pyramiding” (Neher, 1967) 
in using multiple indexes from the same inter- 
action source are considerable. Moreover, re- 
cently reported low intercorrelations between 
the dominance indexes from the structured 
situational test raise serious questions about 
their assessment of a common construct 
(Becker & Iwakami, 1967). While considera- 
ble construct validation of the premorbid ad- 
justment concept has occurred with male 
schizophrenics, its sex generality remains less 
clear. The salient determinants of psycho- 
sexual-social developmental maturity in fe- 
males may differ appreciably from males, and 
be underrepresented even in the revised Phil- 
lips Premorbid Scale. 

If Fleck et al.’s (1963) observations on pa- 
ternal dominance in families of PPM schizo- 
phrenics which are weakly buttressed by the 
data of the present study are valid, it may 
be speculated that Cheek’s (1964) “serren- 
dipitous” finding of greater activity and 
dominance in her PPM females than controls 
reflects the tendency of such patients to in- 
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appropriately overidentify with their fathers. 

Comparison of the interparental conflict 
data of this paper with those of Farina 
(1960) did not support the Fleck et al. 
(1963) contention that schismatic (chroni- 
cally discordant) parental relations are more 
characteristic of female schizophrenics’ fam- 
ilies than of males’. Rather, these conflict 
data indicated the opposite trend. To what 
extent these conflict findings were a func- 
tion of the specific task content and structure 
is indeterminate. The relationship of overt 
intrafamilial conflict per se to psychopathol- 
ogy does not appear to be a positive linear 
one in any event (Becker & Iwakami, 1967). 

Determination of the basis for the positive 
relation between severity of pathology and 
size of discrepancy in parents’ reported and 
childs’ ascribed coerciveness awaits further 
investigation. A number of alternative ex- 
planations are possible in terms of relations 
between actual behavioral events, construing 
of such events and communication and as- 
similation of such constructions. 

Similar ambiguity attends the explanation 
of the positive relation between severity of 
pathology and amount of ascribed parental 
dominance or control. It is unclear, for in- 
stance, as to whether PPM parents are ac- 
tually more controlling, or are perceived in 
this fashion partially for defensive purposes. 
Lu (1962), for example, observed that while 
parents of PPMs are somewhat controlling 
toward all their children, only the preschizo- 
phrenics sought to comply. By exaggerating 
parental restrictiveness and playing the “good 
child,” the preschizophrenic may defensively 
avoid anxiety producing awareness of their 
social inadequacies. 

Use of the broadest range of disciplinary 
methods by mothers of N-PCs should caution 
against preoccupation with parental consist- 
ency as opposed to appropriateness. Subjec- 
tive inspection of the content of mothers’ 
responses to the JPQ indicated much more 
stereotypy in the mothers of PPMs. Mothers 
of N-PCs seemed to take a broader range of 
antecedent, current, and prospective factors 
into account when disciplining their children. 
They would appear to provide their chil- 
dren with models of more flexible and exten- 
sive utilization of ego functions. 
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REGRESSION IN SCHIZOPHRENIC THOUGHT DISORDER * 
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of Wisconsin 


This study investigated the relevance of some of Chapman, Chapman, and 
Miller's findings concerning schizophrenic verbal behavior to the hypothesis of 
regression in schizophrenic thought. Chapman et al. found that schizophrenics 


misinterpret words in context by relying excessively on the strongest normal 


aspects of meaning, neglecting the weaker aspects of meaning. In the present 
study, the same task that discriminated schizophrenics from normals in this 
error tendency was administered to third, fourth, and eighth graders. Younger 
children differed from older children in the same ways that schizophrenics 
differed from normal adults. Thus, the hypothesis of regression in schizophrenic 
thought was confirmed at a descriptive level. 


Several clinical writers have stated that 
schizophrenic thought disorder consists of a 
regression to infantile levels of thought. This 
is the theoretical position of such psycho- 
pathologists as Arieti (1955),  Fenichel 
(1945), Freud (1950), Kantor and Winder 
(1959), Levin (1936), and Sullivan (1924). 
Cognitive theorists who have also endorsed 
this point of view include Goldman (1962), 
Piaget (1923), Vigotsky (1934), von 
Domarus (1964), and Werner (1957). Em- 
pirical studies of this hypothesis were re- 
viewed by Chapman, Burstein, Day, and 
Verdone (1961). The present investigation 
concerned the relevance to the regression 
hypothesis of some recent findings of 
Chapman, Chapman, and Miller (1964) 
about schizophrenic thought disorder. 

In this paper, “regression” is employed 
merely as a descriptive term referring to simi- 
larities observed between the behavior of chil- 
dren and schizophrenics, Thus, regression is 
not used here as an explanatory principle 
implying that the patients’ thought disorder 
is due to their having become like children. 
As Chapman et al. (1961) noted, such a 
usage would require demonstration of a fairly 


1This paper is based on a master’s thesis 
(Klorman, 1967) submitted by the first author to the 
University of Wisconsin and supervised by the second 
author. This research was supported by a grant 
(2 T1 MH-5363-18) from the National Institute of 
Mental Health, United States Public Health Service. 

2 Requests for reprints should be sent to Rafael 
Klorman, Department of Psychology, University of 
Wisconsin, Madison, Wisconsin 53706. 


blanket similarity between schizophrenics and 
children. 

Following Lewin (1951), regression is used 
here only to refer to a change to a less mature 
type of behavior. Regression, then, is not 
employed synonymously with “retrogression,” 
which Lewin defined as a return to a behav- 
ior that was demonstrably present at an ear- 
lier stage of a particular individual’s develop- 
ment. Empirical demonstration of retrogres- 
sion would require longitudinal data. 

Thus, the criterion used in the present 
study for inferring regression in schizophrenic 
thought was that both schizophrenics and 
children exceed normal adults in the commis- 
sion of a specific type of error on a given 
task, The task used in the present study was 
devised by Chapman et al. (1964) to test, 
their theory concerning schizophrenics’ mis- 
interpretation and misuse of words. These 
authors use the term “meaning responses” to 
refer to those internal events which represent 
the denotative referents of a word, and which 
mediate overt responses to words. The central 
tenet of this theory is that schizophrenics’ 
misinterpretations of the meanings of words 
arise in part from mediation of their overt 
responses to words by the strongest normal 
meaning responses with a neglect of weaker 
normal meaning responses. In contrast, nor- 
mal persons’ interpretations of words reflect 
the use of the weaker as well as the stronger 
meaning responses. This formulation assumes 
that meaning responses to à given word as 
well as their rank ordering on strength are 
the same for schizophrenics and normals. 
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Data supporting this assumption have been 
provided by Chapman et al. (1964) and 
Brelje (1967). Normal Ss are presumably 
able to use either the stronger or the weaker 
aspects of meaning of a word, whichever may 
be more appropriate for the task at hand. 
Schizophrenics, however, often mediate their 
interpretations of words and their overt re- 
sponses to words only by stronger meaning 
responses, regardless of their appropriateness. 

This formulation has been successful in 
predicting schizophrenics’ errors in conceptual 
sorting (Chapman et al., 1964), judgments of 
synonymity (Chapman & Chapman, 1965), 
syllogistic reasoning (Gruber, 1965), sorting 
of objects (Miller, 1967), semantic generali- 
zation (Mourer, 1966), and paired-associate 
learning (Warshauer, 1967). 

The study that is most relevant to the 
present one (Chapman et al., 1964, Experi- 
ment I) investigated schizophrenics’ misinter- 
pretations of multimeaning words, that is, 
words with more than one meaning in every- 
day usage. For example, the word “deck” may 
mean either “a part of a ship” or “a pack of 
playing cards.” A person receiving a message 
in which the word deck appears must decide, 
on the basis of contextual cues, which of its 
meanings is intended. Chapman et al. (1964) 
proposed that schizophrenics do not make an 
adequate response to those contextual cues 
that indicate that the weaker meaning of a 
multimeaning word is intended. Instead, the 
patients inappropriately interpret the word in 
accordance with its stronger meaning. How- 
ever, if the stronger meaning of the multi- 
meaning word were intended, schizophrenics’ 
performance would more closely approximate 
that of normal Ss. 

In order to test these hypotheses, the in- 
vestigators first assessed the relative strength 
of the meanings of 19 multimeaning words by 
asking normal judges to indicate and rank 
order the three most prominent aspects of 
meaning of these words. An index of the rela- 
tive strength of the meanings of a word was 
computed by averaging across judges the 
ranks for each statement of meaning. (A list 
of the words and the weights of their stronger 
and weaker meanings appear in Chapman et 
al., 1964). 
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The investigators used these judgments of 
strength of meaning to-construct a test. For 
each of the 19 multimeaning words, there 
were two items. One of these required a cor- 
rect interpretation mediated by a stronger 
meaning response and offered a misinterpreta- 
tion mediated by a weaker meaning response. 
An example follows for the word “bark.” 


After the noise was over, Janet said, “The bark is 
bad but it won’t hurt you.” 


This means: 
A. She was talking about the sound made by 
a dog. 
B. She was talking about the outer covering 
on a tree. 
C. She was talking about the color faa 
house. t 


Alternative A, which is based on the stronger 
meaning of bark, is correct. Alternative B, 
which is based on the weaker meaning of 
bark, is incorrect. 

A parallel item required a correct interpre- 
tation mediated by a weaker meaning re- 
sponse and offered a misinterpretation medi- 
ated by a stronger meaning response. 


The gardener said, “Here is an unusual bark." 
This means: 
A. He was talking about the sound made bya 


dog. 

B. He was talking about the outer covering 
of a tree. 

C. He was talking about the color of a 
house. 


In this item, the correct choice is B, the al- 
ternative based on the weaker meaning of 
bark. Alternative A, which is based on the 
stronger meaning of bark, is incorrect. — 

It should be noted that both items contain 
an irrelevant alternative which has no rela- 
tionship to the stimulus word bark. In both 
items above, Alternative C is the irrelevant 
one; however, in the test as à whole, the 
irrelevant alternative appeared equally often 
in all three positions. A certain number of 
errors might be made for other reasons than 
misinterpretations mediated by stronger or 
by weaker meaning responses. For example, 2? 
S might check alternatives randomly °° o 
the basis of their position or for other un 
identified idiosyncratic reasons. SU‘ " 
identified errors presumably would m) 
equally often in the marking of the irrelevan. 
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alternative as in the marking of the other two 
alternatives. Thus, there should be as many 
“random errors” hidden within markings of 
the other two alternatives as within mark- 
ings of the irrelevant alternative. Therefore, 
as a correction for such random errors, the 
number of markings of the irrelevant alterna- 
tive on items measuring misinterpretations 
mediated by stronger meaning responses was 
subtracted from the number of markings of 
misinterpretations mediated by the stronger 
meaning responses. The resulting corrected 
score of the tendency to make misinterpreta- 
tions mediated by stronger meaning responses 
is assumed to be relatively uncontaminated by 
random errors. A comparable correction for 
random errors was employed for items meas- 
uring misinterpretations mediated by weaker 
meaning responses. 

Chapman et al.’s findings confirmed their 
hypothesis. Although schizophrenics exceeded 
normals in errors of all types, they deviated 
more from normals on items in which the 
correct answer was a weaker response than on 
items of the reverse type. A difference score 
between the two types of misinterpretations 
significantly differentiated the two groups. 

In the present study, the hypothesis is 
that younger children exceed older children 
on this difference score. 


METHOD 
Subjects 


The Ss were pupils from the third (N = 32), 
fourth (N —33), and eighth (N — 32) grades of a 
parochial school in Madison. Each class was tested 
as a group during a regular class period and on the 
same day. Means and standard deviations of chrono- 
logical age were, third grade: M = 8.72, 5D = 49; 
fourth grade: M = 9.73, SD = 41; and eighth grade: 
M = 13.59, SD = 137. 


Task 


The task used in this study consisted of the 38 
items employed by Chapman et al. (1964, Experi- 
ment I).3 Six Ss circled more than one alternative or 
failed to circle an alternative on 1 or 2 items. For 
each such unscorable item, the corresponding paired 
item was disregarded in the computation of scores 
for the 5 involved. 


3 A copy of the test is available from either author. 
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RESULTS 


There were no significant between-sex dif- 
ferences within any grade in number of mis- 
interpretations, Therefore, the data for the 
two sexes were combined at each grade level. 
Table 1 shows the means and standard devia- 
tions of misinterpretations mediated by 
stronger and by weaker meaning responses 
and the score of the difference between both 
types of misinterpretations. All of these 
scores were corrected for random markings in 
the manner described above. 

As indicated by the Kruskal-Wallis test, 
the three grades differed significantly in the 
number of misinterpretations mediated by 
stronger meaning responses (H = 48.97, p < 
001), in the number of misinterpretations 
mediated by weaker meaning responses (H= 
12.46, p < .01), and in the score of the dif- 
ference between these two types of misin- 
terpretations (H = 35.50, p< 001). . 

The significance of the differences between 
pairs of means was tested by Mann-Whitney 
U tests. Here, as in all the analyses, two- 
tailed probability values were employed. Third 
graders exceeded eighth graders in misin- 
terpretations mediated by stronger meaning 
responses (z = 6.56, p < .001), misinterpre- 
tations mediated by weaker meaning responses 
(z = 4.08, p < .001), and in the score of the 
difference between these two types of misin- 
terpretations (z = 5.28, P< 001). 

Fourth graders exceeded eighth graders in 
number of misinterpretations mediated by 
stronger meaning responses (2 = 5.03, p< 
.001). Fourth graders also tended to make 
more misinterpretations mediated by weaker 
meaning responses than did eighth graders, 


TABLE 1 


MEANS AND STANDARD DEVIATIONS FOR 
CORRECTED SCORES OF MISINTERPRETATIONS 


Third Fourth Eighth 
grade grade grade 
Error score 
M|SD|M|SD|M SD 
4,34 | 2.71 | 2.54 | 2.60 | .22 42 
Weaker, 1.25 | 1.72 | .36 | 1.30 03 | 17 
Stronger minus weaker 3:09 | 2.98 | 2.18 | 2.51 49. | 47 


Note.—Misinterpretations mediated by stronger and weaker 
meaning responses. 
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but this difference did not reach statistical 
significance (z = 1.35, p < .16). In the score 
of the difference between the two types of mis- 
interpretations, fourth graders significantly 
exceeded eighth graders (z = 4.76, p < .001). 

Similar effects of age were reflected in the 
differences between third and fourth graders. 
Third graders made more misinterpretations 
mediated by stronger meaning responses (z 
= 2.31, p < .03) and more misinterpretations 
mediated by weaker meaning responses (z = 
2.19, p < .04). Third graders tended to ex- 
ceed fourth graders in the score of the differ- 
ence between the two types of misinterpreta- 
tions, but this difference was not significant 
(z = 1.32, p < 17). 

The finding that younger children exceeded 
older children in misinterpretations mediated 
by weaker meaning responses would appear to 
contradict the hypothesis that younger chil- 
dren rely on stronger meaning responses at 
the expense of weaker meaning responses. This 
finding parallels Chapman et al.’s (1964) re- 
sults for schizophrenics and normals. These 
investigators argued that: 


the finding that the schizophrenics exceeded the 
normal Ss in misinterpretations mediated by weaker 
meaning responses... is nevertheless consistent 
with the theory if the items which contributed to 
the difference were those on which the judges were 
less unanimous as to which is the stronger meaning. 
The logic is that if a meaning response which is 
weaker by the group judgmental norms is never- 
theless the stronger of the two meaning responses to 
this word for a minority of the judges, it is also 
stronger for a minority of the schizophrenics [p. 64]. 


To test this hypothesis, Chapman et al. 
divided the 19 words into one set of 10 words 
with relatively high-interjudge agreement 
(89-100%) as to which of the two statements 


TABLE 2 


MEANS AND STANDARD DEVIATIONS FOR CORRECTED 
Scores OF MIsINTERPRETATIONS 


Third grade | Fourth grade} Eighth grade 


Type of word 
M SD M SD M SD 


Low agreement | 1.03 | 1.42 | .21 | 1.14 | .03 | .17 
High agreement | .22 | .94| .15 | .56| .00 | .00 


Note,—Misinterpretations mediated by weaker meaning 
responses on items based on words with high- and low-inter- 
judge agreement. 


of meaning is stronger and another set of 9 
words with relatively low-interjudge agree- 
ment (50-87%). Schizophrenics significantly 
exceeded normals in misinterpretations medi- 
ated by weaker meaning responses on the 9 
items based on words with low-interjudge 
agreement, but not on the 10 items based on 
words with high-interjudge agreement. 

A comparable analysis was performed in 
the present study. Table 2 shows the means 
and standard deviations of misinterpretations 
mediated by weaker meaning responses on 
items based on words with high- and low- 
interjudge agreement. 

The three grades did not differ significantly 
in number of misinterpretations on items 
based on words with high-interjudge agree- 
ment (H = 1.96, p < .50). In contrast, the 
three grades differed significantly in number 
of misinterpretations on items based on words 
with low-interjudge agreement (H = 15.42, p 
< .001). On this set of items, third graders 
exceeded both fourth graders (z = 2.62, p< 
-01) and eighth graders (z = 4.07, p < .001) 
in number of misinterpretations. Fourth grad- 
ers, however, did not differ from eighth grad- 
ers (= .35, p= .38). Thus, the difference 
between third graders and the other two 
grades in misinterpretations mediated by 
weaker meaning responses was accounted for 
by the items based on words with low-inter- 
judge agreement as to which meaning is 
stronger. 

To check for the possibility that the stem 
sentences in the set of items measuring mis- 
interpretations mediated by stronger mean- 
ing responses tended to be more ambiguous 
than those in the other set of items, ratings of 
ambiguity of the stem sentences were obtained 
from 100 introductory psychology students. 
Items of the following format were presented: 


After the noise ceased, Janet said, “The bark is 
bad, but it won’t hurt you.” 
The word “bark” may mean either 
a) “sound made by a dog” 
orb) “outer covering on a tree” 
In the sentence above, how clear is it that the 
intended meaning is “sound made by a dog”? 


This was followed by a rating scale from 1 
(“almost completely clear”) to 5 (“not at 
clear”). The items for which weak-meaning 


-_ 


any 
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responses were correct were rated as slightly 
more ambiguous (M — 1.74) than the items 
for which strong-meaning responses were cor- 
rect (M — 1.65). This small difference was 
statistically significant ( < .001), but may 
probably be attributed to the difference in 
strength of the two meanings of the multi- 
meaning words, rather than to differences in 
strength of the contextual cues provided by 
the sentences in which the words appeared. 
Nevertheless, subsets of the two types of 
items were matched on ambiguity ratings (M 
— 1.68) by dropping two items from each set, 
and the statistical analyses of the study were 
repeated. The results were substantially un- 
changed. On the difference score of misinter- 
pretations mediated by stronger meaning re- 
sponses minus misinterpretations mediated by 
weaker meaning responses, eighth graders 
still differed from both third graders (z — 
5.35, p < .001) and from fourth graders (2 
= 5.4, p < .001). Similarly, a reanalysis of 
Chapman et al.’s (1964) data comparing 
schizophrenics and normal Ss showed that, for 
the matched subsets of items, this difference 
score differentiated the groups (z = 3.55, P 
< .001). All of the other findings of signifi- 
cant differences in both studies also remained 
unchanged. These results clearly demonstrated 
that none of the findings of these two studies 
were due to a difference in ambiguity of the 
stem sentences in the items measuring the 
two types of error. 


Discussion 


The results of this investigation confirmed 
the hypothesis that younger children exceed 
older children in the differential tendency to 
interpret words in context by relying ex- 
cessively on the meaning responses which are 
stronger for normal adults. There is, ap- 
parently, a developmental progression in pro- 
pensity toward this differential error, and the 
performance of schizophrenics corresponds to 
an early stage of this developmental trend. 

However, one need not conclude from these 
data that schizophrenics and. young children 
exhibit this differential error tendency for the 
same reasons. There is, as yet, no clear expla- 
nation for schizophrenics’ excessive reliance 
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on stronger normal meaning responses to 
mediate their overt responses to words. One 
might speculate that the children who made 
these errors had not yet acquired the weaker 
meanings of the words used in the experi- 
mental task; however, evidence for this spec- 
ulation is not presently available. 
Nevertheless, the present investigation dem- 
onstrated that younger children parallel the 
tendency of schizophrenics to rely excessively 
on stronger normal meaning responses in their 
interpretation of words presented in the con- 
text of a sentence. The results of the present 
study support, at least on a descriptive level, 
the hypothesis of regression in regard to this 
aspect of schizophrenic thought disorder. 
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SIMILARITY OF PERSONALITY TRAIT INTERRELATIONSHIPS 
IN PERSONS WITH AND WITHOUT EPILEPTOGENIC 
CEREBRAL DYSFUNCTION * 


MELVIN L. SCHWARTZ, RAYMOND D. DENNERLL, Ax» YI-GUANG LIN 
Michigan Epilepsy Center, Detroit 


The problem was posed of whether or not personality tests measure the same 
traits in brain-injured and non-brain-injured samples. It was hypothesized 
that, in a sample of persons with epilepsy, personality trait interrelationships 
would be similar to those interrelationships found in non-brain-injured samples. 
Using the CPI and EPPS intercorrelation matrices of 199 epileptics, as con- 
trasted with normative sample matrices from both test manuals, the hypothesis 
was confirmed for the persons with epilepsy. Further confirmation should 
consider variables such as severity and laterality of pathology, among others. 
However, insofar as epilepsy is concerned, the implication from Goldstein’s 
work that brain injury per se produces a general qualitative dissimilarity in 
personality trait interrelationships does not appear tenable. 


The interrelationship of personality factors 
in cases of brain injury and also the relation- 
ship between various quantifiable aspects of 
personality in the brain injured has been a 
seriously neglected area in recent investiga- 
tions of patients with. neurological insults. 
Klebanoff, Singer, and Wilensky (1954) have 
pointed out that some of the more traditional 
psychological tests such as the Rorschach and 
the MMPI offer little that is new toward the 
understanding of these relationships. This has 
been confirmed for personality research in epi- 
lepsy by Tizard (1962). An important excep- 
tion to this conclusion lies in the work of 
Meier and French (1965) relating the MMPI 
to laterality dysfunction in psychomotor epi- 
lepsy, a disease perfectly suited to this meth- 
odology of combined personality-cognitive 
assessment by virtue of the reportedly high 
incidence of severe personality disturbances. 
Certainly an allied but basic issue in the 
neuropsychology of groups with brain dys- 
function, concerns whether or not tests of 
personality and cognitive functioning are 
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Health, Education, and Welfare, Washington, D. C. 
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Schwartz who is now at the Department of Psy- 
chology, Wayne State University, College of Liberal 
Atts, Detroit, Michigan 48202. 


measuring the same traits and abilities in such 
a sample as they do in psychopathological or 
non-brain-impaired (normal) samples. 

Goldstein (1940), working with clinical 
case material, has implied that the interrela- 
tionship of thinking processes in the brain 
injured undergoes qualitative rather than 
mere quantitative modification. According to 
this view, behaviors such as cognitive-intel- 
lectual abilities or personality traits are sup- 
posedly different in kind in brain-damaged 
as opposed to non-brain-damaged individuals. 
An investigation by Reitan (1958) undertook 
to assess the validity of this impression with 
tests measuring cognitive-perceptual-motor 
abilities. He found that the Wechsler-Belle- 
vue Form I and Halstead Neuropsychological 
Battery intercorrelation matrices were essen- 
tially similar for a normal and for a brain- 
injured sample deliberately selected to be 
heterogeneous for type of brain pathology. 
Less than 5% of the same pairs of intercorre- 
lations across the two matrices for each test 
were significantly different. Reitan concluded 
that “while brain-damaged patients tend to 
show definite impairment of abilities, their 
abilities remain essentially of the same kind as 
those persons without brain damage [p. 
345].” 

Notwithstanding, Reitan’s conclusion has 
not been tested for its applicability to person- 
ality trait interrelationships. The sparse re- 
search using factor-analytic methodology com- 
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paring factor loadings in brain-injured and 
other similar samples (Berger, Bernstein, 
Klein, Cohen, & Lucas, 1964; Dennerll, Den- 
Broeder, & Sokolov, 1964) is again over- 
whelmingly devoted to the cognitive-percep- 
tual-motor abilities with the usual neglect of 
personality variables. The present study, in 
attempting to extend Reitan’s conclusion, in- 
vestigated the interrelationships among two 
measures of personality, the California Psy- 
chological Inventory (CPI, Gough, 1957) and 
the Edwards Personal Preference Schedule 
(EPPS, Edwards, 1953) in a sample of per- 
sons with brain dysfunction of epileptogenic 
etiology and compared these interrelationships 
with those already established in normative 
groups on both tests (Edwards, 1953; Gough, 
1957). On the basis of clinical observations of 
the authors made at this epilepsy center it 
was hypothesized that there would be signifi- 
cant similarities in the personality test inter- 
correlation matrices of persons with epilepsy 
as compared to normative samples. 


METHOD 
Subjects 


The Ss were 199 persons with epilepsy (125 males 
and 74 females) with diagnoses confirmed by neuro- 
logical examination, seizure history, and EEG. They 
were a series of consecutive volunteers in an out- 
patient diagnostic and research center for epilepsy 
who were participating in a larger study of neuro- 
logical, social, and psychological factors in the em- 
ployability of epileptics. The mean age of the group 
was 26.4 yr. (range 15.0-55.4), mean education was 
11.4 yr. of completed schooling (range 7-17), and 
mean WAIS Full-Scale IQ was 97.7 (range 61-137). 
The sample was unselected with regard to type of 
seizure, and contained Ss with grand mal, petit mal, 
psychomotor, and mixed seizure types. The Ss came 
from such diverse sources as general practitioners, 
self-referrals, neurologists, schools, the Department 
of Vocational Rehabilitation, hospitals, out-patient 
clinics, employers, and concurrent case files of the 
Michigan Epilepsy Center. 


Procedure 


The CPI and EPPS, embedded in a larger battery 
of psychometric techniques were individually admin- 
istered to each S. They were scored according to the 
directions and values provided by the respective 
manuals. The correlational methodology used by 
Reitan was also used here to facilitate direct cross- 
study comparisons, This methodology employed 
inter- and intratest correlations as the basis for 
comparison among groups. In the intertest correla- 


M. L. Scuwartz, R. D. DENNERLL, AND Y1-Guanc LIN 


tions, each of the 18 CPI subscale raw scores was 
correlated with each of the 15 EPPS subscale raw 
scores. The consistency score was omitted from con- 
sideration in the latter. This provided an intercorre- 
lation matrix containing 270 rs for Ss with epilepsy. 
Another matrix for males only with epilepsy was 
also generated. These matrices were compared with 
normative data matrices provided by both test man- 
uals. Gough’s manual provided an EPPS-CPI inter- 
correlation matrix for 45 college males of high schol- 
astic aptitude (Gough, 1957; hereafter referred to as 
the “Gough matrix”). Intratest correlations in the 
epilepsy sample were established by correlating each 
CPI subscale with every other subscale on that test, 
The analogous procedure was carried out for the 
EPPS. Intrascale correlation matrices for “normal” 
Ss were also available in the respective test manuals. 
These normative intratest matrices were then com- 
pared with those evolving from the epilepsy sample. 

The specific comparisons between matrices were of 
two types: (a) pair-wise intercorrelation and (b) 
matrix correlation. In the pair-wise comparisons, 
every intertest subscale correlation in the normative 
samples was compared with the same correlation in 
the epilepsy samples. In the matrix comparisons, the 
CPI-EPPS matrix of the one sample was correlated, 
using each r as a score, with the same matrix in 
the other sample, yielding one r which expressed the 
overall degree of relationship between the two 
matrices. 


RESULTS 


The intercorrelation of CPI and EPPS 
scores yielded an 18 X 15 matrix of 270 7's 
for male epileptics separately, and for all 
(male and female) epileptics combined." 
There were 96 significant r’s (.05 level used in 
this and subsequent tests) in the male epi- 
lepsy group matrix and 110 significant 7’s in 
the combined-sexes matrix. The CPI-EPPS 
matrix in Gough's manual yielded only 28 
significant 7s, 19 of which were also signifi- 
cant in the male epilepsy matrix. The nearly 
four times as many significant 7's in both epi- 
lepsy matrices relative to Gough's data may 
lead to the deceiving impression that EPPS 
and CPI traits are more closely related m 
samples of epileptic Ss than in a non-brain- 
injured sample. Of course, for purposes of sta- 
tistical significance, each correlation was Con- 


3A 33 X33 matrix combining intra- as well e 
intertest correlations in the epileptic group oo 
have been generated. However, since the Goug 
manual did not contain either an EPPS or CPI intr 
correlation matrix for the 45 males on whom " 
reported CPI-EPPS intercorrelations, a contrast ^: 
epileptics and these normal males on jntratest ” 
was prohibited. 
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sidered relative to the parameters (mean, 
variation) of only the sample from which it 
was derived. Hence, the above values do not 
reflect any comparison between identical pairs 
of correlations across the two matrices. Such 
pair-wise comparisons would be required in 
order to discover the actual magnitude and 
thence, the significance of the differences be- 
tween each pair of correlations in the com- 
parison of any two matrices. 

For purposes of pair-wise comparisons, the 
270 r's in each CPI-EPPS matrix were con- 
verted to z values using Fisher’s r to 2 trans- 
formation (Edwards, 1960). The 2 value for 
each r in each matrix was tested for signifi- 
cance against the corresponding z value of the 
same r in the other matrix. Comparison of 
the matrices of the normative sample males as 
compared to epileptic males yielded 16 or 
5.9% of pairs with differences significant at 
or below the .05 level. By chance alone, 5% 
of the pairs would be expected to differ sig- 
nificantly, suggesting that the differences be- 
tween corresponding correlations in the two 
matrices are essentially negligible. Compari- 
son of the Gough matrix with an all female 
epilepsy matrix yielded 15 or 5.6% signifi- 
cant differences and with the combined male 
and female epilepsy matrix, 12 or 4.4% sig- 
nificant differences. These latter two difference 
tests may be somewhat artificial in that they 
compare female epileptics with normal males. 
Nevertheless, they do suggest that neither the 
epilepsy variable nor the sex variable are 
sufficient to produce major trait interrelation- 
ship differences between the normals and the 
epileptics. 

Although the lack of differences between 
epileptics and normals is certainly the pri- 
mary finding of this study, there did remain 
four identical interrelationships in each of the 
three epilepsy matrices (male, female, and 
combined sexes) which were significantly dif- 
ferent (p<.05) from the corresponding 
Gough matrix r’s. The interrelationship be- 
tween Autonomy (EPPS) and Intellectual 
Efficiency (CPI) in Gough's sample was +.26, 
whereas in the three epilepsy samples, the re- 
spective values were —.18, —.25, and —.13. 
The three remaining significant correlational 
differences were: Abasement (EPPS)-Domi- 
nance (CPI): epilepsy r’s = —.36, —.37, 


—.40; Gough: +.01; Nurturance (EPPS)- 
Dominance (CPI): epilepsy ”’s = —.13, 
—.15, —.4; Gough: +.36; Nurturance 
(EPPS)-Sociability (CPI): epilepsy 7’s= 
—.15, —.18, —.14; Gough: +.26. Interpreta- 
tion of so few identical significant differences 
would be premature if not hazardous without 
a replication, however. 

Finally, the correlation between entire 
matrices was examined. Using each correla- 
tion in each matrix as a score, the Gough 
matrix correlated .47 with the male epilepsy 
matrix, 44 with the female epilepsy matrix, 
and .51 with the combined male and female 
epilepsy matrix. These intermatrix rs are 
not as high as those reported by Reitan for 
cognitive tests (.79, .78, .85). Personality 
trait intercorrelations, however, are usually 
lower than the intercorrelations of cognitive or 
intellectual abilities. Nevertheless, these three 
intermatrix r’s are all statistically significant 
(p < .001) and represent a sufficient degree 
of agreement to state that there is a striking 
similarity in personality trait interrelation- 
ships in the epileptic and normative samples. 
Further support for this conclusion was also 
found in the total matrix correlations of .92 
and .85 (both p< .001) for normal males 
versus epileptic males and normal females 
versus epileptic females, respectively, between 
the CPI 18 X 18 intracorrelation matrix for 
each group. Similarly, for the 15 x 15 EPPS 
intracorrelation matrices, the same compari- 
sons yielded 7’s of .79 and .82 (both p< 
001). 

Discussion 


The results demonstrate very striking simi- 
larities between a group of Ss with epilep- 
togenic dysfunction and two normative non- 
brain-damaged groups in the interrelationship 
of CPI and EPPS personality traits. Although 
there may very well be equally striking quan- 
titative differences between this sample of Ss 
with brain dysfunction and the normal sam- 
ples used here,* the presence of brain dys- 


4 Either test may differentiate epileptics from non- 
epileptics on elevation (mean subscale values) or 
certain specific scale interactions, even though, as 
these results imply, the internal factorial structures 
are not at variance for these groups. However, this 


kind of differentiation, if it exists, is not the focus of 
this study. 
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function did not modify the correlational 
interrelationships between the various traits 
measured by the two tests. Therefore, Rei- 
tan’s finding of essentially similar interrela- 
tionships between cognitive abilities in a 
brain-damaged versus a non-brain-damaged 
sample can be extended to the area of person- 
ality traits in cases of brain dysfunction of an 
epileptogenic variety. The results do not offer 
any support for those theories of the epilep- 
tic personality which posit that there are uni- 
tary, consistent, or characteristic personality 
features of all forms of the disease. However, 
as Tizard (1962) has concluded in her review 
of the theories of the epileptic personality, 
“progress in this field depends on a recogni- 
tion and study of the complex environmental 
and pathophysiological factors involved, and 
on the development of reliable criteria with 
which to classify epileptics [p. 207].” 
Certainly the results of this study are lim- 
ited in generalization to the epilepsies. Fur- 
ther validation of these findings must await 
studies manipulating the massive network of 
variables underlying various neurological in- 
sults, Site and type of lesion, severity, and 
laterality of pathology, for example, must all 
be varied systematically, with similar results, 
before a sweeping generalization of the quali- 
tative similarity of personality traits and 
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abilities of a brain-damaged to a non-brain- 
damaged population is made. 
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MULTIDIMENSIONAL ANALYSIS OF “HYPNOTIC” BEHAVIOR * 


THEODORE XENOPHON BARBER? anp DAVID SMITH CALVERLEY 
Medfield Foundation, Harding, Massachusetts 


Ss who had not previously participated in a hypnotism experiment were told 
simply to close their eyes for 5 min. and place themselves in hypnosis. These 
Ss were almost as responsive as a group of Ss who were exposed to a formal 
hypnotic-induction procedure on four dimensions of “hypnotic” behavior: (a) 
trancelike appearance (e.g, apparent limpness-relaxation) ; (b) responses to 
test suggestions to hallucinate, to be amnesic, etc.; (c) reports of unusual 
experiences such as reported "disappearance" of the body or body parts; and 
(d) testimony of having been hypnotized. An additional group of Ss (controls) 
who were told only to close their eyes for 5 min. were generally less responsive 
than the place-yourselí-in-hypnosis group and the hypnotic-induction group on 


the dimensions of hypnotic behavior. 


During the past decade a wide variety of Ss 
have been exposed to repeated suggestions 
that they were becoming relaxed, drowsy, 
sleepy, and were entering a trance state. 
Typically, these repeated suggestions gave rise 
to one or more of the following sets of behav- 
iors: (a) Ss appeared passive, limp, very 
relaxed, as if they were in a trance; (b) they 
manifested a rather high level of response to 
standardized test suggestions (e.g, sugges- 
tions for limb rigidity, hallucination, and am- 
nesia); (c) they reported that they had un- 
usual experiences; and (d) they testified that 
they were hypnotized. The authors believe 
that a general theory is needed that will 
explain each of these four sets of behaviors 
that are typically elicited by procedures that 
are commonly labeled as hypnotic inductions. 
However, the authors are not as yet con- 
vinced that a theory which succeeds in ex- 
plaining these behaviors will necessarily em- 
ploy constructs such as hypnosis, hypnotized, 
or hypnotic trance. Furthermore, it is the 
authors’ opinion that a satisfactory theory 
will be based on empirical studies which (a) 


l'This research was supported by grants (MH- 
07003 and MH-11521) from the National Institute 
of Mental Health, United States Public Health 
Service. The authors are indebted to Mildred O’Brien 
of Worcester City Hospital, Phyllis Olson of Memorial 
Hospital of Worcester, and A. Simoens of Rhode 
Island Medical Center for urging their students to 
Participate in the experiment and for providing 
experimental rooms at their respective hospitals. 

? Requests for reprints should be sent to Theodore 
Barber, Research Department, Medfield Foundation, 
Harding, Massachusetts 02042. 
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establish reliable methods for measuring each 
of the four sets of behaviors, (b) determine 
the strength of association between the be- 
haviors (R-R laws), and (c) specify the ante- 
cedent variables that are functionally related 
to each of the behaviors (S-R) laws. Before 
an investigation is presented that was con- 
cerned with the latter aims, the presentation 
will be clarified by describing more precisely 
the four sets of behaviors that need to be 
explained, These behaviors, which the authors 
are labeling as dimensions of “hypnotic” 
behavior, are as follows: 

1. Trancelike appearance. Weitzenhoffer 
(1957, p. 210; 1963, p. 42) has described 
four facets of the trancelike appearance that 
are typically manifested by Ss who have been 
exposed to a hypnotic-induction procedure, 
namely, (a) a type of limpness-relaxation, 
(b) loss of spontaneity, (c) psychomotor re- 
tardation, and (d) "trance stare." In the in- 
vestigation presented below the authors will 
delineate methods for measuring each of these 
four components of trancelike appearance and 
will attempt to relate each component to 
antecedent variables that are present in the 
situation. 

2. Response to test suggestions. This set of 
behaviors is elicited by directly stating (sug- 
gesting) to S that he will experience certain 
sensations or events or that he will perform 
certain behaviors. For instance, it is suggested 
to S that he cannot bend his arm, he will see 
and hear objects (that are not present), and 
he will forget everything that occurred. The 
set of behaviors that is elicited by suggestions 
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of this type is by far the best quantified of 
the four under consideration. Recent work by 
Weitzenhoffer and Hilgard (1959, 1962), 
Shor and Orne (1962), London (1963), and 
Barber (1965) has yielded a series of stand- 
ardized scales which satisfactorily measure 
response to test suggestions for limb rigidity, 
hallucination, amnesia, analgesia, age regres- 
sion, posthypnotic behavior, and so on. 

3. Reports of unusual experiences. Gill and 
Brenman (1959, pp. 13-17) have presented 
a detailed description of the verbal behaviors 
that are subsumed under this dimension. They 
state that Ss who have been exposed to a 
hypnotic-induction procedure typically report 
(a) alterations in the size of the body or body 
parts, (5) changes in equilibrium, (c) feeling 
as if part of the body or the entire body had 
disappeared, (d) feelings of unreality, (e) 
changes in experienced temperature, and (f) 
feeling as if the E's voice is either very near 
or very far. In the investigation to be de- 
Scribed the authors will attemp to quantify 
and to relate to antecedent variables 
each of these six types of reports of unusual 
experiences. 

4. Testimony of having been hypnotized. 
Hilgard and Tart (1966, p. 205) have under- 
scored the importance of this dimension, 
writing that Ss’ reports that they entered a 
hypnotic state “furnish a somewhat crude but 
useful answer to the need for evidence that 
a person is hypnotized. . . .” Methods for 
measuring this dimension have been presented 
in several recent studies (Barber & Calverley, 
1966; Hilgard & Tart, 1966; Tart & Hilgard, 
1966). 

The experiment presented in the following 
section was designed (a) to quantify each of 
the four dimensions of hypnotic behavior, (5) 
to ascertain the interrelations among the four 
dimensions, and (c) to isolate antecedent 
variables that play a role in determining to 
what degree S will respond on each dimension. 
One question at the forefront of discussion is, 
Are the four sets of hypnotic behaviors inter- 
correlated? For instance, if a S is highly re- 
sponsive to test suggestions, does he also 
manifest a trancelike appearance, report 
many unusual experiences, and testify that 
he was deeply hypnotized? Another question 
is derived from the following consideration: 
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Hypnotic-induction procedures include several 
distinguishable antecedent variables. For in- 
stance, S (a) sits quietly with eyes closed for 
a period of time and (5) is told directly or 
indirectly that he is expected to enter hyp- 
nosis. Are the four sets of hypnotic behaviors 
functionally related to these two distinguish- 
able antecedent variables that are included in 
hypnotic-induction procedures? An additional 
question derives from the fact that Ss have 
different preexperimental expectations con- 
cerning how they will perform in a hypnotic 
experiment. Can Ss' performance on each of 
the four behavioral dimensions be predicted 
by assessing their expectations prior to the 
experiment? 


METHOD 


Overall Experimental Design 


The Ss were assigned to three groups (two experi- 
mental groups and one control group). Each S was 
tested individually by one E (DSC). The E was not 
acquainted with Ss prior to their entrance into the 
experimental room. 

The Ss assigned to one of the experimental groups 
were exposed individually to a standardized hypnotic- 
induction procedure (hypnotic-induction group). The 
Ss allocated to a second experimental group were 
told individually to close their eyes for 5 min. and 
place themselves in hypnosis (place-yourself-in- 
hypnosis group). The Ss allocated to a third group 
were individually told only to close their eyes for 
5 min. (controls). 

The Ss assigned to the hypnotic-induction and 
place-yourself-in-hypnosis groups were assessed pre- 
experimentally on expectations (self-predictions) of 
hypnotic depth. After imposition of the experimental 
and control treatments, all Ss were rated individually 
on trancelike appearance as indicated by limpness- 
relaxation, loss of spontaneity, trance stare, and 
psychomotor retardation. The hypnotic-induction and 
place-yourself-in-hypnosis groups were also rated on 
response to test suggestions, for example, arm im- 
mobilization, hallucination, and amnesia. Posterpeti- 
mentally, all Ss filled out a questionnaire whic 
assessed reports of unusual experience and testimony 
of having been hypnotized. 


Recruitment of Subjects 


All Ss were female student nurses. They wete 
“coerced volunteers” in that they were told by th 
supervisors that they were expected to parka aa 
in the experiment in order to become better à! 
quainted with psychological research (cf. Boucher 
Hilgard, 1962). z 

Experimental subjects, The nursing supervisors # 
two hospitals directed their students, 112 in all, 
report individually at scheduled times to a SI 
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room to participate in a psychological experiment 
involving hypnosis. Two Ss failed to keep their ap- 
pointments. The remaining 110 were assigned 
randomly, in order of their appearance in the ex- 
perimental room, to one of the two experimental 
groups (hypnotic-induction or place-yourself-in-hyp- 
nosis group) with 55 to each group. 

Control subjects. A control group was included in 
the experiment in order to determine to what extent 
the four sets of hypnotic behaviors are due to the 
fact that Ss sit quietly for a period of time with 
eyes closed. If the control Ss were to serve this pur- 
pose, it was imperative that they should not have 
interpersonal contacts with the experimental Ss. If 
the control and experimental Ss talked with each 
Other, it appears very likely that the controls would 
learn from the experimental Ss that the investigation 
involved hypnosis and, consequently, the controls 
might interpret their instructions (to close their eyes 
for 5 min.) as instructions to place themselves in 
hypnosis. To prevent this contamination, the control 
and experimental groups were recruited from dif- 
ferent institutions; that is, the controls were 50 stu- 
dent nurses, who were in training at another hospital, 
and who were directed by their supervisors to report 
individually at scheduled times to a specific room to 
participate in a psychological experiment. All 50 met 
their appointments. 

It is worth noting that, although the control and 
experimental groups were in training at different 
hospitals, they: came from a homogeneous popula- 
tion. The control and experimental Ss were of the 
same sex (female), of the same age (17-22), training 
for the same occupation (nursing), raised in the same 
geographical locality (eastern Massachusetts and 
Rhode Island), and of similar ethnic-religious back- 
ground (predominantly Irish and Italian Catholic). 


Expectations of Hypnotic Depth 


After each S in one of the two experimental groups 
Was greeted and asked her name and age, she was 
given a mimeographed preexperimental questionnaire. 
One item on this questionnaire asked whether S had 
previously permitted anyone to try to hypnotize her. 
A second item was worded thus: 


I expect (predict) that in this experiment I 


will be (please fill in percentages so as to sum 
to 100%): 


(a) deeply hypnotized % of the time 
(b) medium hypnotized % of the time 
(c) lightly hypnotized % of the time 
(d) not hypnotized % of the time 


Total = 100% 


This expectations-of-hypnotic-depth item was 
Scored thus: The percentage figures placed by S in 
Parts (a), (b), (c), and (d) were multiplied by 
3, 2, 1, and 0, respectively, and the resultant figures 
Were added and divided by 100. Thus, S's score was 
2.0 if she expected that she would be medium hypno- 
tized 10096 of the time and her score was 1.0 if she 
expected to be lightly hypnotized 100% of the time. 
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Since the authors did not want the control Ss to 
define the experimental situation as hypnosis, they 
were not given the preexperimental questionnaire. 


Experimental and Control Conditions 


Upon completing the preliminaries described above, 
the Ss were treated as follows: 

Hypnotic-induction group. The 55 Ss allocated to 
this group were exposed individually to Form A of 
the Stanford Hypnotic Susceptibility Scale (SHSS, 
Weitzenhoffer & Hilgard, 1959, pp. 13-26). Spe- 
cifically, each S assigned to the hypnotic-induction 
group was first exposed to statements intended to 
establish rapport and to suggestions of postural sway 
and then was exposed to a hypnotic-induction pro- 
cedure comprised of repeated suggestions of eye 
heaviness, eye closure, relaxation, deep sleep, and 
deep hypnosis Upon completion of the hypnotic- 
induction procedure, and without a break in the 
continuity of presentation, each S was assessed on 
response to the 10 final test suggestions that are 
included in the SHSS: namely, hand lowering, arm 
immobilization, finger lock, arm rigidity, hands 
moving together, verbal inhibition, hallucination of 
a fly, eye catalepsy, posthypnotic response, and 
amnesia.* 

Place-yourself-in-hypnosis group. The 55 Ss as- 
signed to the place-yourself-in-hypnosis group were 
not exposed to the hypnotic-induction procedure. 
Each S in this group was first told individually that 
her participation in the hypnosis experiment could 
contribute to scientific knowledge, she was next as- 
sessed on the postural sway test (from the SHSS), 
and then was instructed as follows: 

*During the next five minutes I want you to close 
your eyes and place yourself in hypnosis. Let your- 
self go as deeply into hypnosis as you possible can 
and wait for further instructions," 5 

On completion of the 5-min. period, the final 10 
test suggestions included in the SHSS were adminis- 
tered. 

Control group. As stated above, 50 control Ss were 
included in the experiment to determine the effects 


8Two minutes before the end of the hypnotic- 
induction procedure each S in this group was told, 
“I want you to open your eyes, to remain in hyp- 
nosis, and to wait for further instructions. Now open 
your eyes and wait for further instructions.” The 
E observed the S’s eye movements for 15 sec, and 
rated trance stare in accordance with the criteria 
presented in the next section of this paper. The E 
then stated, “Close your eyes,” and continued the 
hypnotic-induction procedure for an additional two 
min. 
1 The E read the SHSS to S with the exception of 
four paragraphs that were presented from memory. 
The memorized portion was presented when E was 
rating Ss on trancelike appearance (see next section 
of this paper). 

5 Trance stare was rated during the fourth 
minute of the 5-min. period in the same way as for 
the hypnotic-induction group (see Footnote 3). 
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due simply to sitting quietly for a period of time 
with eyes closed. Soon after each S in the control 
group had entered the experimental room, she was 
individually instructed thus: 

“I want you to close your eyes during the next 
five minutes. Now close your eyes and keep them 
closed and wait for further instructions.” € 


Trancelike Appearance 


According to Weitzenhoffer (1957, 1963) and other 
authorities (Erickson, Hershman, & Secter, 1961, pp. 
53-58; Pattie, 1956), hypnotized Ss typically mani- 
fest a trancelike appearance. As indexes of trance- 
like appearance these authors list (a) limpness- 
relaxation, (b) loss of spontaneity, (c) trance stare, 
and (d) psychomotor retardation. Each of the four 
indexes was measured in the present experiment. 
Each S was asked to open her eyes for 15 sec. and 
trance stare was scored during this eyes-open period 
(see Footnotes 3, 5, and 6). Limpness-relaxation and 
loss of spontaneity were assessed during the 3-min. 
period which preceded the eyes-open interval and 
psychomotor retardation was rated toward the end 
of the experimental session. Specifically, the following 
criteria for scoring each of the four indexes of trance- 
like appearance were derived from Weitzenhoffer’s 
(1957, pp. 210-211; 1963, p. 42) detailed description 
of the phenomena. 

Limpness-relaxation was considered to be present 
if S appeared relaxed or was slumped in the chair, 
her head was tilted downward toward the chest or 
backward or sideways over a shoulder, and her hands 
either hung limply on the sides or rested limply on a 
support (Weitzenhoffer, 1957, p. 210). The E used 
a stopwatch to record the amount of time S met 
these criteria. Scores of 3, 2, 1, and 0, respectively, 
were assigned if, during the 3-min. period which 
preceded the instructions to open the eyes for 15 
sec, S manifested limpness-relexation more than 
90% of the time, 50% to 90% of the time, less than 
5096 of the time, and not at all. 

"To score loss of spontaneity E recorded the num- 
ber of times, during the 3-min. period preceding the 
instructions to open the eyes for 15 sec, that S 
Spoke, opened her eyes, or voluntarily changed posi- 
tion of body or limbs (Weitzenhoffer, 1957, pp. 
210-211). A score of 3 was assigned for loss of 
spontaneity if, during the 3-min. period, S did not 
speak, open her eyes, or change position of body or 
limbs. If S changed position of body or limbs once, 
she was given a score of 2; if she changed position 
twice, she was given a score of 1; and if she spoke, 
opened her eyes, or changed position of body or 
limbs more than twice, she received a score of 0. 


9 To rate trance stare the controls were told, 
during the fourth minute of the 5-min. period, “I 
want you to remain like you are and open your 
eyes. Now open your eyes and wait for further 
instructions." The E observed the S's eye movements 
for 15 sec. and, after rating trance stare, asked S 
to close her eyes again. 
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To score trance stare E carefully observed S’s eye 
movements during the 15-sec. period when her eyes 
were open during the experiment. In accordance with 
the criteria presented by Weitzenhoffer (1963, p. 42), 
trance stare was considered to be present if S did 
not move her eyes around the room; that is, if she 
“stared” at or focused upon a point in space or at 
a particular object. Scores of 3, 2, 1, and 0, respec- 
tively, were assign if trance stare was present more 
than 90% of the time, 50% to 90% of the time, less 
than 50% of the time, and not at all. 

Immediately before E stated that the experiment 
was over, Ss in the control group were told to extend 
the right arm straight out and then to extend the 
left arm straight out. The Ss in the experimental 
groups were also told to extend the right and left 
arms straight out (at the beginning of the hand 
lowering and the arm rigidity test suggestions of the 
SHSS). The time in seconds between the end of the 
instructions to extend each arm and the completion 
by S of movement of the arm to horizontal position 
was carefully measured with a stopwatch. The score 
for psychomotor retardation was the average of these 
two measurements, Thus, slower arm movements 
(longer time) indicated more psychomotor retarda- 
tion (Weitzenhoffer, 1963, p. 42). 

Reliability of ratings of trancelike appearance. The 
ratings of trancelike appearance were found to be 
reliable (5 < .001) in a separate pilot study with 
three individuals (who did not participate in the 
experiment that is described in this paper). When 
asked to participate in the pilot study, these indi- 
viduals (one medical technician and two secretaries) 
were told (a) they would each be exposed 10 times 
to a hypnotic-induction procedure, (b) they were to 
act, in each session, as if they became hypnotized 
to a deep, medium, or light level, and (c) they were 
to choose, prior to each session, which level of 
hypnosis they would simulate without informing the 
investigators as to the level they had selected. During 
each of the 30 individual sessions, two investigators 
(the co-authors of this paper) rated S on the indexes 
of trancelike appearance. Satisfactory interobserver 
agreement was obtained. Using the criteria delineated 
above, the interrater reliabilities (Pearsonian correla- 
tions), averaged over the 30 sessions, were .87, 99, 
92, and .91 for limpness-relaxation, loss of spon- 
taneity, trance stare, and psychomotor retardation, 
respectively. 


Reports of Unusual Experiences 


Immediately after it was stated that the experi- 
ment was over, each of the 160 Ss was given à 
mimeographed postexperimental questionnaire. The 
first part of this questionnaire included six items 
that were designed to assess unusual experiences such 
as changes in the size of the body and feelings of 
unreality. In constructing these items the authors 
adhered very closely to Gill and Brenman’s (1959, 
pp. 13-17) description of the phenomena. The items 
were worded as follows: 
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1. During the experiment, did you experience 
any alterations in the size of your body or in the 
size of parts of your body? For instance, did your 
whole body or your legs, arms, head, mouth, lips, 
etc. feel large or swollen or small or shrunken? 
Ves) = "No. 

2. During the experiment did you experience any 
changes in equilibrium such as the following: 
sensations of giddiness or dizziness; as if you were 
rocking back and forth; as if the chair was sliding 
away from you; as if your body had lost its 
moorings and was floating up near the ceiling; or 
as if you were sinking endlessly down into space? 
Yes____.. No. 

3. At some time during the experiment did you 
feel as if parts of your body (your arm, foot, leg, 
etc.) or your entire body had “disappeared,” that 
is, as if you had lost parts of your body or all 
of your body? Yes____.. No. 4 

4. When you were asked to open your eyes 
during the experiment, did things seem unreal, 
shadowy, dreamlike, or as if covered by a veil or 
haze? Yes... No. 

5. At any time during the experiment did you 
suddenly feel very cold or very hot? Yes__—. 
No. à 

6. At any time during the experiment did you 
feel as if the experimenter’s voice was very close 
(as though he were talking into your ear) or as 
if the experimenter’s voice was going further and 
further away? Yes____.. No. 


Each of the six items received a score of 1 for a 
yes answer and a score of O for a no answer. 


Testimony of Having Been Hypnotized 


The second part of the postexperimental question- 
naire included the following item: “I believe that 
during the experiment I was: deeply hypnotized 
% of the time; medium hypnotized 96 
of the time; lightly hypnotized % of the 
time; not hypnotized % of the time.” This 
testimony-of-having-been-hypnotized item was scored 
in the same way as the expectations-of-hypnotic- 
depth item that was included in the preexperimental 
questionnaire, 

The postexperimental questionnaire also included 
an additional item: “What basis did you use to 
judge your depth of hypnosis?" Also, Ss in the 
Place-yourself-in-hypnosis group were asked, “How 
did you go about placing yourself in hypnosis?” 

Immediately after completing the postexperimental 
questionnaire, each S was told that the experiment 
would be ruined if she discussed it with the other Ss 
before all had been tested. Each S stated that she 
understood the importance of this admonition and 
Promised to adhere to it. 


Perception of the Situation by the Control 
Group 


A series of precautions were taken to prevent the 
control group from defining the situation as hyp- 
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nosis. For instance, the controls did not come in 
contact with the experimental Ss (to whom the 
situation had been defined as hypnosis), they were 
not given the preexperimental questionnaire which 
assessed expectations of hypnotic depth, and they 
were not exposed to the rapport instructions, the 
postural sway test, the hypnotic-induction procedure, 
and the test suggestions that are included in the 
SHSS. To determine whether the precautions were 
successful, Ss in the control group were treated as 
follows when they were assembled in class the day 
after the experiment was completed, 

The controls were asked to write the answers to 
several questions. Since it appeared likely that ano- 
nymity would produce more truthful responses, they 
were told not to sign their names. Three of the 
questions, which were written on the blackboard 
were: 


(a) Write everything you heard about the ex- 
periment before you were tested. (b) During the 
experiment, when you eyes were closed, did you at 
that time think the experimenter was trying to 
hypnotize you or wanted you to be hypnotized? 
(c) Did you think that the experiment might have 
been concerned with hypnosis prior to the time 
when you saw the question on hypnosis on the 
questionnaire you received at the end of the 
experiment ? 


In reply to the first question, no S mentioned the 
word hypnosis. One S answered yes to the second 
and third questions, and the remaining 49 answered 
no. The authors concluded from these data that the 
efforts to prevent the control Ss from defining the 
situation as hypnosis were quite successful. The an- 
swers to the above three items strongly indicated 
that, with one exception, the 50 Ss in the control 
group did not believe, when they were sitting quietly 
with eyes closed for 5 min., that they were supposed 
to enter hypnosis or that they were being tested for 


hypnosis. 


Previous Participation in a Hypnotism 
Experiment 

The preexperimental questionnaire, which was 
given to the 110 Ss in the two experimental groups, 
asked if S had previously permitted anyone to try 
to hypnotize her. The 50 control Ss were asked the 
same question after the experiment was completed. 
Four of the 110 Ss in the two experimental groups 
and none in the control group stated they had been 
previously exposed to hypnosis and that the hypnosis 
was relatively unsuccessful—they either were not 
hypnotized at all or were hypnotized only to a very 
light level. In the present experiment, 2 of the 4 Ss 
who had prior experience were assigned to the place- 
yourself-in-hypnosis group and 2 to the hypnotic- 
induction group. Since the performance of each of 
the 4 experienced Ss on the dimensions of hypnotic 
behavior was either slightly below the mean or very 
close to the mean performance of the inexperienced 
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Ss in their respective groups, the results for the 
place-yourself-in-hypnosis group and the hypnotic- 
induction group will be presented without making 
special distinctions between the 4 experienced and 
the 106 inexperienced Ss in these groups. 


Analyses of Data 


The scores obtained on each dependent variable 
by the hypnotic-induction, place-yourself-in-hypnosis, 
and control groups were subjected to one-way analy- 
sis of variance. When the variance analysis yielded 
a significant effect, it was further localized by per- 
forming Duncan multiple-range tests upon the group 
means. Several variables that were measured only 
for the two experimental groups were tested for 
significance by ¢ tests. Also, Pearson product-moment 
correlations were computed among each of the major 
variables. These intercorrelations were calculated 
separately for each of the treatment groups. 


RESULTS AND DISCUSSION 


Comparison of Control Group with 
Experimental Groups 


Both the control group and each of the 
experimental groups were assessed on trance- 
like appearance, reports of unusual experi- 
ences, and testimony of having been hypno- 
tized. Table 1 shows that the control group 
did not differ significantly from either the 
hypnotic-induction or the place-yourself-in- 
hypnosis group on one of the indexes of 
trancelike appearance (loss of spontaneity) 
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and on three of the six reports of unusual 
experiences (alterations in the size of the 
body or body parts, changes in equilibrium, 
and changes in experienced temperature). 
These results indicate the following proposi- 
tion which might well be taken into con- 
sideration in formulating a theory of hypnotic 
behavior: 

Proposition 1. Some of the behaviors that 
have been traditionally associated with the 
word hypnosis—for example, loss of spontane- 
ity, and verbal reports of having had some 
types of unusual experiences (such as altera- 
tions in the size of the body, in equilibrium, 
and in experienced temperature)—may be due 
simply to the fact that S sits quietly for a 
period of time with eyes closed. 

Although Ss in the control group mani- 
fested a few hypnotic behaviors, they differed 
from Ss in the experimental groups on most 
of the behavioral dimensions. As Table 1 
shows, the control group obtained significantly 
lower scores than either of the experimental 
groups on three of the indexes of trancelike 
appearance (limpness-relaxation, trance stare, 
and psychomotor retardation). Also, the con- 
trol group obtained lower scores than both 
experimental groups on three of the six items 
which assessed reports of unusual experiences 


TABLE 1 


Mean Scores (AND STANDARD DEVIATIONS) ON PREEXPERIMENTAL EXPECTATIONS AND ON > 
Four Drwensions or Hypnotic BEHAVIOR 


Behavioral dimensions 


Experimental groups 
Control group 


Hypnotic- | Place-yourself- 
induction in-hypnosis 
Preexperimental expectations (self-predictions) of hypnotic depth 1.5, (.9) 1.5, (.9) 
i celike Uus (3) 
impness- ation 2.2, (1.0) 1.5, (1.0) To C 
Loss of spontaneity 2.6, I 2.3, (1.0) 2.6, (.8) 
Trance stare 4 1.8, (1.2) 1.3 (1.2) .9. (.9) 
Psychomotor retardation 2.9, (1.8) 1.8, (.8) 1.3. (4) 
Response to 10 test-suggestions 5.8, (2.9) 4.8y (2.9) 
EIS of unusual parios 
terations in size of body or body parts 51, (.54 47, (.49) 36, (47) 
Changes in equilibrium 85, (35) | .76, (42) | .66. et) 
“Disappearance” of body or body parts 20, (.40) 16, (.36) 04 ay 
Feelings of unreality 69, (45) | .69, (45) | .26» (43 
Changes in experienced temperature 136, (47) 42, (48) .34, (46) 
E's voice seemed very close or far «Tla (44) .64, (47) 34, (46) 
Testimony of having been hypnotized 1.5, (1.0) 1.5, (1.0) 3p CS) 


Note.—Standard deviations are in parentheses, Means in the same row containing the same subscript letter do not differ from 


each other at the .05 level, 
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TABLE 2 


PERCENTAGE OF SUBJECTS TESTIFYING THAT THEY WERE HYPNOTIZED TO A DEEP, 
MEDIUM, LIGHT, or Less THAN LIGHT LEVEL 


Hypnotic-induction 


Place-yourself-in- Control group 


Self-ratings of hypnotic depth group hypnosis group 
Medium deep to deep (scores of 2.5-3.0) 2505 25% 0% 
Medium to medium deep (scores of 2.0-2.45) 15% 15% 2% 
Light to medium (scores of 1.0-1.95) 31% 25% 12% 
Less than light (scores of .10-.95) 22% 23% 20% 
Not hypnotized (scores of .0) 1% 11% 66% 


(“disappearance” of the body or body parts, 
feelings of unreality, and E's voice seeming 
very near or very far). (It is worth noting, 
however, Table 1, that one-fourth of the 
controls reported that “things seemed unreal” 
and one-third reported that E's voice seemed 
either very close or very far.) Also, the great 
majority of the controls rated themselves as 
not having been hypnotized whereas the great 
majority of Ss in both experimental groups 
rated themselves as having been hypnotized 
to some degree (Table 2). In summary, these 
data suggest the following proposition. 

Proposition 2. Although a few behaviors 
that have been traditionally associated with 
the word hypnosis can be attributed to the 
fact that S sits quietly for a period of time 
with eyes closed, the following hypnotic be- 
haviors cannot be clearly attributed to this 
factor: (a) trancelike appearance as indicated 
by limpness-relaxation, trance stare, and psy- 
chomotor retardation, (5) reports of certain 
types of unusual experiences such as feelings 
of unreality, and (c) testimony of having 
been hypnotized. 


How do Inexperienced Subjects “Place 
Themselves in Hypnosis?” 


The reader will recall that 55 Ss were told 
to place themselves in hypnosis and that 53 
of these had not previously participated in a 
hypnotic experiment. Postexperimentally, Ss 
in this group were asked, “How did you go 
about placing yourself in hypnosis?” Answers 
to this question could be classified into four 
categories. These categories (and the per- 
centage of Ss giving answers within each 
Category) were as follows: (a) “didn’t do 
anything” or “thought of other things" 
(1395); (b) “just relaxed,” “tried to relax,” 


or “tried to relax and to go to sleep” (57%); 
(c) tried to make my mind a blank" or “tried 
not to think of anything” (7%); and (d) 
“tried to relax” and also “tried not to think 
of anything” (22%). These data suggest two 
propositions that appear relevant to a theory 
of hypnotic behavior: 

Proposition 3. Present-day Ss who have not 
had prior hypnotic experiences have clear-cut 
preconceptions pertaining to what hypnosis is 
supposed to involve. 

Proposition 4. When asked to place them- 
selves in hypnosis, Ss who have not had prior 
hypnotic experience typically try to relax and, 
in some instances, also try not to think of 


anything. 


Comparison of Place-Y ourself-in-Hypnosis 
Group with Hypnotic-Induction Group 


Is a formal hypnotic-induction procedure 
necessary to elicit a trancelike appearance, 
high response to test suggestions, reports of 
unusual experiences, and testimony of having 
been hypnotized, or can one elicit these be- 
haviors to the same degree by simply instruct- 
ing Ss (who have not previously participated 
in hypnotic experiments) to place themselves 
in hypnosis? The data summarized in Tables 
1 and 2 indicate that the place-yourself-in- 
hypnosis group did not differ significantly 
from the hypnotic-induction group in reports 
of unusual experiences and in testimony of 
having been hypnotized and that the two 
groups differed only to a small degree in 
trancelike appearance and in response to test 
suggestions. 

Reports of unsual experiences and testi- 
mony of having been hypnotized. Table 1 
shows that Ss who were told to place them- 
selves in hypnosis did not differ from those 
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that received the formal hypnotic-induction 
procedure on any of the six items which as- 
sessed unusual experiences. The Ss in both 
groups reported the same degree of experi- 
enced alterations or disappearance of the 
body or body parts, changes in equilib- 
rium, feelings of unreality, etc. Furthermore, 
Table 1 shows that, on the average, Ss in 
the place-yourself-in-hypnosis group and also 
those in the hypnotic-induction group rated 
their depth of hypnosis as between light and 
medium (M = 1.5). Table 2, which presents 
the frequency distribution of the self-ratings 
of hypnotic depth, shows that 25% of the Ss 
in each of the experimental groups (and none 
in the control group) rated themselves as 
having been deeply or medium deeply hypno- 
tized (self-ratings of 2.5-3.0). In brief, these 
data suggest the following proposition: 

Proposition 5. With inexperienced Ss (who 
have not previously participated in hypnotic 
experiments), the statement “Place yourself 
in hypnosis” is as effective as a formal 
hypnotic-induction procedure in eliciting re- 
ports of unusual experiences and testimony of 
having been hypnotized. 

Trancelike appearance. Table 1 shows that 
the two experimental groups did not differ 
significantly on one of the indexes of trance- 
like appearance—loss of spontaneity. How- 
ever, Table 1 also shows that the hypnotic- 
induction group manifested more trance- 
like appearance than the place-yourself-in- 
hypnosis group as indicated by the remain- 
ing three indexes—limpness-relaxation, trance 
stare, and psychomotor retardation. Although 
the differences on the latter three indexes 
were statistically significant, they were not 
very large. For instance, the average time 
required to move the arm to a horizontal 
position (the measure of psychomotor retar- 
dation) was about 3 sec. for the hypnotic- 
induction group and about 2 sec. for the 
place-yourself-in-hypnosis group. It appears 
possible that the small difference on three of 
the four indexes of trancelike appearance were 
due to the following: the hypnotic-induction 
group, but not the place-yourself-in-hypnosis 
group, received repeated suggestions of relaxa- 
tion, drowsiness, passivity, and sleep. These 
suggestions may have implied to Ss that they 
were to become limp and relaxed, they were 
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not to move their eyes actively around the 
room (trance stare), and they were expected 
to carry out limb and body movements slowly 
(psychomotor retardation). Further research 
is needed to ascertain if this hypothesis is 
more valid than other possible hypotheses in 
accounting for the differences. 

Response to test suggestions. Both experi- 
mental groups were assessed on response to 
the final 10 test suggestions of the SHSS. The 
Ss exposed to the formal hypnotic-induction 
procedure typically passed 1 more test sug- 
gestion than those told to place themselves 
in hypnosis (Table 1); on the average, the 
hypnotic-induction group passed about 6 of 
the 10 test suggestions (M = 5.8) whereas 
the place-yourself-in-hypnosis group passed 
about 5 (M = 4.8). The 1-point difference in 
suggestibility between the two groups, which 
was statistically significant ( < .05), could 
have been produced by anyone of a number 
of antecedent variables that were included in 
the hypnotic-induction treatment but not in 
the place-yourself-in-hypnosis treatment. For 
instance, the former treatment but not the 
latter included (a) motivational instructions 
(“keep up your attention and interest and 
continue to cooperate. . . . Nothing will be 
done that will cause you any embarrassment. 
Most people find this a very interesting 
experience. . . .”) and (5) suggestions that 
S could now experience unusual effects (e.g. 
“You are going to experience many things 
that I will tell you to experience"). Further 
studies are needed to determine whether it 
was one or both of these antecedent variables 
or other variables that produced the 1-point 
difference in suggestibility. 

In summary, the data presented in the 
preceding two paragraphs suggest the follow- 
ing proposition: ` 

Proposition 6. A formal hypnotic-induction 
procedure is a little more effective than the 
statement “Place yourself in hypnosis” in 
eliciting a trancelike appearance and in facili- 
tating response to test suggestions. } 

There are two considerations that pertain 
to the data presented in this section and in 
the preceding one: 


1. Adults in Western culture are well 
acquainted with the traditional connotations 


Ee — ———MaÓ€ 
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of the word hypnosis (London,1961; Orne, 
1959). Their familiarity with the traditional 
lore appears to play an important role in 
determining their performance in a hypnotism 
experiment. When present-day Ss who have 
not previously participated in a hypnotism 
session are told to place themselves in hyp- 
nosis, that is, when they are asked to use 
what they have previously heard about hyp- 
nosis to structure their behavior, they typi- 
cally perform as follows: they try to relax 
and they manifest a trancelike appearance, 
show a rather high level of response to test 
suggestions, report unusual experiences, and 
testify that they were hypnotized; that is, 
they typically behave very much in the man- 
ner traditionally expected of “hypnotized” Ss. 

2. The statement “Place yourself in hyp- 
nosis” appears to be sufficient to produce 
most of the effects associated with formal 
hypnotic-induction procedures. This experi- 
mental outcome raises several questions with 
respect to previous treatises which (a) pre- 
sented elaborate or involved techniques 
for “inducing hypnosis” (Erickson, 1948; 
Weitzenhoffer, 1957), or (b) derived elabo- 
rate theoretical formulations from complex 
manipulations involved in formal hypnotic- 
induction procedures (Gill & Brenman, 1959; 
Kubie & Margolin, 1944). The present data 
appear to make it incumbent upon those who 
emphasize the importance of the manipula- 
tions involved in formal hypnotic-induction 
procedures first to demonstrate rigorously 
what specific effects are produced by 
hypnotic-induction procedures that are not 
produced by the simple statement “Place 
yourself in hypnosis” and then to delineate 
empirically which of the many variables in- 
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cluded in formal hypnotic-induction pro- 
cedures are relevant and which extraneous in 
producing each of the effects. 


Preexperimental Expectations as Predictors of 
Hypnotic Performance 


With respect to both the hypnotic-induction 
and the place-yourself-in-hypnosis groups, 
there were small positive correlations between 
Ss’ preexperimental expectations (self-predic- 
tions) of hypnotic depth and the degree to 
which they responded to test suggestions, re- 
ported unusual experiences, and testified post- 
experimentally that they were hypnotized 
(Table 3, Row 1). The correlations between 
preexperimental expectations and trancelike 
appearance were also in the positive direction 
but were very small and nonsignificant. 

The small but significant positive correla- 
tions between preexperimental expectations 
and response to test suggestions confirms pre- 
vious results presented by Melei and Hilgard 
(1964), Dermen and London (1965), and 
Barber and Calverley (1966). Since previous 
studies had not attempted to relate pre- 
experimental expectations to reports of un- 
usual experiences or to testimony of having 
been hypnotized, the small but significant 
positive correlations obtained in the present 
study between these variables constitute new 
findings. These data suggest the following 
proposition which is relevant to a theory of 
hypnotic behavior: 

Proposition 7. The degree to which Ss par 
ticipating in a hypnotic experiment respond to 
test suggestions, report unusual experiences, 
and testify that they were hypnotized is 
related, in part, to their preexperimental 


TABLE 3 


INTERCORRELATIONS AMONG FivE VARIABLES 


ERIMENTAL EXPECTATIONS AND 


(eek 
Four Dimensions OF "HYPNOTIC BEHAVIOR) 


Reports of un- | Testimony of 
R te H " 
Behavioral dimensions Traneelike | | Response fona | usual experi! | having been 
Preexperimental expectations of hypnotic depth 10 (.21) .33 (.40) .23 (.43) Er (.45) 
Trancelike appearance 48 (.31) ie (ao A (oh 
Response to test suggestions .63 (46) * 


eports of unusual experiences 


Note,—In each cell the correlations for the hypnoti 
in-hypnosis group is listed next in parentheses. Correlation: 


ic-induction group is listed first and the correlation for the place-yourself- 
is of .27 and above are significant at the .05 level. 
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expectations as to how deeply they will be 
hypnotized. 


Antecedent Variables Determining Subjects’ 
Testimony of Having Been Hypnotized 


Situations commonly labeled as hypnotic 
typically include many denotable antecedent 
variables. Which of the antecedent variables 
are important and which irrelevant in deter- 
mining Ss’ testimony that they were hypno- 
tized? The data obtained in the present 
experiment throw some light on this question. 

First, Table 3 shows that, for both experi- 
mental groups, Ss’ preexperimental expecta- 
tions played a role in determining whether 
they would testify postexperimentally that 
they were hypnotized. Second, the final 
column of Table 3 shows that, for both ex- 
perimental groups, tesitmony of having been 
hypnotized was positively correlated with each 
of the other three dimensions of hypnotic 
behavior—trancelike appearance, response to 
test suggestions, and reports of unusual 
experiences, 

However, of the Ss who obtained above 
average scores on two or all three of the other 
behavioral dimensions, only half rated them- 
selves as having been deeply hypnotized. Why 
did half of these Ss rate themselves as not 
deeply hypnotized and the other half rate 
themselves as deeply hypnotized? An answer 
to this question was provided by the following 
data. Immediately after Ss had rated their 
“depth” they were asked, “What basis did 
you use to judge your depth of hypnosis?” 
The Ss who had obtained above average 


scores on two or all three of the other behav- 
ioral dimensions (trancelike appearance, re- 
sponse to test suggestions, and reports of 
unusual experiences) and had also rated their 
depth as “deep” typically stated that they 
judged their level as deep because they had 
) responded positively to the test suggestions 
(for hand lowering, finger lock, hallucination, 
eye catalepsy, etc.). Apparently, these Ss per- 
ceived “deep hypnosis” as involving high 
suggestibility. The remaining Ss, who rated 
their depth as “light” or “medium” even 
though they had obtained average scores on 
at least two of the three remaining behavioral 
dimensions, gave one or more of the follow- 
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ing reasons why they believed they had not 
entered deep hypnosis: they were aware of 
what they were doing, were aware of their 
surroundings, could think of extraneous 
things, could hear extraneous sounds, or did 
not have complete amnesia for the session, 
Apparently, these Ss did not rate their level 
as deep because their experiences did not 
dovetail in all respects with their preconcep- 
tions of what deep hypnosis is supposed to 
involve. 

A recent study by Barber, Dalal, and 
Calverley (1968) also isolated an additional 
antecedent variable that is related to Ss’ 
testimony of having been hypnotized. The 
latter study demonstrated that Ss’ self- 
ratings of their hypnotic depth are influ- 
enced by the E’s explicit or implicit state- 
ments that he believed they were or were not 
hypnotized. 

In summary, the data of the present in- 
vestigation, taken together with the data re- 
cently presented by Barber et al. (1968), 
indicate the following proposition: 

Proposition 8. Whether Ss who have par- 
ticipated in a hypnotic experiment testify 
postexperimentally that they were hypnotized 
to a deep, medium, light, or less than light 
level depends on at least four sets of ante- 
cedent variables: (a) Ss’ preexperimental con- 
ceptions of what hypnosis is supposed to 
involve; (b) their preexperimental expecta- 
tions as to how deeply they will be hypno- 
tized; (c) the degree to which, during the 
experiment they responded to test sugges- 
tions, manifested a trancelike appearance, and 
reported that they had unusual experiences; 
and (d) whether E stated or implied that he 
thought they were or were not hypnotized. 

The authors believe that the data presented 
in the preceding sections have the following 
implications: (a) Many denotable antecedent 
variables appear to play a role in determining 
whether Ss will testify that they were or were 
not hypnotized; (5) much further work is 
needed to delineate the main effects and the 
interactive effects of all the important ante- 
cedent variables that influence Ss’ testimony; 
and (c) it remains to be determined if Ss 
testimony that they were or were not hypno- 
tized is also related to another antecedent 
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factor—namely, whether or not they actually 
experienced an altered state of consciousness 
(hypnosis or hypnotic trance). 


Interrelations of Four Behavioral Dimensions 


It has been noted that there were small 
positive correlations between Ss' preexperi- 
mental expectations and each of the four sets 
of dependent variables (dimensions of hyp- 
notic behavior) and also between one of the 
four dimensions (testimony of having been 
hypnotized) and the remaining three dimen- 
sions. Now look at the relationships among 
all four behavioral dimensions. Table 3, which 
presents the correlations for the two experi- 
mental groups, shows that there were signifi- 
cant positive intercorrelations (ranging from 
.28 to .75) among each of the four sets of 
dependent variables. These data indicate 
a general proposition that could well be 
included in a general theory of hypnotic 
behavior. 

Proposition 9. In a hypnotic situation, Ss 
who are responsive on one of the dimensions 
of hypnotic behavior (trancelike appearance, 
response to test suggestions, reports of un- 
usual experiences, or testimony of having been 
hypnotized) also tend to be responsive on the 
other three dimensions. 

Although the correlations among the four 
dimensions are statistically significant, the 
small to moderate magnitudes of the correla- 
tion coefficients (which typically account for 
about one-quarter of the variance and in no 
case account for more than 56% of the vari- 
ance) indicate that a substantial proportion 
of Ss who obtained high scores on one of the 
dimensions did not obtain high scores on the 
Other dimensions. This outcome, indicating 
that response on each of the behavioral di- 
mensions is at least somewhat independent of 
response on the other dimensions, has the fol- 


" Although the intercorrelations among the behav- 
ioral dimensions were significant for both the 
hypnotic-induction and the place-yourself-in-hyp- 
nosis group, they were not significant for the 
control group, With respect to the control group, 
the correlations were as follows: —.07, between 
trancelike appearance and reports of unusual experi- 
ences; —.06, between trancelike appearance and 
testimony of having been hypnotized; and .17, be- 
‘tween reports of unusual experiences and testimony 
of having been hypnotized. 
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lowing implications: It may be fruitful, at the 
present time, for behavioral scientists (a) to 
distinguish the four behavioral dimensions 
conceptually and to treat each experimentally 
as a separate dependent variable, (5) to use 
cautiously such terms as hypnotized, hypnotic 
state, and trance which indiscriminately 
refer to, and are themselves indiscriminately 
inferred from, all four dimensions (Barber, 
1964, 1969; Chaves, 1968), and (c) to de- 
lineate experimentally the critical antecedent 
variables that determine high and low per- 
formance on each of the four dimensions. 
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EFFECT OF PRAISE AND CENSURE ON THE 
PERFORMANCE OF SCHIZOPHRENICS * 
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Poor premorbid schizophrenics were shown to be primarily motivated to 
avoid censure and good premorbids to be relatively more sensitive to praise. 
It was demonstrated that when censure was used on a task with only two 
possible responses, poor premorbids perform better than good premorbids. 
The opposite relationship was demonstrated for the effects of praise, to which 
the good premorbids were more responsive. On a task with many responses 
the praise or censure gave little information as to the correct response. On this 
task, censure was demonstrated to be relatively more disruptive to the poor 
premorbids than praise, but the opposite to hold for the good premorbids. 
Comparisons with neutral conditions and normal Ss were also made. 


Data are not in agreement with regard to 
the effects of praise and censure on the per- 
formance of schizophrenics. In some studies, 
it was found that schizophrenics performed 
more poorly under censure than under neutral 
conditions (Alvarez, 1957; Dunham, 1959; 
Garmezy, 1952; Neiditch, 1963; Webb, 
1955). Other investigators have found, how- 
ever, that schizophrenics performed better 
under censure than under neutral conditions 
(Atkinson & Robinson, 1961; Cavanaugh, 
1958; Cavanaugh, Cohen, & Lang, 1960; Ol- 
son, 1958). In some studies schizophrenics 
when praised did not show an increment in 
performance in comparison to their per- 
formance under neutral conditions (e.g. 
Cavanaugh et al., 1960); although in a few 
studies their performance did improve under 
praise (e.g., D’Alessio & Spence, 1963). 

Review of the literature suggests that the 
level of premorbid adjustment and the nature 
of the task can account for the differential 
performance shown by schizophrenics under 
praise and censure. Garmezy (1965) postu- 
lated that good premorbid schizophrenics are 
less sensitive to censure than poor premorbid 
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schizophrenics; also, on the basis of life his- 
tory data (Phillips, 1953), one would expect 
to find differences in sensitivity to praise and 
censure between good and poor premorbid 
schizophrenics. Poor premorbids begin to 
avoid social contact in early childhood, while 
good premorbids continue to participate in 
social activities throughout their premorbid 
period. Thus, the premorbid adjustment of 
poor premorbids is predominately one of 
avoiding social contacts, while that of good 
premorbids is patterned around an approach 
to others. 

The nature of the task may also be im- 
portant. Atkinson and Robinson (1961) and 
Johannsen (1961) postulate that the schizo- 
phrenic’s reaction to censure is dependent on 
the number of ways available to avoid cen- 
sure in the situation. However, the findings of 
some studies are contradictory to this inter- 
pretation (Fischer, 1963; Goodstein, Guertin, 
& Blackburn, 1961; Losen, 1961). The im- 
portant factor may not be the number of ways 
available to avoid censure, but rather the 
amount of information given S concerning the 
means to avoid it. Thus, in a simple situation 
where only two responses are possible, cen- 
sure should provide complete information 
about what is the correct response and the 
most efficient way to avoid censure is to give 
the correct response. However, on a complex 
task where many responses are possible, cen- 
sure would not give information and should 
produce task irrelevant responses. 

The present study was designed to test 
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explicitly this interpretation of past research. 
Specifically, on a simple task censure should 
improve performance for poor premorbid 
schizophrenics who are primarily motivated to 
avoid censure, while good premorbids should 
show some sensitivity to praise under these 
conditions. On the other hand, in a complex 
task where the praise or censure does not 
provide information as to the correct response, 
task irrelevant responses should appear and 
the schizophrenic deficit be found. Under 
these conditions the poor premorbid schizo- 
phrenics should show a greater decrement in 
performance under censure than good pre- 
morbids, but the good premorbids should be 
relatively more sensitive to the effects of 
praise. In analysis of variance terms this is a 
prediction of a triple interaction of Level of 
Premorbid Adjustment X Difficulty of the 
Task X Effects of Praise or Censure. This 
interpretation is consistent with the reported 
empirical results on schizophrenic perform- 
ance and resolves most of the apparent con- 
tradictions. 
METHOD 
Subjects 


The Ss were 36 normals, and 36 good and 36 poor 
premorbid schizophrenics. All Ss were selected in 
terms of the following criterion: (a) males between 
20 and 55 yr. of age; (b) with WAIS vocabulary 
scores of at least 6; (c) with no known pathology 
of the central nervous system; (d) with no known 
history of alcoholism; and (e) not undergoing shock 
therapy (although some Ss had probably received 
shock treatment at some time in the past). The 
normal Ss were chosen from the population of male 
employees of Embreeville State Hospital, who had 
no history of a neuropsychiatric illness. Good and 
poor premorbid Ss were chosen from diagnosed 
schizophrenics on the basis of their scores on the 
Phillips Scale of Premorbid Adjustment in Schizo- 
phrenia (Phillips, 1953). Following Garmezy and 
Rodnick (1959) the ratings of premorbid adjust- 
ment were based on an interview rather than case 
history data. The range of possible scores was 0- 
30, a high score indicating poor premorbid adjust- 
ment. A score of 18 and above or 13 and below 
were required in order to be included in the sample. 
The reliability of the ratings was determined by hav- 
ing an independent rating made on a tape recording 
of 20 of the interviews; this yielded a correlation 
coefficient of .86. 

Half of the Ss of the normal, good, and poor 
premorbid groups were assigned to the low-com- 
plexity condition, and half to the high-complexity 
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condition. All Ss were run under a neutral, a praise, 
and a censure condition. 

The 36 good premorbids consisted of 27 para- 
noid, 3 catatonic, 2 simple, 2 chronic undifferenti- 
ated, and 3 acute undifferentiated schizophrenics, 
The 36 poor premorbids included 22 paranoid, 3 
catatonic, 3 hebephrenic, 3 simple, and 5 chronic 
undifferentiated schizophrenics. The mean age at first 
hospitalization of the good premorbids was 32.1 yr, 
and for the poor premorbids 23.8 yr. The mean 
length of total hospitalization was 30 mo. for the 
good premorbids and 70 mo. for the poor premor- 
bids. Age of first hospitalization and length of hos- 
pitalization were significantly different (p < .01), as 
is typically the case for good and bad premorbid 
schizophrenics (e.g, McInnis & Ullmann, 1967). 


Task 


The stimuli consisted of 144 five-letter words, 
which occur with equal and high frequency in the 
English language (Thorndike & Lorge, 1944). Each 
word was typed on a white cardboard square which 
was placed behind a cellophane window of a 2-in. 
square coin holder. Either two or six of these coin 
holders, each displaying 1 word in each window, 
were mounted on the blank pages of a scrap book 
to form six books each containing eight pages. The 
S was required to guess and remember which of the 
words on each of the eight pages had been arbi- 
trarily chosen by E to be the correct response for 
that page. Complexity was varied by presenting Ss 
in the low-complexity condition with 2 words so 
that they had to eliminate only one alternative in 
order to learn the correct response. In the high- 
complexity condition, however, each page contained 
6 words. In this condition, therefore, S had to 
learn which of the six alternatives was the correct 
response for each of the eight pages. Two pilot 
studies were run to obtain three sets of words of 
comparable difficulty for each of the complexity 
conditions. The order of presentation of these three 
sets of words was counterbalanced for each of the 
subgroups so that each set appeared in the neutral, 
praise, and censure positions an equal number of 
times. 


Reinforcement 


In the neutral condition, Ss were given no social 
reinforcement. In order for them to obtain feedback 
about the correctness of their responses, the col” 
holders were mounted on the pages by means of 
hinges of cellophane tape. The correct word ap- 
peared in the space behind the coin holder as Wi 
as on its face, while the spaces behind the holders 
displaying the incorrect words were blank. There- 
fore, after S announced his guess, he could Dum 
behind the holder displaying the word to see if 
was correct. 

In the praise and censure conditions, Ss Were 
given information about the correctness of p^ 
guess by means of two boxes, each measuring - S 
8 X 8 in, placed side by side on a table at which 
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was seated and located 14 ft. in front of and facing 
S. The boxes could be illuminated with the word 
"right" appearing on the panel of one and the word 
“wrong” on the panel of the other. A failure to 
respond was treated as an incorrect response. In 
the instructions for the neutral conditions, Ss were 
given only the task requirements. However, the 
instructions stressed the importance of being right 
in the praise condition and of not being wrong in 
the censure condition. 


Procedure 


All data were collected by the first author who 
conversed with each S for a few minutes to permit 
S to become acclimated to the experimental situa- 
tion. In the first session all Ss were run under the 
neutral condition. In a second session, 2 days after 
the first, Ss learned one set of words under praise and 
one set of words under censure, the two orders 
counterbalanced throughout the design. There was a 
15-min. break between the two sets and during this 
period Ss were given either a cup of coffee or glass 
of coke depending on their preference. 

The Ss were allowed to spend 10 sec. on each 
page. After they announced their guess and checked 
to see if they were correct in the neutral condition 
they were then told to turn to the next page. If Ss 
failed to make a guess by 7 sec. they were told that 
their time was nearly up. Each run through the 
eight pages of the book was designated as 1 trial. 
The Ss were run until they met the criterion of 1 
perfect trial or until they had completed 8 trials on 
the simple task or 15 trials on the complex task. 


RESULTS 
Performance of Schizophrenic Subjects 


The major interest focuses on the relative 
performance of the schizophrenic Ss under 
praise and censure on the simple and complex 
task. The data are illustrated in Figure 1. 
Three terms of the analysis of variance 
reached significance. Of principle interest, 
however, is the predicted triple interaction; 
F=4.58, df = 1/68, p < .05. The interac- 
tion resulted from the good premorbids per- 
forming relatively better than the poor pre- 
morbids on a simple task under praise but 
with the poor premorbids performing rela- 
tively better under censure. However, on the 
complex task the poor premorbid schizo- 
phrenics made relativley fewer errors under 
praise than under censure, as compared with 
the good premorbids. The nature of the triple 
interaction is in the predicted direction. 

Overall, the good premorbid schizophrenics 
Performed better than the poor premorbid 
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REINFORCEMENT CONDITION 


Fic. 1. Mean number of errors for good and poor 
premorbid schizophrenics under praise (P) and 
censure (C) on the simple (left panel) and the 
complex (right panel) task. 


schizophrenics, F = 4.87, df=1/68, p< 
05. The fact that this better performance 
occurred primarily on the complex task, as 
shown in Figure 1, did not yield a significant 
interaction between level of premorbid ad- 
justment and task complexity, F = 3.86, df 
= 1/68, p < .10. 

The third significant term resulted from the 
fact that more errors were made on the com- 
plex than the simple task as would be ex- 
pected, F = 142.78, df = 1/68, p < .001. 


Comparison with Normal Subjects 


Of secondary interest is a comparison of 
the performance of schizophrenics with the 
performance of normals which allows an eval- 
uation of the relative deficit under the various 
conditions. On the simple task there was no 
significant schizophrenic deficit between the 
normals and poors, F = 1.77, df = 1/34, or 
between the normals and the goods, F = .04, 
df = 1/34. In fact the goods actually made 
fewer errors under praise than the normals 
and the poors fewer errors under censure than 
normals as may be seen in Table 1, although 
averaged over the conditions the normal Ss 
were slightly superior. The picture is radically 
changed, however, on the complex task where 
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the schizophrenic deficit was expected to show 
itself. The normals performed better than the 
good premorbids, F = 6.25, df = 1/34, p< 
05; and the special sensitivity of the poor 
premorbids to censure was reflected in the 
large deficit they showed on the complex task, 
F = 25.58, df = 1/34, p < .001. 


Comparison of Neutral with Social 
Reinforcement 


Of additional interest is the within-Ss com- 
parison of the neutral condition with praise 
and censure for the three groups. The normals 
did not change their performance on the 
simple task when either praise or censure 
were introduced with ¢ (17) =.155 and £ 
(17) = .600, respectively. This may be con- 
trasted with the fact that poor premorbids on 
the simple task made significantly less errors 
under censure than under neutral conditions 
with ¢ (17) = 1.854, p < .05, one-tailed, but 
not under praise where ¢ (17) = 1.390, p= 
ns. On the other hand, good premorbids per- 
formed significantly better under praise than 
under neutral conditions with t (17) = 1.949, 
p< .05, one-tailed, but not under censure 
where £ (17) = .073, p = ns. 

On the complex task, however, there was a 
uniform finding of better performance under 
both praise and censure than under neutral 
conditions for all groups. This uniformity of 
better performance under conditions of social 
reinforcement than under the neutral condi- 


TABLE 1 


MEANS AND STANDARD DEVIATIONS OF THE NUMBER OF 
ERRORS UNDER PRAISE AND CENSURE MADE BY THE 
NORMALS AND BY THE Goop AND BAD PREMORBID 
SCHIZOPHRENIC SUBJECTS ON THE SIMPLE AND 
Comprex TASK 


Reinforcement condition 


Task Neutral Censure 


Simple 
Normals 13.3 9.. 
Good premorbids | 17.3 | 10. 
Poor premorbids | 20,4 | 10. 
Complex 
Normals 46.5 
Good premorbids | 58.7 
Poor premorbids | 76.8 
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tion was contrary to expectations. Thus, on 
the complex test the schizophrenic decrement 
in performance, which was quite large in some 
instances, occurred only in comparison with 
normal Ss but not in comparison with the 
schizophrenic's own performance under the 
neutral condition, Thus, even on the complex 
task the social reinforcement facilitated per- 
formance over neutral conditions. 


Discussion 


The results confirmed the principal expec- 
tations. Normal Ss reacted similarly to praise 
and censure, whereas schizophrenics responded 
differentially to these reinforcers. The per- 
formance of schizophrenics under praise and 
censure was dependent both upon their pre- 
morbid adjustment and on the difficulty of 
the task. Poor premorbids showed facilitation 
on the simple task and interference on the 
complex task under censure in comparison to 
their performance under praise. This per- 
formance difference reflected their presumed 
strong motivation to avoid censure, which in 
the case of the simple task could be accomp- 
lished most efficiently by giving the correct 
answer, but on the complex task, by schizo- 
phrenic withdrawal which produced the large 
decrement under censure relative to the ef- 
fects of praise. On the other hand, good pre- 
morbids showed the opposite relationship, 
reflecting their responsiveness to praise and 
their lower sensitivity to censure. 

Similar findings for a simple task were 
reported by McInnis and Ullmann (1967) 
who obtained an interaction between re- 
sponsiveness to positive and negative rein- 
forcement by short- and long-term hospital- 
ized schizophrenics. The direction of the dif- 
ferences corresponded perfectly with the find- 
ing of greater responsiveness to censure by 
poor premorbids and to praise by good pre- 
morbids on a simple task. As noted previ- 
ously, in both studies the two schizophrenic 
groups differed on both length of hospitaliza- 
tion and premorbid history. The analysis fo- 
cusing on premorbid history is supported by 
the differential sensitivity obtained in both 
studies. McInnis and Ullmann's primary 1n- 
terest was in length of hospitalization. They 
expected, and found, short-term hospitaliza- 
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tion to be related to greater sensitivity to 
both positive and negative reinforcement. In 
this study, no such main effect was found for 
the poor-good premorbid dimension on the 
simple task. This failure to confirm their re- 
sults is not surprising, however, for in this 
study good premorbids had an average hos- 
pitalization which was nine times longer than 
the short-term schizophrenics in the McInnis 
and Ullmann study. Nearly all of the Ss in 
this study would be regarded as long term by 
their standards. Thus, long-term hospitaliza- 
tion may reduce responsiveness to both posi- 
tive and negative reinforcement, but differ- 
ential sensitivity seems to be a function of 
premorbid history. 

The second problem of theoretical interest 
is the evaluation of the absolute level of 
performance by comparing the performance 
of schizophrenics with that of normals. This 
is a between-Ss comparison rather than the 
within-Ss comparison used in the preceding 
section to evaluate the relative effects of praise 
and censure. This second comparison offers 
further support for the hypothesis that schizo- 
phrenic deficit is of a motivational nature. 
When incentives appropriate to the motiva- 
tions of the two types of schizophrenics were 
used on a task where the motivation would 
result in a correct answer the difference be- 
tween normals and schizophrenics was mini- 
mal with the schizophrenic performance 
slightly better. On the other hand, there were 
large differences between the performance of 
normal and schizophrenic Ss on the complex 
task when the motivation was more likely to 
produce schizophrenic withdrawal rather than 
the correct answer. The consideration of task 
difficulty along with premorbid history can 
thus account for the fact that in previous 
studies schizophrenics sometimes did better 
and sometimes worse than normals under 
Praise and censure. 

A final observation of interest is the fact 
that schizophrenics did not show a decrement 
in performance under praise and censure on 
the complex task relative to their owm per- 
formance under neutral conditions, This find- 
ing was unexpected. It must be remembered, 
however, that all Ss were run in the neutral 
condition first and were treated kindly by E. 
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It may be that these Ss had more confidence 
in E and were more at ease during the second 
session, or simply that a practice effect pro- 
duced the absolute differences. The neutral 
condition was always presented first in order 
to insure a more sensitive within-Ss compari- 
son of the relative effects of praise and cen- 
sure. This particular design was chosen be- 
cause the neutral versus reinforcement com- 
parison was regarded as the least important. 
The results tentatively support the suggestion 
that under experimental conditions which put 
the schizophrenic patient at ease he will re- 
spond to social reinforcement. Such a finding 
is consistent with a motivational interpreta- 
tion of schizophrenic deficit, which exists in 
an absolute sense in comparison to normals, 
but not in comparison to his own perform. 
ance under neutral conditions. Such an hy 
pothesis is difficult to evaluate, however 
without an independent measure of whether 
E has enlisted the cooperation and trust of 
the schizophrenic patient. Such trust may well 
be a necessary condition for E to become 
effective as a reinforcing agent for the schizo- 
phrenic. These observations must remain sug- 
gestive, however, for the design was not ap- 
propriate or intended to give an unambiguous 
answer on these issues. 
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VISUAL MONITORING PERFORMANCE IN THREE 
SUBGROUPS OF MALE DELINQUENTS * 


JAMES BURDEANE ORRIS? 
University of Illinois 


A questionnaire measuring psychopathic, neurotic, and subcultural delinquency 
was administered to 268 male delinquents. Three experimental groups of 15 
boys each were formed from Ss who scored above the mean on one scale and 
below the mean on the other two. It was predicted that the more psychopathic 
Ss would show a greater performance decrement on a vigilance task. A signifi- 
cant effect was found between experimental groups and a significant decrement 


over trials for both signals detected 
performed consistently poorer than the 


The present study is an investigation into 
the nature of the psychopathy dimension of 
delinquency identified in previous research 
(e.g, Peterson, Quay, & Cameron, 1959; 
Quay, 1964). It was hypothesized that more 
psychopathic (P) Ss would show a perform- 
ance decrement relative to more neurotic (N) 
and more subcultural (S) Ss on a task re- 
quiring continuous attention. This prediction 
was made on the basis of a theoretical ap- 
proach (Quay, 1965) to psychopathy which 
postulates that psychopaths are pathological 
stimulation seekers and are highly susceptible 
to boredom, thus likely to exhibit a decre- 
ment in monitoring efficiency because of a 
tendency to look elsewhere for added stimu- 
lation. 

The classical vigilance experiment (e.g. 
Mackworth, 1961) in which S has to watch 
for and report transient signals presented 
against a background of neutral signals was 
chosen as the experimental task. 


METHOD 


Subjects. A factor analytically derived question- 
naire measuring the P, N, and S dimensions of de- 
linquency (Quay & Peterson, 1964) was used to 
select Ss. From a sample of 268 institutionalized 
male delinquents three experimental groups of 15 Ss 
each were formed from boys who scored above the 
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Philadelphia, Pennsylvania 19122. 


and reaction times; psychopathic Ss 
other two groups. 


mean on one scale and below the mean on the other 
two. 

Apparatus, The apparatus was a visual monitoring 
display consisting of a voltmeter with the dial re- 
versed so that the pointer would be seen against a 
white background. The pointer was deflected from 
resting position to middial position at a regular rate 
of one deflection per second. The critical signal was 
a deflection of the pointer approximately three- 
fourths of an inch greater than the standard halfway 
deflection. The Ss responded to the signal by press- 
ing a hand-held microswitch. The experimental room 
was dark except for a small light used to illuminate 
the display. A fan was used to produce a constant 
noise which masked any extraneous sounds. 

In an adjoining room was a timer which started 
automatically whenever a signal was presented to S 
and which stopped whenever 5 made a response. The 
timer could be stopped by E if S did not respond 
and was reset by E after each signal. 

Procedure. Each S participated in a 100-min. 
practice session in the morning and an identical 
100-min. criterion session in the afternoon; sessions 
were divided into five 20-min. trials for statistical 
analysis. Ten signals were presented during each 
trial; intersignal intervals from 20 sec. to 240 sec. 
were arranged in a different random order for each 
of the five trials. 

Response measures were both number of signals 
detected and reaction times. If S responded within 
4 sec, E recorded his reaction time; otherwise a 
“miss” was recorded. 


RESULTS 


Figure 1 presents data for the criterion 
session with number of signals detected per 
trial as the performance measure. There was a 
significant effect for the three experimental 
groups, F = 4.36, df = 2/42, p < .05, and a 
significant effect’ over trials, F = 8.40, df= 
4/168, p< .01. The interaction between 
groups and trials was not significant, nor 
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Fic. 1, Mean number of signals detected per trial during criterion session. 


were the groups significantly different on the 
first trial. 

Criterion sessions using reciprocal latency 
as the performance measure are presented in 
Figure 2. The effect for groups, F — 5.03, df 
= 2/42, p < .05, and for trials, F = 11.60, 
df = 4/168, p< 01, were both significant, 
while the interaction between groups and 
trials was not significant; the groups were 
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significantly different on the first trial, F = 
3.39, df = 2/42, p < .05. Since the N and S 
Ss were essentially identical, this difference 
was due to the poorer performance of the P 
Ss. 


Discussion 


While, as predicted, the more psychopathic 
Ss exhibited a significant vigilance decrement, 
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Fic. 2. Mean reciprocal latency per trial during criterion session. 
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the lack of a significant interaction between 
groups and trials indicated that the groups 
did not adapt or habituate to the setting at 
significantly different rates. Examination of 
the performance curves reveals that the mean 
scores for the P group were below the other 
groups for all trials. The difference between 
groups was significant even on the first trial of 
the criterion session for the reciprocal latency 
scores, but not for number of signals de- 
tected. Thus the vigilance decrement oc- 
curred early and was maintained throughout. 

Clinically it was observed that the P group 
engaged in more self-stimulation, such as 
singing and talking, during the course of the 
experiment than did the other two groups. 
Such observations should be quantified in sub- 
sequent research. 


REFERENCES 


MackwonrH, N. H. Researches on the measurement 
of human performance. In H. W. Sinaiko (Ed.), 
Selected papers on human factors in the design and 
use of control systems. New York: Dover, 1961. 

Peterson, D. R, Quay, H. C., & Cameron, G. R. 
Personality and background factors in juvenile 
delinquency as inferred from questionnaire re- 
sponses. Journal of Consulting Psychology, 1959, 
23, 395-399, 

Quay, H. C. Personality dimensions in delinquent 
males as inferred from the factor analysis of be- 
havior ratings. Journal of Research in Crime and 
Delinquency, 1964, 1, 33-37. 

Quav, H. C. Psychopathic personality as pathological 
stimulation seeking. American Journal of Psychi- 
atry, 1965, 122, 180-183. 

Quay, H. C. & Peterson, D. R. The questionnaire 
measurement of personality dimensions associated 
with juvenile delinquency. Unpublished manu- 
script, University of Illinois, 1964. 


(Received October 27, 1967) 


Journal of Abnormal Psychology 
1969, Vol. 74, No. 2, 230-236 


BEHAVIORAL AND PHYSIOLOGICAL CHANGES DURING 
PROLONGED IMMOBILIZATION PLUS 
PERCEPTUAL DEPRIVATION * 


JOHN P, ZUBEK,? L. BAYER, S. MILSTEIN, anp JEAN MARY SHEPHARD 
University of Manitoba 


The Ss who successfully completed 1 wk. of immobilization plus perceptual 
deprivation (IPD group) showed a greater slowing of occipital EEG activity, 
and a poorer performance on a battery of intellectual and perceptual-motor 
tests than did Ss exposed to a similar duration of either immobilization (I 
group) or a recumbent control condition (RC group). During the 1-wk. 
period, the IPD group also showed a significant increase in urinary excretion 
of noradrenaline, but not of adrenaline, relative to the I and RC groups. No 
significant differences were observed on behavioral measures of subjective 


stress and mood. 


In a series of experiments from the Mani- 
toba laboratory (Zubek, 1968; Zubek & Mac- 
Neill, 1966; Zubek & Wilgosh, 1963), it has 
been shown that Ss whose level of kinesthetic- 
proprioceptive stimulation was reduced by 
immobilization but who otherwise were ex- 
posed to a normal and varied sensory environ- 
ment for a week exhibited a variety of cog- 
nitive and perceptual-motor deficits, unusual 
subjective phenomena, a slowing of the elec- 
trical activity of the brain, a reduced urinary 
excretion of catecholamines (adrenaline and 
noradrenaline), and no changes in subjective 
stress and mood. However, a comparison of 
the results of the immobilized Ss with those 
of control Ss revealed that most of the effects 
were associated with either the recumbent po- 
sition or with a combination of recumbency 
and restricted motor activity. Since only a few 
of the changes resulted solely from immobili- 
zation, the purpose of the present study was 
to determine whether increasing the severity 
of the experimental condition by the addi- 
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tional restriction of visual and auditory stim- 
ulation (perceptual deprivation) produces 
greater behavioral and physiological changes 
than immobilization alone. 


METHOD 
Subjects 


A group of 30 male university students were 
placed individually in a “coffnlike” box, located 
inside a soundproofed room, for a prescribed period 
of 1 wk. The S was requested to lie in the box with 
his head in a padded head holder and his feet in a 
set of V-shaped restraining holders. The arm, legs, 
and trunk were immobilized by straps fastened to 
the base of the box and kept sufficiently loose so as 
not to interfere with circulation. To minimize possi- 
ble pain and physical discomfort, Ss were unstrapped 
periodically, for example, for 15 min. at mealtimes 
and for approximately 70 min. in the afternoon (for 
testing and washroom purposes). They were also 
unstrapped for 9 hr. during the night but were not 
allowed to stand up or sit up (for details of the 
immobilization box and procedure see Zubek & Mac- 
Neill, 1966). In addition to the restriction on motor 
activity, Ss constantly wore a white face mask 
(except during the periodic test administrations) 
which permitted diffuse light (20 ftc. under the 
mask) but eliminated pattern vision and were €X- 
posed to white noise, somewhat above the 
threshold of hearing, produced by a speaker located 
behind S's head. The overhead lights and the white 
noise were kept on during the entire 1-wk. period. 
Conversation between S and E was kept to a mini- 
mum. At least one E was on duty at all times t0 
take care of such needs as feeding, occasional adjust- 
ments of restraining straps, etc. A moderate degree 
of unavoidable social interaction existed, therefore, 
between S and E, more so than normally occurs mn 
the usual perceptual deprivation experiment. 
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Because of the severity of the experimental con- 
dition, only 12 of the 30 Ss, the successfuls, were 
able to endure the 1-wk. period. The remainder, the 
quitters, terminated the experiment after a mean 
duration of 33.4 hr.; 13 within the first day and 
the remaining 5 Ss by the end of the fourth day? 


Measures 


Intellectual battery. This battery, developed by 
Zubek, Sansom, and Prysiazniuk (1960), consists of 
11 different tests of a primary mental ability nature, 
with eight equivalent forms per test, measuring 
numerical reasoning, arithmetic problems, abstract 
reasoning, verbal reasoning, verbal fluency, space 
relations, digit span, recall, recognition, cancellation 
of numbers, and dexterity. This battery was ad- 
ministered before and at daily intervals during the 
l-wk. experimental period. Total administration 
time was approximately 50 min. 

Perceptual-motor tests. Two tests, namely depth 
perception and size constancy, were administered 
before and after the 1-wk. period. Depth perception 
was measured by the Howard-Dohlman apparatus 
and size constancy by means of a black, equilateral 
triangle, of variable height, presented at a distance 
of 15 ft. The S was required to adjust the height of 
this triangle until it looked the same in size as the 
near one, the standard triangle (for further pro- 
Cedural details see Zubek & MacNeill, 1966). These 
two tests were employed because earlier research has 
indicated that neither performance measure is 
affected by either a week of immobilization (Zubek 
& MacNeill, 1966; Zubek & Wilgosh, 1963) or 1 to 
2 wk. of perceptual deprivation (Zubek, 1964; Zu- 
bek, Aftanas, Hasek, Sansom, Schludermann, Wil- 
gosh, & Winocur, 1962). 

EEG activity. Electroencephalograms were taken 
from 9 of the 12 successful experimental Ss before 
and after the 1-wk. period. The records were taken 
at the same time of the day with precautions to 
eliminate any drowsiness. The tracings were taken 
on an Offner eight-channel, Type T, machine. Six- 
teen Ediswan monopad electrodes were applied to 
the head in International Federation 10/20 posi- 
tions and two S.L.E. suction electrodes to the ears. 
Average, bipolar, and monopolar recordings were 
taken. In order to obtain a quantitative measure of 
EEG changes, the mean occipital lobe frequency of 
each S, before and after the 1-wk. period, was de- 
termined according to the method of Engel, Romano, 
Ferris, Webb, and Stevens (1944). This involved 
counting, by means of a Marshall ruler, the number 
of waves occurring in each of 200, 1-sec. samples of 
artifact-free occipital lobe tracings and then obtain- 
ing the mean frequency per second. In a few in- 
stances, 200 artifact-free samples could not be ob- 
tained and therefore a somewhat smaller sample had 
to be employed. EEG records were also taken from 
11 of the 18 quitters before and after their pre- 
Mature termination of the experiment. 

Biochemical and affective measures. Urine samples 
Were taken at 24-hr. intervals for 3 days prior to 
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the experiment, at daily intervals during the experi- 
ment, and for 2 days after its termination resulting 
in twelve 24-hr. samples for each S who successfully 
endured the 1-wk. period. An analysis for cate- 
cholamines, known to be involved in various stress- 
ful situations (e.g, Elmadjian, Hope, & Lamson, 
1957; Frankenhaeuser & Patkai, 1964; Levi, 1964), 
was then performed by the fluorimetric procedure of 
Euler and Lishajko (1961) and was expressed as 
total excretion of adrenaline and noradrenaline, 
separately, per 24-hr. period. In addition to this 
biochemical analysis, an appraisal was made of Ss’ 
affective state by two paper-and-pencil tests. The 
first was the Subjective Stress Scale (SSS) which 
has been shown to differentiate well between indi- 
viduals who have experienced situations inducing 
various degrees of stress (Berkun, Bialek, Kern, & 
Yogi, 1962; Myers, 1964). The SSS is a 15-item 
Thurstone scale, with higher scores indicating more 
adverse reactions. The second test, developed by 
Myers, Murphy, Smith, and Goffard (1966) specifi- 
cally for perceptual deprivation experiments, is a 
measure of overall mood. It consists of a 114-item 
adjective checklist which S answers as applying 
“not at all,” “somewhat or slightly,” or “mostly or 
generally” to him during a particular time period. 
Both the mood test and the SSS were administered 
at daily intervals before, during, and after the 
experimental period, that is, at intervals correspond- 
ing to the 24-hr. urine samples. 


Comparison Groups 


The results of the 12 successful, perceptually de- 
prived immobilized Ss (IPD group), on the bio- 
chemical and affective measures, were compared 
with the results of 16 immobilized (I group) and 16 
recumbent control Ss (RC group) from a recent 
study (Zubek, 1968) employing the same coffinlike 
box, 1-wk. duration, and measures as the IPD 
group but using Ss exposed to normal visual and 
auditory stimulation. Furthermore, to control for 
possible seasonally associated changes in catechola- 
mine excretion both studies were conducted during 
the same time of the year. The results of the IPD 
group on the remaining measures, namely intellectual, 
perceptual-motor, and EEGs were compared with 
the results of another 1-wk. study (Zubek & Mac- 
Neill 1966) employing 32 immobilized and 32 re- 
cumbent control Ss. To control for possible differ- 
ences in the level of initial ability, the preexperi- 
mental scores of the 12 successful IPD Ss, on the 11 
intellectual and 2 perceptual-motor measures, were 
matched with the initial scores of 12 of the 32 im- 
mobilized (I group) and 12 of the 32 recumbent 
control Ss (RC group). 


RESULTS 


Intellectual battery, An analysis of variance 
on the scores (mean of 7 daily tests admin- 
istered during the 1-wk. period) of the three 
groups of preexperimentally matched Ss (12 
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per group) was performed on each of the 11 
tests of the intellectual battery. Following this 
analysis, the significance of the differences be- 
tween the various groups of Ss was evaluated 
by two-tailed ¢ tests. 

Significant F values were obtained for six 
of the tests, namely verbal fluency (F = 4.27, 
p < .05) space relations (F = 5.80, p < .01), 
abstract reasoning (F = 3.83, p < .05), dex- 
terity (F = 6.32, p < .01), cancellation (F = 
12.99, p < .001), and recall (F = 7.27, p < 
.01). The performance of the IPD group was 
significantly worse than that of either the I or 
RC groups on tests of verbal fluency (?'s 
< .05), space relations (p< .01; p< .05), 
abstract reasoning (~’s < .05), and dexterity 
(p < .01; p< .05). None of the differences 
between the I and RC groups were significant. 
On the other hand, the performance on can- 
cellation and recall was poorer in the IPD 
than in the I group ($ < .001; p < .01) and 
in the I group than in the RC group (? < .01; 
p < .05). Although the F values were signifi- 
cant for only 6 of the 11 intellectual tests, 
it is important to note that the IPD group 
did worse than the I group on all of the 
remaining five tests. 

No statistical analysis of the results of the 
18 IPD quitters was performed since most 
of them terminated the experiment prior to 
the administration of the first intellectual 
battery on Day 1. 

Perceptual-motor tests. 'The F values for 
the three groups of matched Ss (12 per 
group) on the measure of size constancy 
and depth perception were both significant, 
F=8.75, p < 001; F = 344, p < 05. A t- 
test analysis revealed that the IPD group per- 
formed significantly worse than did either the 
I or RC group on size constancy (larger 
comparison triangles were chosen) and depth 
perception (p< .01; p< .05, respectively). 
None of the differences between the I and RC 
groups was significant. 

EEG records. The relative change in mean 
occipital lobe frequency during the 1-wk. 
period, for each of the IPD (N = 9), I 
(N —12), and RC (N = 12) groups was 
—142, —.59, and —.11 cps, respectively 
(F = 8.77, p < .001). A ż test revealed that 
the IPD group showed a significantly greater 
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decrease in mean frequency than the I group 
(» < .01) and the I group a greater decrease 
than the RC group (p< .01). An analysis 
was also made of the results of 11 IPD quit- 
ters receiving EEG measures before and after 
a mean duration of 29.5 hr. (range of 3-96 
hr.). The mean decrease was —.72 cps, a 
change significantly smaller than the decrease 
of —1.42 cps shown by the nine, 1-wk. IPD 
Ss (p= .01). The rank-order correlation be- 
tween duration and the magnitude of the 
EEG change of the 11 quitters was +.74 
(p< 01). 

Catecholamine determinations. Figure 1 
(bottom) shows the 24-hr. urinary excretion 
of adrenaline of the 12 successful IPD, 16 I, 
and 16 RC Ss. Although all three groups show 
a decreased excretion during the 1-wk. experi- 
mental period, it is interesting to note that 
the excretion levels of both the IPD and I 
groups are consistently higher than the level 
of the RC group throughout the entire ex- 
perimental period. However, an analysis of 
variance on the difference scores between the 
preexperimental (mean of 3 days) and 
“during” experimental periods (mean of 7 
days) of the three groups of Ss yielded a non- 
significant F value. 

Figure 1 (bottom) also shows the pre- and 
postexperimental adrenaline levels of the 18 
IPD quitters. It can be seen that these levels 
are consistently lower than those of the three 
other groups on the 3 days before and the 
2 days after the termination of the experi- 
ment. Despite this trend, none of the differ- 
ences were significant. The results of the 
quitters during the experimental period are 
not shown in Figure 1 since most of them 
terminated the condition prior to the end of 
the first day and thus a 24-hr. urine sample 
was not available for analysis. Furthermore, 
a proportional conversion of their adrenaline 
values into a 24-hr. period was considere 
to be inappropriate because of the exis 
tence of diurnal variations in catecholamine 
excretion (Euler, Hellner-Bjorkman, & Orwen, 
1955). ; 

Figure 1 (top) shows the urinary excretion 
of noradrenaline, the neurohormone of the pê- 
ripheral sympathetic system (Euler, 1950) 
the successful IPD, I, and RC Ss. It can be 
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IPD Quitters N=18 
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Fic. 1. The 24-hr. urinary excretion of adrenaline 


Ss before, during, and after a 1-wk. period. (The 


R.C. Group (Recumbent Controls) 
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I Group (Immobilization) 4—4 N=16 
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(bottom) and noradrenaline (top) of four groups of 
catecholamine levels of the IPD quitters, during the 


experimental period, are not shown in the figure since most of them terminated the condition prior to the 
end of the first day and thus a 24-hr. urine sample was not available for analysis.) 


seen that both the I and RC groups show a 
pronounced decrease in noradrenaline excre- 
tion, and of approximately the same magni- 
tude, throughout the entire 1-wk. period, an 
effect probably related to the maintenance of 
the recumbent position which is known to 
reduce the excretion of both noradrenaline 
and adrenaline (Euler, Luft, & Sundin, 1955; 
Sundin, 1958). On the other hand, the IPD 
group, after an initial decrease, shows a grad- 
ual increase in noradrenaline excretion reach- 
ing a peak during the fifth day. An analysis 
of variance on the difference scores between 
the preexperimental and during experimental 
Periods of the three groups of Ss yielded 
a significant F value (F = 3.88, p <.05). 
Furthermore, the IPD group showed a sig- 
nificantly greater mean increase in nor- 
adrenaline excretion relative to that of either 
the I or RC group (’s < .05). The differ- 


ence between the I and RC groups was not 
significant. 

A comparison of the pre- and postexperi- 
mental noradrenaline levels of the 18 IPD 
quitters with those of the three successful 
groups revealed no significant differences. It 
is interesting to note, however, the presence 
in the quitters of a trend toward increased 
noradrenaline excretion on the day prior to 
the experiment and relative to the preceding 
2 days, a trend also seen for adrenaline. This 
increased catecholamine excretion which has 
also been reported for other studies, for ex- 
ample, in perceptual deprivation quitters a 
day prior to isolation (Zubek & Schutte, 
1966) and before centrifuge (Goodall & 
Berman, 1960) or drug studies (Elmadjian 
et al., 1957) is believed to be indicative of 
apprehension about experimental participa- 
tion. Surprisingly, however, no such trend 
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toward increased catecholamine excretion ap- 
pears to be present in the three groups of Ss 
who successfully endured the 1-wk. period. 

Affective measures. The results of the two 
paper-and-pencil tests, which were adminis- 
tered at the time intervals corresponding to 
those of the 24-hr. catecholamine determina- 
tions, revealed that the IPD group showed a 
consistently higher subjective stress score and 
a more negative mood, relative to both the 
I and RC groups, during the 1-wk. experi- 
mental period. However, on neither of the 
two affective measures were any of the 
differences statistically significant. 


Discussion 


The results of this study have demonstrated 
that increasing the severity of the immobiliza- 
tion condition by the additional restriction 
of visual and auditory stimulation (perceptual 
deprivation) produces greater behavioral 
impairments than does immobilization alone. 
This was shown by the presence of a signifi- 
cant deficit in depth perception and size con- 
stancy in the IPD but not in the I group. 
Furthermore, since it has been reported that 
neither of these perceptual-motor measures 
is affected by a week of either sensory or per- 
ceptual deprivation in an isolation chamber 
(Zubek et al, 1962; Zubek, Pushkar, 
Sansom, & Gowing, 1961), it would appear 
that an impairment of depth perception and 
size constancy will only occur after a severe 
and prolonged restriction of visual, audi- 
tory, and kinesthetic-proprioceptive stimula- 
tion. Significant impairments in the IPD 
relative to the I group were also observed on 
6 of the 11 tests of the intellectual battery 
and a poorer performance, though not sta- 
tistically significant, on the remaining 5 tests. 
Accompanying these greater behavioral im- 
pairments, the IPD group showed a signifi- 
cantly greater decrease in mean occipital lobe 
frequency (—1.42 cps), after the 1-wk. period, 
and relative to that of either the I or RC 
group (—.59, —.11 cps, respectively). In ad- 
dition, since a high positive correlation (.74) 
was obtained between the pre-post difference 
scores of the IPD quitters and their duration, 
the results suggest the presence of a greater 
slowing of EEG activity with increasing dura- 
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tions of the IPD condition. This decrease in 
frequency of 1.42 cps in the 1-wk. IPD 
group, it is important to note, is greater than 
that obtained in an earlier study (Zubek & 
Welch, 1963) employing a week of either 
sensory (—.85 cps) or perceptual isolation 
(—1.21 cps) with no immobilization. This 
difference in magnitude of EEG slowing may 
account, for example, for the deficit in depth 
perception and size constancy in the IPD 
condition but no change in performance on 
these two measures after the same duration 
of sensory and perceptual isolation (Zubek et 
al. 1962; Zubek, Pushkar, Sansom, & Gowing, 
1961). Some experimental support for this 
hypothesis has been reported by Pollack 
(1963) who noted that various psychiatric 
treatments which induced different degrees of 
EEG slowing were accompanied by deficits on 
a variety of intellectual and perceptual tests. 
Furthermore, the magnitude of these deficits 
was related to the degree of induced EEG 
slowing. In view of this association, it is pos- 
sible that the greater intellectual impairments 
observed in the IPD relative to the I group 
may also be related to the difference in EEG 
slowing occurring between these two groups 
of Ss. 

The results of the biochemical determina- 
tions revealed no significant differences in 
adrenaline excretion in the three groups of 
l-wk. Ss during the experimental period. 
However, for noradrenaline the excretion level 
of the IPD group increased significantly rela- 
tive to both the I and RC groups. Since no 
significant differences on either catecholamine 
measure were obtained between the I and RC 
group, or in an earlier 1-wk. study (Zubek 
& Schutte, 1966) comparing perceptually iso- 
lated and recumbent control Ss, the results 
suggest that an increase in noradrenaline but 
not in adrenaline will only occur after 4 
severe restriction of visual, auditory, and 
kinesthetic-proprioceptive stimulation. — | 

Although the IPD group showed a signifi- 
cant increase in noradrenaline, this biochemi- 
cal effect was not accompanied by any S£ 
nificant changes on the subjective stress an 
mood scales, a disparity sometimes not in 
other studies relating biochemical and behav- 
ioral measures of stress (e.g., Cofer & Appley; 
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1964; Lazarus, 1964). Furthermore, this in- 
crease in noradrenaline, particularly notice- 
able during the last 3 days of the experi- 
mental period, does not appear to be related 
to Ss’ greater degree of outwardly expressed 
anger (“anger-out”) which several groups of 
investigators (e.g., Elmadjian et al., 1957; 
Funkenstein, King, & Drolette, 1957) have 
claimed is associated with an increased pro- 
duction of noradrenaline. An examination of 
the appropriate items in the 114-item mood 
scale, administered daily, revealed no evidence 
for an increased incidence of anger-out or 
aggressive emotional reactions in the IPD 
group relative to the I and RC groups. An 
observation of their daily behavior also failed 
to detect any differences in this respect. A 
more likely explanation of the elevated nor- 
adrenaline level is that it is a reflection of 
an accumulating, growing emotional tension, 
of a general nature, developing over time. 
The differential pattern of results obtained 
in the IPD group, namely an increased excre- 
tion of noradrenaline but not of adrenaline 
has also been observed in other “stress” con- 
ditions, for example, during water immersion 
(Graveline, Balke, McKenzie, & Hartman, 
1961), performance of insolvable tests 
(Frankenhaeuser & Patkai, 1964), and in 
hockey coaches during a game (Elmadjian et 
al., 1957), Although the reasons for this dif- 
ferential pattern are not clear, such results 
‘might be expected in view of the different 
site of production of adrenaline and nor- 
adrenaline (Euler, 1956). 
: Finally, an interesting trend was observed 
in the IPD quitters, namely, a consistent 
tendency to possess a lower base-line level 
of adrenaline, relative to the successful 1-wk. 
Ss, during the 3 days before and 2 days after 
the experiment. This trend is significant in 
the light of two earlier 1-wk. studies indi- 
cating a similar, but reliable, base-line dif- 
ference in adrenaline between successful and 
unsuccessful perceptual isolation Ss (Zubek 
& Schutte, 1966) and a difference, bordering 
on statistical significance, between successful 
Ss and quitters undergoing prolonged im- 
mobilization (Zubek, 1968). One possible 
explanation for the nonsignificant results in 
the IPD quitters may be the greater severity 
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of their experimental condition as indicated 
by a quitting rate of 60% versus approxi- 
mately 40% for the Ss in the other two con- 
ditions. Despite this nonsignificant trend, it is 
intriguing that the base-line levels of adren- 
aline but not of noradrenaline, of the three 
types of quitters, should all be low. 
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EFFECTS OF TWO INDEPENDENT STRESSES UPON 
AUTONOMIC RESPONDING * 


JAMES H. GEER? 
State University of New York at Stony Brook 


AND 


KENNETH KLEIN ë 


University of Minnesota 


In an attempt to determine whether the GSR and cardiac response to “aver- 
sive” photographs contained both orienting and emotional components, 40 Ss 
were shown either photographs of dead bodies or photographs of live indi- 
viduals projected upside down. In addition, half of the Ss expected, but never 
received, shocks. The data were interpreted to indicate that in addition to the 
orienting response, an emotional component was present in the response to 
the aversive photographs. The effect of shock threat was to increase reactivity 
to all stimuli. Analysis revealed that the GSR was most responsive to the 
content of the stimuli while the cardiac response was most sensitive to the 


shock threat. 


In a recent study (Geer, 1966) Ss selected 
as being frightened of spiders but not of 
snakes yielded larger GSRs when shown pic- 
tures of spiders than when shown pictures of 
snakes, In that study it was noted that there 
were two possible explanations for the in- 
crease in GSR responding to the aversive pic- 
tures. One explanation was that the stimuli 
aroused a negative emotion; the second was 
that the stimuli aroused an increased orient- 
ing response (OR) that did not necessarily 
contain elements of negative emotion. The 
present study was designed to test the alterna- 
tive explanations in an altered experimental 
design. Different stimuli were used, and Ss 
were not selected for their fearfulness of those 
stimuli. The aversive stimuli used in this 
study were color photographs of victims of 
violent death, which were considered noxious, 
but not painful. 

A second major interest of this study is 
the attempt to assess the effects of an inde- 
pendent threat or stress upon responding to 
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aversive and nonaversive stimuli. The inde- 
pendent threat was produced by telling Ss 
that they would receive electrical shock at 
points throughout the experiment. It was 
predicted that the effect of the shock threat 
would be to raise responsiveness to both the 
aversive and the nonaversive stimuli. 

The final area of interest that is examined 
in this study relates to the autonomic systems 
that are being measured. Both cardiac activ- 
ity and GSRs were collected and examined. 
Cardiac activity was considered to be of par- 
ticular importance since Graham and Clifton 
(1966) suggest that different components of 
the cardiac response reflect orienting and 
defensive responses. If that suggestion is 
accurate, it should be possible to further 
evaluate the OR versus negative emotion 
interpretations. 


METHOD 


The Ss were 40 female undergraduates enrolled in 
introductory psychology at the University of Penn- 
sylvania. Four of these Ss were eliminated on the 
basis of recording and procedural errors. All Ss were 
met by E who conducted them to a sound and 
electrically shielded room. They were seated in a 
reclining chair that was placed directly in front of 
a reverse projection window upon which the stimuli 
were delivered. Electrodes for recording cardiac and 
GSR activity were attached, and S was given general 
instructions to move as little as possible and to look 
at each picture. All Ss were told that they would 
be viewing pictures of both live individuals and 
pictures of dead bodies and that E would be record- 
ing their physiological reaction to those stimuli. In 
addition one-half of the Ss, randomly selected, were 
told that from time to time “electrical stimulation” 
would be applied through two electrodes placed on 
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the left hand. The remaining Ss were told that this 
second set of electrodes were for measuring an 
aspect of skin reactivity. In fact, no shocks were 
given any Ss. A brief rest period of approximately 
5 min. followed while recording equipment was 
checked and calibrated. 

All Ss were shown 10 color photographs of male 
undergraduates. The photographs, taken against a 
brick wall background, were of models bare from 
the waist up. For each S the same 10 photographs 
were shown; however, the order was randomized. 
The presentation of 10 photographs was used to 
permit partial habituation of responses to “neutral” 
stimuli. Following Trials 1-10 with these habituation 
stimuli, the second experimental manipulation was 
introduced. On Trials 11-15 one-half of the Ss, 
randomly selected and equally divided between shock- 
threat and no-shock-threat conditions, saw photo- 
graphs of more undergraduate males, but the photo- 
graphs were projected upside down. This was the 
nonaversive condition. The remaining Ss on Trials 
11-15 were shown color photographs of male victims 
of sudden or violent death. This was the aversive 
condition. Each photograph was of the upper torso 
of the individual and in most instances the autopsy 
table top served as the background. Blood and 
other evidence of trauma were present in the photos. 
The order of stimuli on Trials 11-15 was randomized 
for each S. 

Photographs were presented at intertrial intervals 
of from 50 to 70 sec. with a mean intertrial interval 
of 60 sec. Intertrial intervals were controlled by a 
Programmed timer, and the sequence of intervals 
varied for each S. Each photo was projected by a 
Kodak Carousal projector for 5 sec.; the stimulus 
duration was controlled by an electronic timer. 
GSRs and cardiac rates were recorded on a Beckman 
Model RB dynograph using that instrument’s 
couplers as the response sensing devices. Beckman 
electrodes and electrode paste were used for all data 
collection. 


RESULTS 


A GSR was defined as a measurable de- 
crease in skin resistance that began within 
1 to 5 sec, after stimulus onset. The measure 
of the response on a given trial was the log 
of the change in conductance. The formula 
used was (x n x ) X 10? + 1.0 where R; is 

2 1 
the resistance in ohms at the start of the in- 
flection and Re is the resistance at the point 
of maximum deflection. For each S the mean 
GSR on Trials 8, 9, and 10 was computed 
and those data were subjected to a 2 x 2 
analysis of variance. The same analysis was 
performed on the mean log of the basal con- 
ductance (Ri) for Trials 8, 9, and 10. These 
analyses produced no significant results, indi- 
cating that neither basal skin resistance nor 
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Note.—Mean of Trials 8, 9, and 10 from Trials 11, 12, 1 
and 15, 
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reactivity to the last three habituation stimuli 
were significantly affected by instructions, 
There were no detectable differences between 
groups assigned to the aversive and non- 
aversive conditions. 

Each S’s mean GSR for Trials 8, 9, and 10 
was computed and subtracted from each 
GSR S gave on Trials 11 through 15. The 
principle dependent variable used for the GSR 
was the mean of those difference scores. This 
measure has the advantage of subtracting 
each S’s general GSR reactivity from the 
response to the experimental stimuli. For a 
S to obtain a positive score she had, on the 
average, to respond with greater amplitude 
to test than to the mean of the last three 
habituation stimuli. Table 1 contains the 
group means for the GSR dependent variable. 
A 2X2 analysis of variance of those data 
yielded a highly significant (F = 14.33, df= 
1/32, < .001) aversive versus nonaversive 
stimulus effect while neither the Shock In- 
structions nor the Aversive versus Nonaver- 
sive Stimuli x Shock Instructions interaction 
approached significance. Since a check of 
homogeniety of variance determined that that 
assumption for the analysis of variance was 
violated, several nonparametric analyses were 
conducted to verify the analysis reported 
above. The median test indicated (x? = 12.00, 
df = 3, p < .01) that the four groups differed 
in reactivity to stimuli. The same statistic 
indicated that the aversive group was con- 
siderably more reactive (y? = 11.8, df =l, 
$ < .001). A similar analysis for the shock- 
threat conditions did not approach statistical 
significance. The Man-Whitney U test was 
used to compare individual groups. The shock- 
threat-aversive group differed significantly 
(2 < .002) from both nonaversive groups. No 
other individual comparisons were significant. 


TABLE 1 


MEAN CHANGE IN THE LoG or CHANGE 
IN CONDUCTANCE SCORES 


Dead | Upside | total 
bodies photos | | 
Shock threat ab | 112) 108 
No shock threat 550 064 ? 
Total 1.494 A76 
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These analyses were interpreted to indicate 
that GSR reactivity was greater to dead 
bodies as stimuli than to upside down photo- 
graphs, and that the difference was more 
pronounced under shock threat. 

According to Sokolov (1963) the OR 
habituates while the defensive response either 
does not or does so at a reduced rate. This 
suggests that frequency analyses may be use- 
ful in discriminating between the two types 
of responses. Thus, a 2 X 2 analysis of vari- 
ance of the number of GSRs to the five ex- 
perimental stimuli was performed. The re- 
sults were similar to the amplitude data: 
aversive versus nonaversive stimulus effect 
was highly significant (F = 22.48, df = 1/32, 
$ «.001) while neither shock instructions 
nor the interaction term approach statistical 
significance. These results support the inter- 
pretation that the aversive stimuli, pictures 
of dead bodies, elicited more than a simple 
OR to stimulus change. 

The heart rate data were analyzed by 
analyses of variance as the homogeniety of 
variance assumptions were met with these 
data, An analysis was performed on the mean 
heart rate of the two interpulse intervals that 
preceded the stimulus on Trial 11. As with 
the comparable GSR data, no effects ap- 
proaching statistical significance were found. 
Apparently instructions did not affect basal 
heart rate levels. Following the suggestion 
by Lang and Hnatiow (1962) and Graham 
and Clifton (1966) the cardiac response was 
considered to be made up of an accelerative 
and a decelerative component. The acceler- 
ative component of the biphasic cardiac re- 
sponse was measured by finding the difference 
in heart rate between the mean of two inter- 
pulse intervals that immediately preceded 
stimulus onset and the fastest beat that oc- 
curred within 1-5 sec. after stimulus onset. 
As with the GSR data, the mean of the accel- 
erative component was computed for Trials 
8, 9, and 10 and this value was subtracted 
from each accelerative component on Trials 
11, 12, 13, 14, and 15. Table 2 contains the 
group means for the accelerative component. 
An analysis of variance was performed upon 
the mean of each individual’s difference 
scores. A significant shock instructions effect 
(F = 5.12, df = 1/32, p < .05) was found, in- 
dicating that threat of shock was associated 


239 


TABLE 2 


MEAN CHANGE IN CARDIAC ACCELERATION 
(BEAT PER MINUTE) 


Dead | Upside 


T: 
bodies Lu ‘otal 
Shock threat 3.82 -16 4.58 
No shock threat —.26. —4.60 —4.86 
Total 3.56 —3.84 


Note,—Mean of Trials 8, 9, and 10 from Trials 11, 12, 13, 14, 
and 15. 


with an increase in the accelerative compo- 
nent, A decrease inethe accelerative com- 
ponent was found to stimuli presented 
under no shock threat. The aversive versus 
nonaversive stimulus effect just missed 
accepted levels of statistical significance 
(F=3.14, df=1/32, p « 10, p>.05). 
These results are taken as a strong indication 
that shock threat yielded increased cardiac 
acceleration. The interaction term did not 
approach statistical significance. A frequency 
analysis of the number of increases in the 
cardiac accelerative component yielded the 
same finding of a significant effect of shock 
instructions (F = 6.26, df = 1/32, p < .05); 
neither aversive versus nonaversive stimulus 
effects nor the interaction effects approach 
significance. 

The decelerative component of the cardiac 
response was measured by subtracting the 
slowest beat subsequent to the previously 
noted accelerative beat from the prestimulus 
measure. The decelerative beat was restricted 
to the 15 sec. that followed stimulus onset. 
As with the accelerative component, indi- 
vidual reactivity was calculated by deter- 
mining the mean of the decelerative compo- 
nent on Trials 8, 9, and 10. The dependent 
variable was the difference, for each S, be- 
tween the decelerative component on Trials 
11 through 15 and the mean of Trials, 8, 9, 
and 10. Table 3 contains the group means 
for the difference in decelerative components. 
A 2x2 analysis of variance was performed 
upon the data and the results closely parallel 
that for the accelerative component. The ef- 
fect of shock instructions (F — 8.62, df — 
1/32, p < 01) was significant. Table 3 indi- 
cates that the decelerative component was 
larger under shock threat than under condi- 
tions of no threat. The aversive versus non- 
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TABLE 3 
MEAN CHANGE IN CARDIAC DECELERATION 
(BEATS PER ) 
Dead Doade T 
bodies EET otal 
Shock threat 3.48 1.60 5.08 
No shock threat —.72 —4.26 —4.98 
Total 2.76 —2.66 


Note.—Mean of Trials 8, 9, and 10 from Trials 11, 12, 13, 14, 
and 15, 


aversive stimulus effect again was not signifi- 
cant but was in the direction of increased 
reactivity (decelerative) to dead bodies. The 
interaction effect was also negligible in this 
instance. A frequency analysis of the number 
of increases in the decelerative component 
yielded similar results: shock instructions 
were significant (F = 18.8, df = 1/32, p< 
.001) while aversive versus nonaversive 
stimulus effects and the interaction did not 
attain statistical significance. The heart rate 
data were interpreted to indicate that both 
components of the biphasic cardiac response 
were sensitive to the shock threat, and less 
sensitive to content of the stimuli. 


Discussion 


The GSR was apparently more affected 
by the aversive photos of dead bodies than 
by upside down photographs, as indicated by 
larger and more frequent GSRs among Ss in 
the aversive group. The data also suggested 
that this effect was enhanced by shock threat. 
The cardiac data are consistent with the GSR 
data though they do not actually achieve 
accepted levels of statistical significance. For 
both the accerelative and decelerative com- 
ponents, dead bodies tended to elicit greater 
cardiac reactivity and this was particularly 
true under conditions of shock threat. 

The use of upside down photographs was 
designed to provide nonaversive stimuli that 
were more dissimilar from habituation stimuli 
than were the aversive stimuli. It should be 
noted that stimulus difference is used here 
only in reference to the absolute physical 
properties of the stimuli. It may be argued 
that the upside down photographs differed 
from the habituation photographs only in a 
trivial sense, but, it is indeed just that point 
which this study attempts to illustrate. 
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Sokolov’s (1963) theory concerning the OR 
states that an OR is elicited by stimulus 
change and that OR amplitude increases with 
increased stimulus change. Further Sokoloy 
(1963) is explicit in referring only to physi- 
cal properties of the stimulus. “The term 
[meuronal model] suggests that the nervous 
system produces an exact model of the 
properties of external objects acting on the 
sense organs [p. 286].” 

In an attempt to verify that the physical 
characteristics of the stimuli differ as pro- 
posed above, judges were asked to rate 
stimulus differences on the dimension of 
physical characteristics. Instructions included 
explicit directions to ignore startle or shock 
value and to respond as if they (the judges) 
were viewing photographs of unknown objects. 
Judges, without exception, rated the physical 
characteristics of the aversive stimuli to be 
more similar to the habituation stimuli than 
were the nonaversive stimuli. It should be 
noted that all judges pointed out that this was 
true only if one totally ignored stimulus con- 
tent. The point to be made is that when con- 
sidering stimuli presented to an S, one cannot 
rely only upon sense receptor stimulation but 
must also consider prior experiences of S. — 

The augmented GSR to the aversive stimuli 
is viewed as indicating that more than à 
simple OR to stimulus change was aroused. 
This view is supported by the analysis that 
revealed reduced habituation of the GSR to 
the aversive stimuli. While what this extra 
something is cannot be stated with assurance, 
it seems reasonable to infer that it reflects 
the negative emotion elicited by the aversive 
stimuli. What cannot be determined is 
whether this response to the perception of the 
content of the stimuli results in an augment 
OR, or a more direct reflection of the negative 
emotion as such. If Sokolov's (1963) defini- 
tion of the defensive response as being 
“closely connected with the sensation of pain 
[p. 15]” is maintained, then the stimuli use 
in this study did not elicit the defensive 
response. It is clear that the content of the 
stimulus is an important variable, and the 
content may elicit more than an OR t° 
stimulus change. Perhaps the effect of certam 
stimulus content is similar to what occus 
when a stimulus change acquires signal proP- 
erties. When a stimulus or stimulus chang? 
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acquires signal properties, as in the case of 
a conditioned stimulus, OR habituation is in- 
hibited. Only further research can clarify 
these issues. 

Consider now the effect of the indepenedent 
stress, the threat of shock. Several interesting 
things may be noted concerning this ma- 
nipulation. First, there is no evidence that 
general arousal, as indexed by either basal 
levels of GSR or cardiac rate, was effected 
by the shock threat. The effect of the shock 
threat was upon reactivity to novel stimuli. 
It was as if S under shock threat was primed 
to respond to changes in his environment. It 
should be noted that in no case did the inter- 
action of instructions and stimuli approach 
statistical significance. That is, independent 
Stress seems to effect reactivity in general 
and does not act to differentially increase 
reactivity to more arousing stimuli. 

The cardiac response was separated into 
accelerative and decelerative components as 
suggested by Graham and Clifton (1966). 
There were no detectable differences between 
the components of the response. Both compo- 
nents indicated that the cardiac response to 
novel stimuli is increased when S is under 
the threat of shock. The GSR data, while not 
attaining statistical significance, were in the 
same direction as the cardiac data and pro- 
vided further evidence that the effect of shock 
threat was to augment reactivity to all novel 
stimuli, aversive or not. 

The differential sensitivities of the cardiac 
and GSR are of considerable interest. First, 
it should be noted that the two systems did 
not yield contradictory findings. The results 
indicated that the GSR was most sensitive 
to the content of the stimuli while the cardiac 
tesponse was most sensitive to the shock 
threat. Why this should be the case is not 
clear. It is, however, consistent with Lacey’s 
(1950) notion of relative independence of 
the components of the autonomic system. The 
data are also consistent with the finding by 
Dykman, Reese, Galbrecht, and Thomasson 
(1959) that in normal humans the GSR is 
more stimulus bound than cardiac or respira- 
tion reactions, and the interpretation of Gold- 
Stein and Acker (1967) that skin resistance 
Measures index, “the readiness of the orga- 
nism to attend to, or be alerted by, external 
stimuli [p. 28].” At a speculative level, it 
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might be suggested that the cardiac response 
is best manipulated by potentially important 
physical threats to the individual. Photographs 
of dead bodies provide no actual threat to 
the physical integrity of the individual. 

The finding that cardiac acceleration and 
deceleration were consistent is not in complete 
accord with Graham and Clifton's (1966) 
suggestion that they index different responses. 
These authors suggested that the accelerative 
component may be associated with the defen- 
sive or perhaps startle reflex, and the de- 
celerative component with the OR. If one ac- 
cepts Sokolov's (1963) contention that the 
defensive reflex is elicited through painful 
stimuli, then the data are not supportive of 
Graham and Clifton's position since no actual 
pain was aroused. If, however, a broader 
interpretation of defensive reflex is employed, 
one that includes potentially painful stimuli, 
or startle stimuli, then it could be argued 
that stimulus change elicited the OR and the 
shock threat elicited the defensive reflex. 
Thus, in the present study both aspects of the 
cardiac response may have been affected. 
Until further research clarifies the startle, 
orienting, and defensive components of the 
cardiac response, any interpretation must 
remain tentative. 
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SCHIZOPHRENIA AND THE SCANNING COGNITIVE CONTROL: 
A REEVALUATION 
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In recent studies it has been inferred from performance on perceptual tasks, 
that paranoid schizophrenics showed extensive scanning and nonparanoid 
schizophrenics showed minimal scanning. Eye movements during free-search and 
size-judgment tasks were recorded for paranoid and nonparanoid schizophrenics, 
psychotic depressives, and control Ss. Medicated patients showed lower eye- 
movement rate and fixation rate than nonmedicated patients and controls but 
there were no significant differences due to schizophrenia. There was a 
significant interaction between patient type and medication for size-judgment 
means, but all such differences were reduced to nonsignfiicance by training. 
However, even after training, patients showed greater variability in size 
judgments than control Ss. The results failed to support the notion of ex- 
tensive or minimal scanning behavior of schizophrenics. Errors by the psychotic 
patients in making size adjustments were not correlated with scanning behavior. 


The concept of scanning—the extensiveness 
with which stimuli are sampled when attend- 
ing to a sensory or perceptual field—was 
introduced by Gardner, Holzman, Klein, 
Linton, and Spencer (1959) to refer to certain 
stylistic consistencies in perceptual and cogni- 
tive functioning of normal Ss. A factor from 
perceptual judgment measures loading on size 
estimations and size constancy judgments was 
interpreted as reflecting scanning although no 
direct visual scanning measures were included 
in the study. Gardner and Long (1962a, 
1962b) obtained direct measures of visual 
scanning by recording eye movements during 
size-estimation tasks. The various eye-move- 
ment measures for different conditions of the 
size-matching task involving minor changes in 
the standard stimulus were postively corre- 
lated. This provided some evidence of indi- 
vidual consistencies in direct measures of 
visual scanning although the correlations be- 
tween the eye-movement measures and the 
size judgments were low. 

Silverman (1964a) employed the concept 
of scanning to summarize and explain results 
of a number of perception experiments in- 
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volving schizophrenics, and as the basis for 
a general theory of schizophrenic behavior. 

Silverman (1964b) tested 26 nonparanoid 
and 17 paranoid male schizophrenics all re- 
ceiving tranquillizer medication on the stan- 
dard Gardner et al. (1959) size-estimation 
task; the patients were asked to adjust à 
circular patch of light to the same size as | 
each of three hand-held disks. Nonparanoid 
schizophrenics showed consistent overestima- 
tion and paranoid schizophrenics showed 
underestimation. These differences were sig- 
nificant at the .01 level or better. 

On the basis of the Gardner and Long stud- 
ies, Silverman interpreted the differences M 
size estimation as reflecting differences M 
scanning behavior without obtaining any 
direct measures of visual scanning. It was 
concluded that paranoid schizophrenics 
showed extensive scanning—continually al- 
ternating fixations, looking back and forth— 
and nonparanoid schizophrenics showed mini- 
mal scanning—maintaining fixation with lit- 
tle eye movement. 

The aim of the present study was to test 
directly the postulated differences in scanning 
behavior between paranoid and nonparamo 
schizophrenics and normal Ss. 


Definition of Visual Scanning 


In the Gardner and Long (1962b) study a 
which the scanning process was lefined fl 
rectly by eye-movement measures, the s! 
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estimation procedure involved S difference in 
judgment time. The eye-movement measures 
(particularly those found to correlate signifi- 
cantly with size judgments) were confounded 
with judgment time. In this case a high scan- 
ner was a S who repeatedly looked back and 
forth from the standard to the comparison 
stimulus and showed long judgment time. 
However, in other tasks (autokinesis, size 
constancy, size judgment) from which Silver- 
man (1964a) inferred scanning differences, 
judgment time or exposure time was mostly 
held constant. In these cases Silverman im- 
plies that differences in the eye-movement 
rate define the scanning process. The problem 
arises that groups of extensive and minimal 
scanners defined by the different criteria (a) 
total number of eye movements in a variable 
judgment time task and (5) rate of eye move- 
ments in a constant exposure task may be 
unrelated or negatively related. 

Before the present study was undertaken, 
the stability and generality of individual con- 
sistencies in measures of rate and number of 
eye movements of normal Ss were investi- 
gated. The corneal photography technique 
was used to record eye movements in a wide 
range of conditions including perceptual judg- 
ments (McKinnon, 1966). 

This study showed stability of eye-move- 
ment measures over time and the generality 
of individual consistencies in saccadic eye- 
movement rate over a broad range of experi- 
mental conditions. Visual scanning rate was 
unrelated to “functional efficiency” of the 
visual system defined by fixation steadiness, 
tracking efficiency, and binocular coordina- 
tion. There was restricted evidence for indi- 
vidual consistencies in adjustment time and 
related measures of the number of saccadic 
eye movements for size- and brightness-ad- 
justment tasks. However, none of the eye- 
Movement measures was significantly corre- 
lated with size judgments. For experimentally 
independent conditions, visual scanning rate 
was consistently negatively correlated with 
time taken to make perceptual adjustments. 

These findings indicate difficulties in in- 
ferring scanning differences from perceptual 
judgments alone and the need to specify ob- 
Jective criteria of scanning. 
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In the present study the perceptual and 
scanning behavior (defined by the number and 
rate of eye movements) of medicated- and 
nonmedicated-schizophrenic groups, psychi- 
atric, and normal controls was examined for 
free-search and size-judgment conditions. 

It is important to specify conditions under 
which differences in size judgments or scan- 
ning behavior of schizophrenics and normal or 
psychiatric control Ss are expected to occur 
and whether these can be eliminated. 

Stannard, Singer, and Over (1966) re- 
ported that differences between the perform- 
ance of schizophrenic and normal Ss on a 
size-constancy task under normal motivation 
conditions were eliminated when behavior 
was manipulated using impersonal reinforce- 
ment in an operant procedure. 

In the present study a training procedure 
in the size estimation task involving verbal 
feedback from E was employed to attempt to 
eliminate any deficit shown by patients in size 
judgments. 

If size underestimation and overestimation 
are associated with extensive and minimal 
scanning, respectively, then if training elimi- 
nates size-judgment deficit, it could be ex- 
pected to alter scanning patterns. 


METHOD 


Subjects. Six groups of 14 Ss, each including seven 
males and seven females, were tested. The six groups 
were: (a) nonparanoid schizophrenic, medicated; 
(b) nonparanoid schizophrenic, nonmedicated; (c) 
paranoid schizophrenic, medicated; (d) paranoid 
schizophrenic, nonmedicated; (e) psychotic depres- 
sive, nonmedicated; and (/) psychiatric nurses (con- 
trol). 

The psychiatric patients were selected from admis- 
sion wards at two metropolitan psychiatric hos- 
pitals. Clinical diagnosis was made by the psychia- 
trist treating the patient and doubtful cases were 
excluded. All patients were tested within 4 wk. of 
admission with not more than a total of 2 yr. of 
hospitalization. There was an age range of 20-50 yr. 
(mean age of the nonparanoid schizophrenic was 
34.5 yr, SD —9.8 yr.; paranoid schizophrenic: M 
—388 yr, SD=11.0 yr.; psychotic depressive: M 
—44.6 yr., SD = 10.1 yr). 

Most of the schizophrenic patients were receiving 
tranquillizing drugs in the course of treatment. Pa- 
tients whose medication could not be suspended 
(about 35%) were eliminated and schizophrenics 
were randomly allocated to medicated and non- 
medicated groups. The medicated groups were main- 
‘tained on current drugs. For the nonmedicated 
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groups, drugs were suspended 30 hr. prior to test- 
ing. The nonmedicated condition was designed to 
eliminate some of the gross drug effects but should 
not be interpreted as showing performance in the 
complete absence of drug effects. 

Control Ss were psychiatric nurses from Glades- 
ville Hospital, New South Wales, Australia (a mean 
age of 27.4 yr, SD—72 yr). 

Apparatus. Eye-movement recordings were made 
using the EDL Reading Eye camera (Taylor, 1960). 

The stimulus display for the free-search condition 
consisted of two white 35-mm. squares with centeres 
100 mm. apart on a 14 X 9 cm. midgray screen and 
viewing distance of 38 cm. 

Size matching was made for two circular patches 
of light projected onto a 15 X 10 cm. white screen, 
the left standard circle was 40 mm. diameter and 
the right comparison circle was adjusted by S. Dis- 
tance between the two circles was 10 cm. and view- 
ing distance was 38 cm. The two circles were pro- 
duced by identical Prado 35-mm. slide projectors 
with 200-w. white bulbs and circular apertures re- 
placing the slide cartridges 60 cm. behind the trans- 
lucent screen. White reduction plates with 18 mm. 
diameter circular holes were placed in front of the 
final lenses. For the comparison stimulus, an iris 
diaphram was used to produce a variable sized 
smooth circular patch of light. The variable aper- 
ture was activated either by a handle on S's right or 
from the rear by E. 

Procedure. After preliminary explanations and es- 
tablishment of rapport, all Ss were tested on the 
free-search condition and then on the size-adjust- 
ment task under pretraining, training, and extinction 
conditions. 

A practice trial with no specific instructions and a 
blank stimulus field was given to familiarize Ss with 
the camera operation. There was a free-search test 
trial of 20 sec. in which S was instructed simply to 
look at the display. Previous testing indicated eye- 
movement measures based on this period were 
reliable. 

After a rest period of 2 min. there were 2 prac- 
tice trials followed by 6 test trials (pretraining). A 
further 1-min. rest period was followed by 15 train- 
ing trials in which E responded to each adjustment 
with “you are correct” (for the range 38-42 mm. 
diameter of the comparison circle) or “you set the 
adjustable circle too small” or “you set the adjusta- 
ble circle too large.” Then there were 6 test trials 
(posttraining), an additional 15 trials, and a third 
set of test trials (extinction), all without verbal 
feedback. All trials were identical size-adjustment 
trials, except that during the 15 training trials Ss 
received verbal feedback, and that during the three 
sets of test trials eye movements were recorded. The 
starting points for ascending and descending test 
trials for the comparison circle were 30, 50, 35, 45, 
30, 50 mm. in diameter and the intertrial interval 
was 10 sec. 

Eye-movement recordings were made for the free- 
search and three sets of six size-adjustment test 
trials and the following measures were obtained: 
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free search and size adjustment—(a) movement rate: 
number of large excursion eye movements between 
the two stimuli per minute; (b) fixation rate: 
number of fixations or fixation changes per minute; 
size adjustment only—(c) adjustment rate: number 
of completed size adjustments per minute (inversely 
related to mean judgment time). (d) number of 
movements: number of large eye movements per 
completed size adjustment. (e) number of fixations: 
number of fixations per complete size adjustment. 


RESULTS 
Free-Search Condition 
The means for movement rate and fixation 


rate for the six S groups are shown in Table | 
1. It appeared that the nonmedicated groups | 


were similar but that the two groups receiving 
medication showed lower scanning rate. 

Since the six S groups were not matched for 
age, age scores were employed as covariates 
in the analyses. Factorial analyses of covari- 
ance, Six S Groups X Sex, were made sepa- 
rately for movement and fixation rate meas- 
ures. Post hoc contrasts for S groups were 
tested using the R technique (Rodger, 1967) 
with alpha level .05. The main effect (maxi- 
mized contrast) for the six S groups was sig- 
nificant for both measures (movements: F — 
1.87, df = 5/71, p< .05; fixations: F= 
1.85, df = 5/71, p < .05). The eye-movement 
rates and fixation rates of the two medicated 
groups were significantly lower than the non- 
medicated groups. None of the contrasts be- 
tween the four nonmedicated groups including 
the normal control group were significant. The 
main effect for sex was significant only for the 
movement-rate scores, with females showing 
higher movement rate than males (move- 
ments: F = 7.38, df = 1/71, p < .05; fixe 
tions: F = .84, df = 1/71, p > .05). Interac- 
tion effects of S Groups X Sex were not Sig- 
nificant. 

These results provide no support for the 
hypothesis that schizophrenics show extreme 
scanning behavior in the free-search condition. 


Size Judgments 


The mean size-judgment scores for the SP 
S groups under pretraining, posttraining, 47 
extinction conditions are shown in Table 2. 
It appeared that group differences shown for 
the pretraining condition were elimina! by 
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TABLE 1 
MEANS FOR MOVEMENT AND FIXATION RATE, FREE SEARCH 
Group 
Measures 
Nonparanoid Nonparanoid P: id P: jl 
schizophrenic schizophrenic schizophrenic HUNE oed Normal control 
medicated nonmedicated medicated nonmedicated | nonmedicated 
Movement rate 
Male 21.7 37.7 26.3 29.9 28.5 37.4 
Female 30.6 38.2 32.5 46.4 48.1 424 
Combined 25.6 37.9 29.3 38.1 38.3 39.9 
Fixation rate 
Male 60.0 87.5 52.0 81.0 82.5 88.0 
Female 72.5 79.5 75.9 90.6 86.5 80.0 
Combined 66.2 83.5 63.8 85.8 84.5 84.0 


the training procedure. There was a greater 
spread of size-judgment means for the ex- 
tinction condition, being intermediate between 
pretraining and posttraining judgment means. 
Because of the extreme changes in variance 
of sources for different conditions, analyses of 
covariance were made for each condition sepa- 
rately. The maximized contrast for the six S 
groups was significant only for the pretraining 
condition (F = 1.94, df = 5/71, p < .05). 
Differences in mean size judgments between 
paranoid and  nonparanoid schizophrenic 
groups (medicated, nonmedicated, and com- 
bined) were not significant. For nonmedicated 
groups, nonparanoid schizophrenics showed 
greater underestimation than paranoid schizo- 
phrenics, whereas for medicated groups, para- 
noid schizophrenics showed greater underesti- 
mation. The contrast for this overall interac- 
tion effect was significant. Only the medi- 
cated paranoid schizophrenic group showing 
marked underestimation was significantly 
different from the normal control group. No 
significant differences were found between the 


, 


mean size judgments of psychotic depressives 
and schizophrenic patients. 

Two additional size-judgment measures 
were analyzed: the absolute-error measure 
(absolute value of the constant error) and the 
intra-S variance of size adjustments. Means 
for the six S groups under training conditions 
are shown in Table 3. For both measures the 
maximized contrasts for the six S groups were 
significant for pretraining, posttraining, and 
extinction conditions (errors: pretraining F 
= 4.42, df=5/71, p < .05; posttraining F 
= 2.59, df = 5/71, p < .05; extinction F = 
3.51, df = 5/71, p < .05; variance: pretrain- 
ing F = 1.71, df = 5/71, p < .05; posttrain- 
ing F = 2.09, df = 5/71, p < .05; extinction 
F = 2.09, df = 5/71, p < .05). Except in two 
instances all patient groups showed signifi- 
cantly greater absolute errors and intra-S 
variance than the control group for all condi- 
tions and there were no significant differences 
between patient groups. 

Despite the evidence that the training pro- 
cedure eliminated differences in mean size 


TABLE 2 


MEANS or SIZE JUDGMENTS FOR PRETRAINING, POSTTRAINING, AND EXTINCTION 


Group 
Condition 
paranoid Paranoid Paranoid Psychotic 
piene BUE E schizophrenic schizophrenic depressive Normal control 
medicated nonmedicated medicated nonmedicated | nonmedicated 
Pretraining 2321 —2.60 —4,10 38 —2.07 —.69. 
Posttraining —.64 —.60 —.57 —.42 —1.00 —37 
Extinction —1.46 —2.60 —.62 —.40 —1.46 —.40 


Note.— Constant error in millimeters. 
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TABLE 3 
MEANS OF ABSOLUTE ERROR AND INTRASUBJECT VARIANCE SCORES 
Group 
Messe aod Gaditen) 14, araricua [Nose Retna LGR Paranoid Psychotic 
schizophrenic | schizophrenic | schizophrenic | schizophrenic depressive — | Normal control - 
medicated nonmedicated medicated nonmedicated | nonmedicated 
Absolute error 
Pretraining 9.06 10.37 14.77 10.49 9.17 221 
Posttraining 4.06 3.21 2.86 3.01 5.03 1.68 
Extinction 8.63 10.11 5.58 7.06 9.26 1.72 
Intrasubject variance 
Pretraining 51.1 36.2 47.5 43.5 30.4 17.2 
Posttraining 46.7 27.9 16.5 26.6 30.6 12.8 
Extinction 32.2 32.7 27.6 47.1 33.8 12.9 


judgments for the six S groups, the greater 
absolute errors and intra-S variance of size 
judgments of patients compared with control 
Ss were not eliminated. 


Eye-Movement Scores 


Results for the five eye-movement measures 
were treated separately in analyses of co- 
variance for the pretraining, posttraining, and 
extinction conditions. The F ratios for the 
maximized contrasts for means of the six S 
groups are shown in Table 4. The group means 
for the scores with significant maximized con- 
trasts are shown in Table 5. 

None of the 15 contrasts testing the differ- 
ences between paranoid and nonparanoid 
groups (medicated, nonmedicated, and com- 
bined) was significant. There was no evidence 
that schizophrenic patients showed extreme 
scanning defined either by rate of eye move- 
ment (Measures 1 and 2) or number of move- 
ments (Measures 3 to 5). 

"There were significant differences between 
patients and the control group for movement- 


TABLE 4 


F Ratios ror Maxmnzep Contrasts 
or SUBJECT GROUP MEANS 


Measure training | tinae | Extinction 
Movement rate 1,05 1.74* 2.45* 
Fixation rate 1.16 1.26 .82 
Adjustment rate -56 1.52 60 
Amount of movement 2.04* 74 1.33 
Amount of fixation 22/* 58 71 


Note.—F(1 — a)” (Rodger, 1965). F = 1.43, df = 5/71. 
*p = 05. 


rate scores under posttraining and extinction 
conditions. However, the lower movement 
rate for patient groups as a whole was mainly 
due to the two medicated groups supporting 
the results for the free-search: condition. For 
the four nonmedicated groups there were no 
consistent differences between paranoid and 
nonparanoid schizophrenics and the normal 
control group or the psychiatric control group 
of psychotic 'depressives on any of the eye- 
movement scores. 


Relation between Size Judgments and Eye- 
Movement Scores 


It could be argued that failure to confirm 
differences between paranoid and nonparanoid 
schizophrenics and normals was partly a func- 
tion of difficulties of psychiatric classification 
for patient groups. If all schizophrenics are 
examined regardless of subtype, then from 
scanning theory a relation is predicted be- 
tween extreme scanning behaviors and size 
overestimation and underestimation. It was 
decided to examine the correlations of eye 
movement and size-judgment scores for the 
whole patient group. Obviously the methods 


of selection of the patient groups and combin- - 


ing medicated and nonmedicated patients 
restrict interpretation of the correlations. The 
intercorrelations of the two sets of scores for 
the pretraining condition are shown in Table 
The pattern of eye-movement and size- 
judgment correlations was similar to those 
found in a more extensive study with no 

Ss (McKinnon, 1966). None of the 45 corte 
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TABLE 5 
Group MEANS FOR SCORES WITH SIGNIFICANT CONTRASTS 
Nonparanoid | Nonparanoid Paranoid 
Measure and condition schizophrenic schizophrenic schizophrenic ME UE Normal control 
medicated nonmedicated medicated nonmedicated | nonmedicated 
Movement rate 
posttraining 444 58.6 48.8 584 58.5 76.8 
Movement rate * 
extinction 48.5 67.5 50.2 65.4 50.9 79.5 
Adjustment rate 
posttraining 10.1 9.9 ‘ings 11.5 9.6 12.4 
Number of movements 
pretraining 64 115 7.4 9,3 7.7 tH 
Number of fixations 
pretraining 11.2 19.0 12.5 14.9 15.2 11.9 


lations of eye-movement and size-judgment 
Scores was significant at the .05 level. 


DISCUSSION 


The results failed to support. the experi- 
mental hypotheses predicting differences in 
size-judgments and visual scanning behavior 
of paranoid and nonparanoid schizophrenics 
and psychotic depressives and control Ss. 

Findings for mean size judgments of medi- 
| cated schizophrenics were consistent with re- 

sults reported by Silverman (1964b); medi- 
cated paranoid schizophrenics showed greater 
underestimation than medicated nonparanoid 
| schizophrenics. However, the opposite results 
were found for nonmedicated groups. From 
information available in the present experi- 
ment it is not possible to interpret these ef- 
fects. Although most of the schizophrenic 
patients were receiving phenothiazine drugs 
(as in the Silverman study), there may have 
been systematic differences between paranoid 
and nonparanoid groups in the particular 
medication and dosage parameters. 


Although the procedure used to assess the 
effects of medication involved limited control, 
there were differences between medicated and 
nonmedicated patients for both size-judgment 
and visual scanning scores. Tranquillizer 
medication produced a reduction in visual 
scanning rate for paranoid and nonparanoid 
schizophrenic patients. The results for drug 
effects indicated the possibility of confound- 
ing drug effects in several previous studies on 
schizophrenics with tasks taken as indicating 
scanning behavior. 

Errors shown by psychotic patients in size- 
judgment mean scores were eliminated by the 
training procedure involving verbal reinforce- 
ment. However the greater variability shown 
by patients, variability for each individual 
from occasion to occasion, and variability be- 
tween individuals was not eliminated by the 
training procedure. 

Schizophrenics and psychotic depressive pa- 
tients performed less adequately than control 
Ss on the size-adjustment task but the deficit 
was not systematically related to differences 


TABLE 6 
CORRELATIONS OF SIZE JUDGMENTS AND EYE Movement SCORES 


al 


Item 


Eoo e a GU 
Size judgment mean —.60 —43 às n ge p 
Nondirection error —.06 16 — 24 ‘03 —109 —108 
Intrasubject "b 58 ‘03 49 16 
Movement rate : 21 113 46 
Fixation rate i) —:70 A 
Adjustment rate 2 .88 


Number of movements 
Number of fixations 


Note.—N = 70. 
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in direct eye-movement measures of visual 
scanning. When medication effects were con- 
trolled there was no evidence that schizo- 
phrenic patients showed extreme scanning 
dispositions defined by direct measures of 
visual scanning. Finally, because of the sta- 
bility and generality of individual differences 
previously demonstrated for visual scanning 
measures the conclusions can be applied to a 
wide range of situations involving visual 
scanning. 
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A most frequent clinical description of the sexual offender is his deficiency in 
social skills. In studying these defects, however, the use of broad medicolegal 
groups as a parameter is a questionable procedure. In this exploratory study 
65 pedophiles and rapists, living in a segregated treatment center, were identi- 
fied by a diagnostic procedure and placed into a clinical descriptive classifica- 
tion. A sociometric procedure was used to study differences in social effective- 
ness and to test the usefulness of the classification. The diagnostic procedure 
identified four groups of rapists and three groups of pedophiles. The results 
indicate that the clinical classification is able to clarify sociometric differences 
and is of greater research value than the medicolegal grouping. 


From a clinical frame of reference an obvi- 
ous feature of the sexually deviant act is its 
essentially asocial character. The sexual act 
in the normal adult involves a mutuality of 
experience, a reciprocal sensitivity, and con- 
cern of the one for the other. By contrast, 
the acts of rape, child assault, exhibitionism, 
voyeurism, fetishism are basically impersonal 
and egocentric. 

Tn the course of studying over 800 male, 
young adults and adults who had committed 
sexually deviant acts, it became apparent 
that among other disturbances there is also 
present a marked defect in interpersonal rela- 
tionships with a relative absence of even the 
most basic of social attitudes and values or 
social skills. Such a finding has been reported 
in other clinical and research studies (Allen, 
1962; Gebhard, Gagnon, Pomeroy, & Chris- 
tenson, 1965; Glueck, 1956; Henry, 1941; 
Karpman, 1954). 

The study of these patients occurred as a 
result of a law passed by the Massachusetts 
General Court (General Laws, 1958) which 
established a center for the diagnosis and 
treatment of convicted sexual offenders (Co- 
hen & Kozol, 1966; Kozol, Cohen, & Garofalo, 
1966). Under this law a person found guilty 
of a sexual offense can be committed through 
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a civil procedure from one day to life in lieu 
of receiving a criminal sentence. Predatory 
homosexuals with some age disparity between 
the offender and victim and father-daughter 
or brother-sister incest cases have been com- 
mitted under the law. The majority of the 
committed patients, however, are pedophiles 
and assaultive rapists. 

Although all show some defect in relating 
to others, clinical observations point to dif- 
ferences in the nature, level, and quality of 
interpersonal experiences and social skills. In 
the Sing Sing study (Glueck, 1956), the au- 
thors concluded that rapists were more capa- 
ble in social interactions than pedophiles and 
went on to describe the latter group as imma- 
ture and socially inadequate. Similar state- 
ments about the pedophile can be found in 
other clinical studies of sexual offenders (Al- 
len, 1962; Bromberg, 1948; Gebhard et al., 
1965; Guttmacher, 1951; Karpman, 1954; 
Mohr, Turner, & Jerry, 1964). The authors’ 
work, however, led them to believe that the 
use of a medicolegal classification of sex of- 
fenders as a parameter in research is clinically 
and methodologically unsound. The purpose 
of this paper is to present a brief description 
of a clinically derived classification of sexual 
offenders and to report the results of a socio- 
metric study designed to test some implica- 
tions of the classification for its clinical, 
theoretical, and research utility. 
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Clinical Classification 


Rape. Descriptively, the act of rape in- 
volves both sexual and aggressive features. It 
is clear, however, that there are different rela- 
tionships between these two motives expressed 
in the sexual assault. 

In some acts of rape the intent of the act 
is primarily aggressive with sexual feelings 
minimal or absent. The sexual behavior is 
used to physically harm, to degrade, or to 
defile the victim in the service of this aggres- 
sive intent. The acts are experienced by the 
offender as the result of an “uncontrollable 
impulse” and almost always follows some 
precipitating, disagreeable event involving a 
wife, girlfriend, or mother. The victim is 
brutally assaulted, and those parts of the 
woman’s body which usually are sexually ex- 
citing frequently become the foci for the 
offender’s violence. 

Sexual excitation itself is often absent or 
only minimally present. The offender must 
masturbate to get an erection and in a large 
number of cases, he cannot reach an orgasm. 
It is readily apparent that the sexual behavior 
is not meant to gratify an intense sexual de- 
sire but rather is serving an aggressive motive. 
Since, in all such cases, the victim is unknown 
to the offender, and therefore could not her- 
self be the incentive for the anger, she is ap- 
parently the object for displaced aggression. 
Such patients therefore are classified as 
rapist-displaced-aggression type. 

In other acts of rape the aggressive fea- 
tures are less violent and frequently play such 
an inappreciable part, that should the victim 
struggle with any degree of vigor, the offender 
flees. In these acts the primary aim is clearly 
sexual and the aggression is in the service of 
gratifying the sexual desires. The offender is 
always in a state of intense sexual excitation 
and often has an orgasm in the simple pursuit 
of the victim or upon making some physical 
contact. The recurrent fantasy in such of- 
fenders is that the victims will yield, submit 
to intercourse, in which he will be especially 
virile and so pleasing to the victim that she 
will become enamoured with him and invite 
him to repeat the sexual acts. Outside of 
these episodic sexual assaults, there is no 
other antisocial behavior and in fact such 
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offenders are extremely passive and submis- 
sive. There is a pervasive, almost obsessive, 
concern with feelings of sexual inadequacy 
and the assault appears to be an effort 
compensate for or to counteract these feeling 
of impotency and inadequacy. These patients 
are classified as rapist-compensatory type. 

In a third group, the sexual and aggressive 
desires appear to coexist so that the offender 
is not able to experience, or even fantasy, 
sexual desires without a concommitant arousal- 
of aggressive thoughts and feelings. Further, 
he projects such feelings onto his victim and) 
sees her struggles and protestations as seduce 
tive—“Women like to get roughed up; they 
enjoy a good fight.” Such perceptions are 
made in the context of very brutal assaults. In 
offenders where this pattern is present, all 
object relationships, with both men and 
women, show this same quality of an erotici- 
zation of aggressive behavior. Such patients 
are very loud and assertive and those who: 
come in contact with them feel both over. 
whelmed and seduced at the same time. The: 
extreme of this position is sexual sadism 
where the woman is viciously violated andi 
murdered. Because the aggression in the sex 
ual act is so primitive, lacking any neutrali 
zation by the sexual feelings, this offender 
classified as rapist-sex-aggression-defusi 
type.? E 
In a final group of rapists neither sexual 
nor aggressive desires appear to play a signifi 
cant role. Rather, the act appears as simply 
another aspect of the offender's predatoi 
nature. The rape is frequently carried out m 
the context of some other antisocial act suc 
as robbery or theft. In such instances 
victim is unfortunately present and is sexua 
assaulted with relatively little sexual or 48° 
gressive feelings, The act is opportunistic, 
narcissistic, and impulsive, and such an 
fender has a history of predatory, antisoc 
behavior from his late childhood, pre 
lescent years. This group of sexual offender 
are classified as rapist-impulse type. 3 

Pedophilia. In pedophilia the sexual object 


2For a discussion of sex-aggression defusion S% 
Nunberg (1955, pp. 102-113). He does not addres 
himself to sexual deviancy in this discussion, Aa 
deals with the concept in terms of the psychoanaly¥® 
theory of instinctual development. 
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is the prepubertal child. In the authors’ clini- 
cal studies three groups of such offenders were 
differentiated on the bases of descriptive 
features of the act, the intentionality of the 
act and the developmental history of the of- 
fender. 

In one group of child offenders, the sexual 
interests are expressed as desires to touch, 
fondle, caress, suck, and smell the child. In 
this group the offender has never been able to 
develop or maintain mature object relation- 
ships with his male or female peers at any 
stage of his adolescent, young adult, or adult 
life. He is socially comfortable only with 
children and seeks them out as companions or 
seeks out employment where he can work or 
be near them (camp counselors, church work, 
YMCA). In most instances the child is 
known to the offender. There is nothing im- 
pulsive about the act for he begins the rela- 
tionship by a type of disarming courtship 
which engages the child and often the parents 
of the child. The sexual play occurs only 
after a period of seduction. His social behav- 
lor is best described as passive dependent. 
Because of the apparent arrest in psycho- 
Social development such patients have been 
classified as pedophile-fixated type. 

Tn another group of child offenders the sex- 
ual aim is also primarily pregenital although 
here, unlike the fixated pedophile, there is a 
history of an apparently normal adolescence 
with good peer relationships, some dating 
behavior, and heterosexual experiences. How- 
ever, throughout this period there exist in- 
creasing feelings of masculine inadequacy in 
sexual and nonsexual activities. And, as he 
enters adulthood, his social, occupational, and 
Marital adjustment is quite tenuous and 
marginal. There is frequently a history of an 
inability to deal with the normal stresses of 
„adult life and alcoholic episodes become in- 
creasingly more frequent and result in the 
breakdown of a relatively stable marital, so- 
cial, and work adjustment. In almost all in- 
stances the pedophilic acts are precipitated 
by some direct confrontation of his sexual 
adequacy by an adult female or some threat 
to his maculine image by a male peer. Most 
frequently, however, the precipitating event 
is the discovery that his wife or girlfriend is 
having an affair with another man. Although 
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a variety of different dynamics are present in 
the different offenders within this group, a 
similar consequence to the sexual rejection 
occurs which can best be described as an aim 
and an object regression. Such patients, 
therefore, are classified as pedophile-regressed 
type. 

There are two additional differences be- 
tween offenders in this group and offenders of 
the fixated type. In those instances where re- 
gression is seen to take place, the victim is 
not a familiar neighborhood child and the act 
is characteristically impulsive. The offender 
will describe driving or walking along the 
street, seeing a child and suddenly becoming 
overwhelmed by sexual excitation which he 
cannot control. Second, in almost all cases, 
the victim of the offenders in the pedophile, 
regressed group is a female child, with homo- 
sexual pedophilia extremely rare. 

In a third group of child offenders the aim 
of the relationship with the child contains 
both sexual and aggressive features, Although 
this is superficially analogous to the sex-ag- 
gression-defusion rapist, the object and the 
dynamics are quite different. The majority of 
offenders in this group select boys as the ob- 
ject for the sexual sadism, and the sexual 
component is far less object oriented. The 
primary aim is aggression, and is expressed 
in cruel and vicious assaults on the genitalia 
or by introducing the penis or elongated ob- 
jects into the victim orally or anally. The 
sexual excitement increases as an apparent 
function of the aggression, but the orgasm 
itself either does not occur or must be reached 
through masturbation. These patients are 
classified as pedophile-aggressive type. 

The clinical classification was developed 
empirically and is therefore primarily descrip- 
tive and atheoretical. However, the classifica- 
tion appears to yield expectations of differ- 
ences among the subgroups with respect to 
social adaptiveness and social skills; differ- 
ences which are obscured by the broader 
medicolegal grouping. 

The general hypothesis of the present study 
is that an analysis of interpersonal social skills 
in terms of the clinical subclasses is more 
sensitive than an analysis based upon the 
medicolegal classification. 

Further, two specific predictions are sug- 


252 


gested by the classification. As the rapist- 
displaced-aggression type and the pedophile- 
regressed type have demonstrated develop- 
mentally higher levels of social adaptation, 
and the sexual acts appear to be reactive and 
are experienced as dystonic, it would be ex- 
pected that they would show the highest lev- 
els of social skills among the sexual offenders 
—more popular, more socially active, and 
more socially sensitive. Second, since the 
pedophile-fixated type, the pedophile-aggres- 
sive type, and the rapist-impulse type appear 
to be fixated at early levels of object relation- 
ships, and the sexual acts represent character- 
istic ways of dealing with the social world, it 
would be expected that they would show few 
social skills and have little social success. 


METHOD 


The Treatment Center, Bridgewater, Massachu- 
setts, is a segregated unit within a correctional insti- 
tution and is a relatively closed social system. By 
virtue of this, a sociometric procedure appeared 
ideally suited for a study of these derivations. At 
the time of the study, there were 84 committed 
patients at the Center. The data for the research 
were obtained from 72 patients, an 85% sample, 
with the remaining 12 patients refusing to partici- 
pate. Seven patients were committed for incestuous 
acts with daughters or sisters and although their 
choices were included as part of the sociometric 
data for the scores of the other groups, this incest 
group was not included in the analyses. Of the re- 
maining 65 patients, 27 were committed for acts of 
rape: 10 were classified as rapist-displaced aggression, 
4 rapist-compensatory, 4 rapist-sex aggression de- 
fusion, and 9 rapist-impulse. Thirty-eight were com- 
mitted for sexual acts against children: 23 were 
classified as pedophile-fixated, 8 pedophile-regressed, 
and 7 pedophile-aggressive. 

All patients had been together for a period ranging 
from 6 mo. to 5 yr. The patients were seen indi- 
vidually and given a sociometric questionnaire. They 
were asked to give the names of three patients whom 
they would prefer to be with in each of four activi- 
ties and three patients whom they would not want 
to associate with in these same activities. The ac- 
tivities included group therapy, leisure time, work, 
and the dining room. In addition, they were asked 
to select three patients whom they preferred for 
patient government roles and three whom they would 
not want to have in such roles. 

For each patient seven sociometric measures were 
derived. Although these measures are not com- 
pletely independent of each other, each variable was 
felt to contribute some information not contained ìn 
the other variables. The measures include (a) posi- 
tive choice status—the number of positive choices 
received by each patient; (b) negative choice status 
—the number of negative choices received by each 
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patient; (c) net choice status—the algebraic sum of 
the first two variables; (d) total activity—the arith- 
metic sum of positive and negative choice; (e) asym- 
metrical reciprocity—the number of times a patient 
selected another patient who either omitted him 
from a positive choice or selected him as a negative 
choice; (f) symmetrical reciprocity—the number of 
times that a patient was chosen by another patient 
that he himself had chosen; and (g) index of at- 
traction—a ratio score derived from the number of 
positive choices divided by total activity. 

On preliminary analysis of the data it was found 
that all five choice situations, (group therapy, lei- 
sure time, dining room, work, and patient govern- 
ment) showed significantly high correlations with 
each other. Such findings are consonant with the 
results of studies which point to the fact that one 
individual tends to see another as a whole and either 
likes or dislikes him resulting in a uniform choice 
pattern. The finding of this “halo” effect which in- 
fluences choice on all activities led to a collapsing of 
activity categories and a consideration of the socio- 
metric variables independent of the particular social 
activity. 

The data were statistically evaluated by ¢ tests 
for differences between the two medicolegal groups 
and analyses of variances for differences among the 
various subgroups. Where, in the analyses of vari- 
ance, significant differences were found, a multiple- 
range test developed by Duncan (1955) and extended 
for use with groups of unequal size by Kramer 
(1956) and Duncan (1957) was applied to test for 
significance of differences among the subgroup means, 
This most conservative test was applied because of 
a number of methodological weaknesses in the re- 
search, which despite this corrective effort, must 
make for caution in generalization. The most im- 
portant defects include the problem of sample Tep- 
resentativeness, the small number of observations 1n 
some of the subgroups, the empirical heterogeneity 
of patients within subgroups, and the inability to 
make specific hypotheses for each variable for each 
subgroup. 


RESULTS 


The data for all variables for each of the 
clinical groups, as well as for the overa 
groups of rapists and pedophiles, are pre 
sented in Table 1. For the sake of conveni- 
ence, Table 2, which is drived from Table 1, 
presents a ranking of mean scores of i 
clinical groups, ordered on the basis ofa d 
score of 1 representing the highest level ^ 
social adaptation. The final column of Table 1 
represents the overall ranking of the e. 
groups across all variables. Since it is nO 
g carried out, 


erform 
naly- 


3In a replication study now beini 
separate analyses of the activities will be p 
to determine what, if any, contribution such à! 
ses will make. 


SocioMETRY AND THE SEX OFFENDER 253 
TABLE 1 
MEANS FOR THE SOCIOMETRIC VARIABLES 
Sociometric variables 
Clinical N Positive Negative Net Total Asymmetrical | Symmetrical Index of 
subgroup* status status status activity reciprocity reciprocity attraction 
Xx SD Xx SD Xx SD X SD x SD x SD = SD 
Pedophile 
PF 23 | 9.35 | 7.33 | 13.12 | 16.75 | —3.77 | 20.58 | 22.47 | 16.55 | 12.30 | 2.25 | 2.96 | 2.18 | 41.61 | 31.55 
PR 8 |12.62| 8.35| 5.38 | 5.08 7.24 | 11.05 | 18.00 | 9.79 | 11.00 | 1.92 | 4.14 | 1.73 | 70.11 | 26.78 
PA 7| S15| 7.93 | 15.71 | 12.09 | —10.56 | 14.57 | 20.86 | 10.06 | 13.29 | 1.80 | 1.86 | 1.77 | 24.69 | 21.30 
Combined | 38 | 9.89 | 7.38 | 12.60 | 14.46 |  —2.71 | 17.82 | 21.23 | 14.10 | 12.21 | 2.18 | 3.00 | 2.10 | 46.62 | 30.76 
Rapist 
RD 10 | 25.20 | 12.37 | 5.70 | 4.70| 19.50 | 15.09 | 30.90 | 12.52 | 9.30 | 4.59 | 5.00 | 3.35 | 81.55 | 15.39 
RC 4| 8&75| 594| 400| 425 4.75 | 7.68 | 12.75 | 2.05 | 13.75 | 1.24 | 2.00 | 1.32 | 68.63 | 31.28 
RSA 4| 600| 3.74| 8.33 | 8.39| —2.33 | 8.96 | 14.33 | 5.86 | 14.00 | .00 | 1.66 | .80 | 41.87 | 32.84 
RI 9 | 9.22 | 8.40 | 19:11 | 17.37 | —9.89 | 22.82 | 28.33 | 15.56 | 11.78 | 3.07 | 3.44 | 1.74 | 32.55 | 32.65 
Combined | 27 | 13.33 | 12.56 | 9.05 | 12.47 4.28 | 20.45 | 24.88 | 13.88 | 11.38 | 3.79 | 3.62 | 2.63 | 53.60 | 31,64 


^ Pedophiles: 
RSA = sex-aggression defusion; RI = impulse. 


clear that higher activity is necessarily asso- 
ciated with better social adaptation, Table 2 
contains no ranking for this variable. 

The data were first analyzed to evaluate 
differences between the medicolegal grouping 
of rape and pedophilia, on each variable, For 
this analysis a series of ¢ tests was per- 
formed. The results, presented in Table 3, 
show that no variable discriminated between 
the two medicolegal groups. That there is no 
clear differentiation between the medicolegal 
groups can also be noted by referring to 
Table 2. It can be seen that certain subgroups 
of rapists and pedophiles rank higher or lower 
than subgroups of the other general offense 
category. 


TABLE 2 


RANK ORDERING OF SUBGROUPS ON THE 
Basis or SOCIAL ADAPTATION 


Sociometric variables 
Clinical] =| S 
sube. Pi Eye Sof] index | Rank 
group Res Spese Net cal cal of CER " 
statuss) status | Status | reci- | rect | tion Tanks 
ity ity 
PF 3 5 5 4| 4 5 | 4 
Ee 3j ye | eet ae 
BA |) Deos | RA SR E nev ire 
RD | D EC t 
RC | 5 151 See GENIS TS NE er 
RSA -| coal oco ont zie ees nO 
RI Od eur e oo Sout EEN E 


^ Rank 1 indicates the higher level of social adaptation. 


PF = fixated; PR = regressed; PA = aggressive; Rapists: 


RD = displacement; RC = compensatory; 


A series of analyses of variance was then 
carried out to determine if the obtained mean 
differences of the subgroup samples were sta- 
tistically significant. A summary of the analy- 
ses for each sociometric variable appears in 
Table 4. For five of the seven variables 
(positive choice, net choice, asymmetrical and 
symmetrical reciprocity, and index of attrac- 
tion) the differences between the clinical sub- 
groups are significant at or beyond the .05 
level of probability. Only negative choice 
and total activity failed to discriminate 
statistically between the groups. 

In order to clarify further the obtained dif- 
ferences the multiple-range test was applied 
to the subgroup data and the results of this 
analysis are shown in Table 5. 

With respect to positive choice status, it 
can be seen that the rapist-displaced- 
aggression group is significantly differentiated 


TABLE 3 


SIGNIFICANCE OF DIFFERENCES BETWEEN 
MEDICOLEGAL GROUPS 


Rapists Pedophiles 


Sociometric variable 
x SD x 
Positive status 13.33 | 12.56 | 9.89 
Negative status 9.05 | 12.47 | 12.60 
Net status 4.28 | 20.45 | —2,71 
"Total activity 24,88 | 13.88 | 21.23 
Asymmetrical reciprocity | 11.38 | 3.79 12.21 
Symmetrical reciprocity .62| 2.63| 3.00 
Index of attraction 53,60 | 31.64 | 46.62 
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TABLE 4 


ANALYSES OF VARIANCE FOR THE VARIABLES 


Mean | Mean 
Sociometric variable IAS] Seen eee 
groups* | groups?| 

Positive status 457.7 | 66.1 | 6.92 | .005 
Negative status 266.1 | 179.7 | 1.48 | .25 
Net status 1052.2 | 317.0 | 3.31 | .01 
"Total activity 610.4 | 281.7 | 2.16 | .10 
Asymmetrical reciprocity 19.7 7.5 | 2.63 | .025 
Symmetrical reciprocity 11.1 4.8 | 2.29 | .05 
Index of attraction 2824.5 | 809.4 | 3.49 | .005 

^ df = 6. 

b df = 58, 


from all other subgroups on the basis of a 
higher positive score. The remaining sub- 
groups are statistical equivalent on this vari- 
able. For net choice status, the rapist- 
displaced-aggression subgroup is significantly 
differentiated from the rape-impulse, pedo- 
phile-aggressive, and pedophile-fixated sub- 
groups. And again all other subgroups are 
undifferentiated. On both the asymmetrical 
and symmetrical reciprocity variables the 
rapist-displaced-aggression group differs sig- 
nificantly from both the rapist-sex-aggression- 
defusion and rapist-compensatory groups. 
Furthermore, for asymmetrical reciprocity the 
rapist-displaced-aggression group also shows a 
significantly higher score than the pedophile- 
aggressive group. Finally, on the index of at- 
traction dimension, two significant discrimina- 
tions obtained. First, the rapist-displaced- 
aggression group has a higher index of attrac- 
tion than all other clinical subgroups, 
and, second, the rapist-compensatory group 
scores significantly higher than the pedophile- 
aggressive group. 
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Discussion 


The findings support the general hypothe- 
sis of the study. The analysis of sociometric 
behavior in terms of clinical subclasses is 
more sensitive than such an analysis applied 
to broad medicolegal classes. Further, the 
findings strongly support the first specific 
prediction and to a lesser degree the second, 
The rapist-displaced-aggression group is 
clearly functioning at the highest level of 
social effectiveness as compared with all other 
sexual offenders. And, also in accord with the 
first prediction, the pedophile—regressed group 
follows closely behind on all the sociometric 
scales, 

With respect to the second prediction, the 
pedophile-aggressive group consistently shows 
ineffective social functioning as expected, but 
the findings for the rapist-impulse group and 
the pedophile-fixated group are more equivo- 
cal. The rapist-impulse group is comprised of 
offenders who have a long history of a vari- 
ety of antisocial, predatory acts. Clinically, 
they are described as impulse ridden and self- 
centered with no ability to form meaningful 
object relationships and frequently diagnosed 
in classical nosology as psychopaths or socio- 
paths. And yet, according to the present find- 
ings, they have some social skills and sensi- 
tivity to an unexpected degree. Similarly, the 
the pedophile-fixated group showed greater 
social adaptability than was expected. Pa- 
tients are placed in this group when there is 
a history of a relative absence of mature ob- 
ject relations with either sex. Clinically, they 
appear as passive-dependent persons, socially 
inadequate with intense feelings of isolation 
and alienation, Despite this picture they are 
not that socially isolated and some soc! 
skills are obviously present. 


TABLE 5 
MULTIPLE Comparisons OF SUBGROUP MEANS 


Sociometric variable 


Multiple comparisons* x 
Positive status (RD) RC, RI, PF, PR, RSA 
Negative status No RAM ifferences i : 
Net status (RD, RC, PR) (RC, PR, PA, RI, PF, RSA) 
Total activity AUT No signi t differences 
Asymmetrical reciprocity (RD, RI, PR, PF, PA) (RSA, RC, RI, PR, PF, PA) 
Symmetrical reciprocity RD, RI, PF, PR) (RSA, PA, RC, RI, PF, 


Index of attraction 


grou 


3 a Any"two groups not appearing together within the 
ips appearing together within seme parentheses have nian iei da mot 


, PR) 
(RD, RG, PR) (PA, RI, RSA, PF, PR) (RC, RI, RSA, PR) 


ficantly different means (p) <S .05). Any two 


5) 
ler (p > .05) (after Duncan, 1950). 
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The rapist-compensatory group shows the 
greatest amount of variability of social ef- 
fectiveness. Although relatively socially in- 
sensitive (a low rank on both asymmetrical 
and symmetrical reciprocity) and most socially 
isolated (lowest total choice), such patients 
are least offensive or least actively disliked 
(least chosen as negative choice). This find- 
ing supports the clinical observations. Rape, 
as a compensatory act, is seen to take place 
in men who have a sense of impotence re- 
garding their impact on the world. Such an 
ego attitude is expressed in the passive and 
submissive manner in which they relate to 
others. This mode of relating helps them 
avoid direct confrontations but makes them, 
in the social world, powerless. Thus, they are 
inoffensive, but the cost is social isolation. 

It is clear that the clinically observed 
heterogeneity of sexual offenders can be tested 
by behavioral research. This is not, as it 
appears to be, stating the obvious, for with 
some rare exceptions, despite the voluminous 
literature on normal and deviant human 
sexual behavior, the published material is 
made up of clinical anecdotal reports, broad 
ranging statistical descriptive studies, narrow 
theoretical exegesses, or exhortative tracts of 
a moral or religious nature. Though there 
have been some well-executed cultural anthro- 
pological studies (Malinowski, 1929; Mead, 
1949) and a number of systematic physio- 
logical hormonal studies (Beach, 1948; Ford 
& Beach, 1951), behavioral research has been 
lacking to a notable degree. 

One final point should be made. Clinical 
classification has practical as well as theoreti- 
cal and research implications. A primary ap- 
plied application of valid classification lies in 
treatment planning and prognosis. It is quite 
clear, for example, even from the brief 
discussion of the characteristics of the groups, 
that the rapist-displaced-aggression and the 
rapist-impulse groups cannot be treated in the 
same way, and that prognosis is quite dif- 
ferent. For the latter group the sexual prob- 
lem is secondary to the general inability to 
adjust to social demands and to effect a close, 
positive, interpersonal relationship. For the 
patients so diagnosed, formal psychotherapy 
must take place in a context of an overall 
effort at socialization. And, as the antisocial 
behavior has had such an early onset (pre- 
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pubertal years) and pervades so many aspects 
of the patient’s life, the treatment will be a 
long and arduous process. In those acts of 
rape, however, where the major dynamic ap- 
pears to be a focalized displaced aggression, 
where a therapeutic alliance is relatively easy 
to establish, and where the patient has a 
large number of social and adaptive skills, 
classical therapeutic techniques can be ap- 
plied with positive results expected within a 
relatively short period. 
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PUNISHMENT OR DISTRACTION? 
OPERANT STUTTERING REVISITED 


BARBARA BIGGS anp JOSEPH SHEEHAN 1 
University of California, Los Angeles 


In a partial replication of a study by Flanagan, Goldiamond, and Azrin, six 
male stutterers enrolled in the Psychology Speech Clinic of the University of 
California, Los Angeles, were tested during successive readings of a given 
passage. Three experimental conditions were employed using a 4,000-cps tone 
at an intensity of 108 db., ISO hearing level, to determine the effect of this 
stimulus upon stuttering behavior. The conditions were (a) an aversive con- 
dition in which the presentation of the tone was contingent upon stuttering; 
(b) an escape condition in which cessation of the tone was contingent upon 
stuttering; and (c) a random condition in which the tone was presented 
independent of Ss’ disfluencies. These findings, differing from those of Flanagan 
et al., were that stuttering decreased in all conditions. The results are attributed 


primarily to distraction effects. 


In 1958, Flanagan, Goldiamond, and Azrin 
conducted an experiment in “operant stutter- 
ing” with three Ss. They employed a 1-sec. 
blast of a 6,000-cps tone at an intensity level 
of 105 db. (Flanagan, 1966) as a noxious 
stimulus. They found that stuttering increased 
in frequency when a 5-sec. cessation of a 
continual tone was made contingent upon 
stuttering. It decreased when presentation of 
the stimulus was contingent upon stuttering. 

This result is somewhat at variance with 
those of other studies reported in the litera- 
ture concerning both the theoretical and 
actual effects of punishment and reward on 
verbal nonfluency. These studies detail 
varied individual reactions depending upon 
both the strength and type of punishment 
employed and the characteristics of Ss them- 
selves (Frick, 1951; Hill, 1954; Johnson, 
1958; Shames & Sherrick, 1963; Sheehan, 
1951, 1953, 1958; Sheehan, Cortese, & 
Hadley, 1962; Sheehan & Voas, 1957; Siegel 
& Martin, 1965a, 1965b; Skinner, 1957; 
Stassi, 1961; Van Riper, 1937). 

Since this widely quoted study employed 
only a few Ss, and since its authors did not 
consider the possibility that their results 


1 Requests for reprints should be sent to Joseph 
Sheehan, Department of Psychology, University of 
California, Los Angeles 24, California. 

2 Audiometric zero as described by American Stan- 
dards Association, 1951, Specifications for pure tone 
audiometers (B. Flanagan, personal communication, 
1966). 
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might be due to distraction effects (cited by 
Bloodstein—1950—as one of the two con- 
ditions responsible for almost “all of the 
reductions in stuttering recorded in this 
study”), the present study was undertaken 
as a partial replication, with an increase in 
the number of Ss and certain modifications 
in design. 
METHOD 
Design 


Three design changes were undertaken. First there 
were questions as to the reliability of the stimulus 
employed by Flanagan, Goldiamond, and Azrin. 
Since it is common for males with otherwise normal 
hearing to have some loss of acuity at high fre- 
quencies, there was concern about the possible neces- 
sity of eliminating potential Ss if losses were found 
at 6,000 cps. Informal judgments made by Æ and 
others also led to questions about the aversive prop- 
erties of the tone. For these reasons, a pretest was 
run on normally fluent speakers. As a result, the 
stimulus selected for the present experiment was 2 
4,000-cps tone presented at an intensity level of 108 
db., ISO 3 hearing level. 

Second, it has been well established that the spe 
cific material chosen for reading has a decided effect 
on stuttering behavior (Sheehan, 1958; ur 
1966). Therefore Ss in the present study were aske 
to read the same passage throughout a given experi 
mental session, rather than different passages whi 
could cause fluctuations in performance that wo! 
be difficult to analyze. 1 

The experimental design was modified by adding 
a third condition—noncontingent, random prn 
tion of the tone. This specifically deals with 4 


3 International Standards Organization, 1964, speci- 
fications for pure tone audiometers. 
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distraction variable; the tone does not “reward” 
the stutterer by stopping at each disfluency; neither 
does it appear each time he stutters to “punish” him. 

Statistically, an incomplete balanced block design 
was employed. Order effects were controlled by the 
random assignment of Ss to a schedule of experi- 
mental sessions involving every order and sequence 
of the three conditions investigated. 


Subjects 


The Ss were six male stutterers currently enrolled 
in the Psychology Speech Clinic at the University of 
California, Los Angeles. Their ages ranged from 18 
to 28 yr., an average age of 24. Participation in the 
study was voluntary. 


Apparatus 


Ambient noise in the experimental room was moni- 
tored with a General Radio Model 1551-B sound 
level meter. The sound stimulus was produced by 
an Allison 22 audiometer, calibrated with an Allison 
Model 3A Calibration Unit. The Ss received the tone 
through TDH-39 earphones in MX 41/AR cushions. 
The same equipment was used for the hearing tests. 
Frequency of stuttering was recorded by means of 
a Linemaster Model 491-SC foot pedal switch which 
activated a General Controls Model CE 600 AS 602, 
115 V AC counter. A Rustrak four-channel event 
recorder was employed for this purpose in some 
sessions. An Olympic stopwatch was used to measure 
time intervals, 


Procedure 


Preliminary tests were run on 18 nonstuttering Ss 
with normal hearing to determine the specific fre- 
quency to be used as an aversive stimulus. Pure tone 
threshold tests indicated that 3 out of 4 male Ss 
had reduced acuity in both ears at 6,000 cps ranging 
from 5 to 15 db. At 4,000 cps only 1 male S had a 
15-db. unilateral loss. Discomfort levels were estab- 
lished for each of three frequencies; 3,000, 4,000, 
and 6,000 cps, These ranged from 75 db. to 108 db., 
an average of about 100 db. Two Ss were dropped 
at this point because no discomfort level could be 
established within the maximum output of the audi- 
ometer, The three frequencies were arranged in their 
six possible pairings, and presented to each S in 
random order so he would judge each pair twice. 
At intensities above the discomfort threshold, he was 
asked to state whether the first or second tone in a 
given pair was more comfortable. Four Ss’ responses 
Were patterned so that either the first or second 
stimulus presented was always the one chosen. The 
remaining 12 Ss consistently selected the 4,000-cps 
^ tone as being the more aversive (the other two tones 
Were chosen only six times in 140 trials). 

The experiment was conducted in a two-room 
Suite. The Ss could see neither E nor the equipment. 
The ambient noise level, measured before each ses- 
Sion, varied from 35 to 43 db., sound-pressure level, 
averaged 40 db. 
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Potential Ss were screened to eliminate those with 
hearing losses or with thresholds of discomfort above 
108 db. at 4,000 cps (maximum output of the audi- 
ometer at this frequency). A 3-min. tone decay 
test was also performed to guard against fading of 
the stimulus tone during the escape procedure. Two 
volunteers were rejected because of bilateral hearing 
losses. 

Each S was seen on three different occasions, with 
a minimum of 3 days between experimental sessions. 
The written instructions indicated that he was to 
read the assigned passage aloud in a natural manner, 
and informed him that he would hear loud tones 
through the earphones during portions of the reading. 

Headphones were worn throughout each session, 
The reading material consisted of three passages taken 
from Einstein’s (1949) The World As I See It. 

The base-line period varied in length from 6 to 25 
min., 15 being about the average. When a stable 
base rate of stuttering frequency was established, 
after an initial period of adaptation, the conditioning 
period was initiated. During this 20-min. interval S 
was presented with one of the following: (a) tone 
contingent upon stuttering (aversive condition) ; (b) 
cessation of a continuous tone contingent upon stut- 
tering (escape condition); (c) random presentation 
of tone (random condition). 

In the extinction period S continued to read for 
another 20 min. during which time no further stim- 
uli were presented. Throughout the base-line period, 
E recorded the rate of stuttering by pressing a foot 
pedal switch which operated an electric counter. The 
S's score was the number of stuttered words per 
passage read, following one of the procedures used 
by Flanagan et al. (1958). During the aversive con- 
ditioning period, the manual switch on the audi- 
ometer was pressed simultaneously, delivering a 1-sec. 
tone burst each time S stuttered. In the escape 
condition, an interrupter switch was used so the 
continuous tone stopped for 5 sec. each time a dis- 
fluency occurred. If S stuttered during this silent 
period, resumption of the tone was postponed for 
another 5 sec. In order to permit a comparison be- 
tween the aversive and random conditions, schedules 
for all possible rates of stuttering were constructed 
randomly (Hodgman, 1945). The base rate of stutter- 
ing was calculated for a 5-min. interval at the end 
of the preconditioning period, and stimuli in the 
conditioning period were delivered according to the 
appropriate schedule. Since it was necessary for E 
to employ a stopwatch to time the scheduled inter- 
vals between stimulus presentations, frequency counts 
were made by one of two other observers, both 
members of the Psychology Speech Clinic Staff. The 
observers also performed this function during other 
sessions. They were not informed as to the nature 
of any given session, nor when their judgments 
would actually be used for scoring. When a four- 
channel event recorder was made available to E, it 
was possible to read correlations of judgments from 
the tapes—.88 in the case of one observer and .92 in 
the case of the other. This is sufficiently high that 
it may be assumed that scores derived from the 
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observers’ frequency counts were essentially the same 
as those derived from those of E, that E's judgments 
during the experiment were accurate, snd the appro- 
priate stimuli were delievered to Ss. 

At the end of the third session each S was given 
a questionnaire in which he was asked to state his 
impressions of what the experiment might have been 
investigating, whether the stimulus was related to his 
stuttering, and what adjectives he would use to 
describe the stimulus (Table 4). 


RESULTS 


The essential finding of this study was that 
stuttering decreased in all conditions. The 
adaptation curves obtained on Ss (Figures 1 
and 2 are typical) are all similar, regardless of 
the imposed experimental condition. This sug- 
gests that there is no treatment effect, which 
is exactly what an analysis of variance of the 
data proves (Table 1). This analysis was 
derived from a ratio of the main frequency 
of stuttering during the conditioning or ex- 
tinction period divided by the average of the 
last two scores in the base-line period. A 
square-root transformation was applied to 
correct for nonhomogeneity of variance. Dif- 
ferences among experimental conditions fell 
short of significance (p = .10). 

In the computation of the sums of squares, 
the sum of means for the aversive condition 
was 20.6, as compared with 17.6 for both 
the random and escape conditions. This indi- 
cates the direction of the trend: the random 


TABLE 1 
ANALYSIS OF VARIANCE OF TRANSFORMED 
Raw Data 
Source af MS f 
Condition 2 
Conditioning 4797 4.096* 
E Extinction -2009 .055 
s 
Conditioning 28.9601 247.311** 
Extinction 25.8515 69.209** 
Order 
Conditioning .1860 1.588 
Extinction 1452 040 
Error 
Conditioning 1171 
Extinction .3663 


Note,— The scores for this analysis were derived from a 
ratio of the mean asiy of stuttering during the condition- 


ing or extinction average line frequency. A square- 
root transformation was applied to correct for oa beeeuelty. 
of ree 

b<. 


BARBARA BIGGS AND JOSEPH SHEEHAN 


and escape procedures resulted in slightly 
more response decrement than the punishment 
procedure. 

To minimize the magnitude of S dif- 
ferences, a second analysis was carried out 
(Table 2). Scores were derived from the 
ratio (x= b — f/b, where b refers to the 
base-line average, and f to the mean num- 
ber of stutterings in the conditioning or 
extinction period. This procedure reduced the 
f value for Ss from 247.311 to 7.220. How- 
ever, it also reduced the significance of the 
effect of experimental conditions to the .25 
level. The order in which the conditions 
were presented had no significant effect on 
performance. 


TABLE 2 
ANALYSIS OF VARIANCE OF Ratio DATA 
Source df MS Í 
Condition 2 
Conditioning .0395 2.820* 
Extinction .0162 ATA 
s 5 
Conditioning .1012 7.220588 
Extinction .0965 2.822* 
Order 2 
Conditioning .0020 142 
Extinction .0057 A67 
Error 8 
Conditioning .0140 
Extinction .0342 


Note.—The scores for this analysis were derived from & 


3 n f 
ratio: average base-line frequency minus the mean frequency 0% 
giten uring the conditioning or extinction period/base 
ie. S 
*p <.25. 
+9 <.. 
“+p <0. 


Table 3 shows the mean frequency of stut- 
tering for all Ss in all conditions. When 
cumulative frequency graphs were plotted (as 
in Figures 1 and 2), suppression of response 
was notable in all conditions in all Ss with 
the exception of a rise in stuttering frequency 
in the aversive condition for S 3 (Figure 2). 
Curves for the random and aversive condi- 
tions were generally parallel, although 1 
sponse suppression was greater in the random 
condition for Ss 5 and 6. The rate of stuttering 
continued to drop during the extinction d 
except in three instances: S 2 following t 
random period, S 4 following the escap? 
period, and S 5 following the aversive stimu 
lus presentation. S 1 stuttered so severely 
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it was rare for him to read for any 5-sec. 
interval without stuttering. Therefore, he 
experienced the escape condition as an occa- 
sional presentation of brief tones, and gen- 
eralizations about the escape period are made 
only with reference to the other five Ss. 
The cumulative frequency curves derived 
for the aversive condition appear similar to 
those presented by Flanagan et al. (1958). 
However, the curves for the escape periods 
in the two studies differ. In Figures 1 and 2 


TABLE 3 


MEAN FREQUENCY OF STUTTERING DURING BASE LINE, 
CONDITIONING, AND EXTINCTION PERIODS 


Condition 
Periods 
Aversives | Escapeb | Randome 
$1 
Base-rate average 111.50 95.00 87.00 
Conditioning mean | 104.00 78.33 83.67 
ess mean 90.00 60.33 72.67 
Base-rate average 8.00 5.00 10.50 
Conditioning mean 4.38 2.33 4.67 
eer mean 2.00 1.75 6.25 
Base-rate average 6.50 8.00 9.00 
Conditioning mean 7.89 3.88 3.09 
paren mean 4.00 322 2.67 
Base-rate average 2.50 3.00 3.50 
Conditioning mean 1.10 .89 145 
Pitinctiori mean 1.00 .67 1.60 
Base-rate average 7.00 6.50 5.50 
Conditioning mean 4.50 4.00 2.89 
pacnetion mean 4.70 2.00 122 
Base-rate average 9.00 10.50 8.00 
Conditioning mean 5.57 5.71 3.00 
Extinction mean. 1.43 3.83 1.00 


ts Tone contingent upon stuttering response, 
Cessation of tone contingent upon stuttering response. 
* Tone independent of stuttering response. 


it is apparent that the stuttering response 
is suppressed. This was true of all Ss in this 
condition (Table 3). The opposite result is 
evident in Flanagan, Goldiamond, and Azrin’s 
(1958, Figure 1, p. 174) curves for S 1 and 
S 2. Their graph for S 3 is difficult to inter- 
pret, and it is not really clear whether the 
mean number of disfluencies for the condi- 
tioning period would be greater than a base- 
line average. Figure 1, in the present study, 
shows a curve for the aversive condition 
which is similar in shape to the Flanagan, 
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ee aversive condition 
»*x escape condition 
60 Le», random condition 


WORDS STUTTERED 


20 25 


o 5 lo 15 
PASSAGES READ 


Fic. 1. Cumulative number of words stuttered 
during successive readings of the same passage—S 4, 
(Vertical lines indicate the boundaries of the condi- 
tioning period.) 


Goldiamond, and Azrin graph for S 3, but the 
rise from base-line average only amounts to 
a difference of 1.4 disfluencies per passage. 

Adaptation occurred as previously de- 
scribed in the literature, the greatest decre- 
ment usually occurring during the second 
reading, approaching an asymptote by the end 
of the third or fourth (Cullihan, 1963; 
Sheehan, 1951; Trotter, 1955). Two excep- 
tions to this pattern occur in the base-line 
period for the random condition. Ss 1 and 6 
both increased their stuttering initially to 
approach the base-line average of the other 
two conditions. 

However, with two exceptions, the aversive 
condition for S 3 (Figure 2) and the random 
condition for S 4 (Figure 1), the introduction 
of the stimulus always resulted in an im- 
mediate drop in stuttering frequency, al- 
though the number of passages read during 
the base-line period varied from four to seven. 

The Ss’ ratings of the stimulus tone appear 
in Table 4. Although a 5-point scale was pre- 
sented to Ss, no adjectives were rated with 
a 4 (very) or 5 (extremely). The differences 
between ratings are not large enough to indi- 
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PASSAGES READ 


Fic. 2. Cumulative number of words stuttered 
during successive readings of the same passage—S 3. 
(Vertical lines indicate the boundaries of the condi- 
tioning period.) 


cate more than a tendency to consider the 
stimulus as being more uncomfortable and 
distracting than painful or noxious. Only two 
of the six Ss indicated in the questionnaire 
that they had correctly interpreted the nature 
of the experiment and the relation of the 
stimulus to their disfluencies. Two Ss indi- 
cated that they began to ignore the tone after 
a time which raises the question of Ss adapt- 
ing to the stimulus. 


TABLE 4 
SUBJECTS’ RATINGS OF THE STIMULUS 
Subjects 
Adjective 
1-|*2-] xET ET [oS 762 o M: 
Painful 0|1|1/2/|0]|0] .66 
Noxious 0|2|[2|2]|0]|0]|1.00 
Distracting 2792492: We [02*| 30246 
Irritating 2|3|2|2|1]|2]|200 
Uncomfortable 3|]3|2|2]|0/|3]|216 
Annoying uae aire: le eel) 2h 12:7] 183 
Punishing 0|2|1/|2/|0]|2]|116 
Unpleasant 2|2|2|2]|2|2]|200 
.—All S: ked : 
1 Moe 2 a somewhat, 5 Eure biu Py 


tremely. 
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Discussion 


The findings in the present study were 
similar to those of Flanagan, Goldiamond, 
and Azrin (1958) when a punishment pro- 
cedure was employed, but differences were ap- 
parent in the results obtained from the use 
of an escape procedure. It is difficult to assess 
the magnitude of these differences since 
Flanagan et al. (1958) do not report numeri- 
cal data. Assuming that the discrepancies are 
significant, they may be explained in terms 
of individual differences among Ss and pro- 
cedural differences. 

Since the effect of reading material was not 
controlled by Flanagan, Goldiamond, and 
Azrin, it cannot be analyzed. This leaves open 
to question the possibility that the pages read 
during periods of increased disfluency were 
more difficult phonetically, grammatically, or 
emotionally. The curves for two of the three 
Ss in this condition (Flanagan et al., 1958, 
p. 174) are not as smooth as the four other 
curves presented, an observation which might 
support this possibility. Order was not a sig- 
nificant factor in the present study, but the 
time period between trials was never less than 
3 days. Flanagan, Goldiamond, and Azrin 
(1958) ran their Ss on consecutive days, 50 
there may have been some carry-over from 
one trial to the next. Their S 2 and S 3 
(Flanagan et al., 1958, p. 174), who experi- 
enced the escape condition in the second trial, 
show less obvious response increment than 
S 1, who encountered this condition in bis 
initial trial. 

The statistical analyses of data from the 
present study indicated that the three operant 
conditioning techniques employed had essen- 
tially the same effect: all resulted in suppres: 
sion of the stuttering response with no signi" 
cant difference in the amount of suppression: 
The only trend which could be noted indicates 
that the contingent presentation of the aver 
sive stimulus resulted in less suppression 9 
the stuttering response than noncontingent, or 
random, presentation. Since conservative St 
tistical procedures were employed, this We. 
might manifest itself more strongly if a larg? 
number of Ss were studied. f 

Before the study was undertaken, it p: 
peared that the 4,000-cycle tone at 108 9? 
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would be effective as a noxious stimulus, since 
there was reason to believe that it was more 
aversive than the tone employed by Flanagan, 
Goldiamond, and Azrin. First, Ss in the 
pretest had judged it to be so. Second, al- 
though all of the stutterers in the experiment 
had “normal” hearing, five of the six had a 
drop in acuity of 5 to 30 db. between 4,000 
and 6,000 cps, a factor which would certainly 
have reduced the experienced discomfort 
had the 6,000-cycle tone been employed. 
Flanagan, Goldiamond, and Azrin did not 
test the hearing of their Ss before their ex- 
periment. Third, there was a difference of 
20 db. in the noise levels present in the 
experimental rooms. Hirsch (1952) reports 
that when the effect of earphone cushions is 
taken into account (cutting out about 20 db. 
of the external noise), an ambient noise level 
of 40 or 50 db. may be allowed “without 
anticipating measurable effects on the abso- 
lute thresholds [p. 166].” The noise level 
reported here never exceeded 43 db., so it 
may be assumed that the stimulus was re- 
ceived at the full 108-db. intensity level. 
Flanagan, Goldiamond, and Azrin (1958) re- 
port the presence of a constant 60-db. noise. 
This could have masked their stimulus tone 
so that its effective level was 10 db. less than 
the 105 db. reported. 

Despite this, the stimulus employed always 
had the same effect on behavior, whether its 
application or removal was contingent on 
stuttering or whether its presentation was 
random. In all conditions stuttering frequency 
was reduced. Thus it is apparent that there 
was, in fact, only one experimental condition: 
a 4,000-cps tone at 108 db. whose introduc- 
tion into the environment was followed, 
almost invariably, by a drop in the estab- 
lished rate of stuttering. Since cessation of 
the tone contingent upon stuttering did 
not increase the frequency of nonfluent be- 
havior, the stimulus was not functionally 
aversive although it was rated as subjectively 
unpleasant. 3 

If the word “punishment” is used in a 
broader, non-Skinnerian context, it is pos- 
sible to interpret the results as an indication 
that a mild punishment may be effective in 
reducing symptomatic behavior. In future 
experimentation it might be desirable to in- 
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clude a control group receiving a neutral tone 
for the supposedly noxious tone, to separate 
more clearly the roles of distraction and 
punishment. However, previous studies by 
Barber (1939), Shane (1955), and Cherry 
and Sayers (1956) provide ample evidence 
that the introduction of a novel auditory 
stimulus does in itself provide sufficient dis- 
traction to reduce stuttering behavior even 
though the stimulus is not aversive. The 
present findings are clearly in line with these 
previous studies but not in line with those 
of Flanagan, Goldiamond, and Azrin. 

Since the stimulus employed in this ex- 
panded replication was independently rated 
as only mildly aversive, and did not produce 
contingent effects, the parsimonious explana- 
tion appears to be that the tone operated in 
general as a distraction device. 
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SELF-DIRECTED PROGRAM FOR WEIGHT CONTROL: 
A PILOT STUDY 


MARY B. HARRIS 1 
Stanford University 


A treatment program was designed to enable Ss to lose weight through the 
use of self-monitored techniques for changing their eating behaviors. All Ss 
who participated in the program achieved a stable loss in weight, and their 
mean loss was significantly greater than the change shown by a group of 
similarly motivated control Ss. No additional effects due to a few sessions of 
aversive counterconditioning were demonstrated, and no general mood changes 
accompanied the weight loss. The Ss did report a decreased temptation to 
overeat. It was suggested that similar programs of gradual habit change 
through self-control of stimulus conditions and reinforcement contingencies 
might be applied to the treatment of other addictive behaviors, which are also 


very refractory to change. 


Attempts to deal with the undesirable be- 
havior of overeating have been spectacularly 
unsuccessful. Although a great number of 
causal agents or correlates have been postu- 
lated, such as depression (Simon, 1963), anx- 
iety (Cauffman & Pauley, 1961), power ori- 
entation (Suczek, 1957), a variety of other 
personality problems (Bruch, 1957; Kaplan 
& Kaplan, 1957, who list 28 suggested mean- 
ings of obesity; Shipman & Plesset, 1963), 
insufficient exercise (Mayer, 1955), pres- 
ence of night-eating syndrome (Stunkard, 
19592), lack of correlation between report of 
hunger and gastric motility  (Stunkard, 
1959b), obesity of parents (Cappon, 1958), 
more dependence on external stimuli such as 
flavors and time of day to regulate hunger 
(Schachter, 1967), and no doubt many others, 
no clear causal relationships have been sub- 
stantially detailed. An equally large variety 
of treatments have been tried, ranging from 
Such monacademic means as low-calorie 
products, exercise salons, yoga, and reducing 
clubs to hypnosis (Erickson, 1960), dietary 
instruction (Young, Moore, Berresford, Ein- 
Set, & Waldner, 1955), appetite depressants 
and other drugs (Silverstone & Solomon, 
1965), general medical advice (Stunkard, 


1 The author wishes to express her appreciation to 
Albert Bandura for his generous assistance with all 
Phases of the study. Request for reprints should be 
to the author, Department of Educational Founda- 
tions, University of New Mexico, Albuquerque, New 
Mexico 87106. 


1958), psychoanalysis and other forms of 
psychotherapy (Bruch, 1957), group discus- 
sion of physical and emotional factors 
(Harmon, Purkonen, & Rasmussen, 1958), 
aversion-relief therapy (Thorpe, Schmidt, 
Brown, & Castell, 1964), recording all eating 
(Ferster, Nurnberger, & Levitt, 1962; Stollak, 
1966), self-control (Ferster et al., 1962; 
Goldiamond, 1965), aversive countercondi- 
tioning with shock (Stollak, 1966; Wolpe, 
1958), operant conditioning with shock 
(Meyer & Crisp, 1964), and aversive counter- 
conditioning with nausea (Cautela, 1966). 
In general, the reported attempts to treat 
overweight people fall into two categories: 
case histories of techniques which proved 
successful with one or a very few patients 
(Cautela, 1966; Erickson, 1960; Ferster et 
al., 1962; Goldiamond, 1965; Meyer & Crisp, 
1964; Thorpe et al., 1964; Wolpe, 1958) and 
survey studies of patients in a medical setting, 
most of which have reported a general lack 
of success in effecting any long-range weight 
reduction (Franklin & Rynearson, 1960; 
Harmon et al., 1958; Shipman & Plesset, 
1963; Silverstone & Solomon, 1965; Young et 
al., 1955). An experimental study by Stollak, 
using aversive counterconditioning therapy 
with electric shock and adequate control 
groups, reported the same poor results (a 
mean net loss of less than 5 Ib.). The inade- 
quacy of treatment has been nicely sum- 
marized by Stunkard (1958)—“most obese 
persons will not stay in treatment for obesity. 
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Of those who stay in treatment, most will not 
lose weight and of those who do lose weight, 
most will regain it [p. 79]." The evidence 
to disconfirm this description has not yet 
appeared, 

An examination of the contingencies gov- 
erning addictive behavior in general and 
overeating in particular shows several reasons 
why this type of behavior should be so re- 
sistant to change. Addictive behaviors such as 
overeating provide immediate positive rein- 
forcement for the individual, while the rein- 
forcement for refraining from eating is usu- 
ally extremely delayed. Moreover, the aversive 
consequences of overeating are typically de- 
layed for weeks or even years. As Eysenck 
(1960) has pointed out, therapies based on 
pairing aversive reinforcement with the per- 
formance of the undesirable behavior are 
likely to be ineffective in the long run, be- 
cause the fear of the negative consequences 
will tend to be extinguished when the behav- 
ior is performed without the negative rein- 
forcement following. In addition, there are two 
other variables which make overeating be- 
havior particularly resistant to alteration. 
Both Ferster et al. (1962) and Goldiamond 
(1965) have pointed out that eating behavior 
occurs in a very wide range of situations and 
is under the control of many stimuli other 
than those physiological ones causing hunger. 
A second factor, one unique to overeating 
among the addictions, is that one cannot be 
delivered from temptation. Drugs, cigarettes, 
sexual perversions, and alcohol can all be 
given up completely and the stimuli associ- 
ated with them can be avoided; however, 
everyone must eat at least two or three times 
a day, and it is impossible to avoid exposure 
to situations and performance of behaviors 
one wishes to avoid. Thus the control of eat- 
ing shares elements with the control of other 
undesirable appetitive behaviors but is further 
complicated by the facts that the stimuli for 
eating are ubiquitous and that some eating 
behaviors must be performed several times a 
day. 

The study to be discussed attempted to use 
some of the specific procedures and methods 
of analysis which have been found useful in 
controlling both overeating and other undesir- 
able behaviors. Many of these techniques, 
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particularly those of Ferster et al. (1962) 
and Goldiamond (1965) place a very strong 
emphasis on the development of self-control 
through altering the stimulus conditions un- 
der which the behavior occurs and generating 
self-produced consequences for the behavior, 
A second major influence was the use of 
aversive conditioning, both operant (Meyer 
& Crisp, 1964) and classical (Stollak, 1966; 
Wolpe, 1958), particularly the nausea condi- 
tioning of Cautela (1966). For the present 
study, nausea has several advantages over 
electric shock. It is completely under the indi- 
vidual’s control, can be used in any situation, 
and is a response directly antagonistic to eat- 
ing. In addition, a group setting was used in 
this research, primarily because of conveni- 
ence, but also to take advantage of any slight 
increase in motivation which group discussion 
and support might generate. 

Because long-term control of eating has 
been so refractory to change, it was felt that 
there was little to be gained by employing an 
attention-placebo control group in addition to 
a no-treatment control group. For the same 
reason it was decided to study the efficacy of 
a treatment approach combining several dif- 
ferent components rather than comparing dif- 
ferent techniques. The study was thus de- 
signed as a broad program to produce change 
by use of all the procedures which might as- 
sist an individual in attaining control of his 
own eating behavior by gradually approxi- 
mating the eating pattern he eventually 
wanted to maintain. Drugs, lists of forbidden 
foods, or rigid rules of only three meals per 
day were not used, as they were felt to be 
useless in the maintenance of permanent hab- 
its. Although the procedures used were often 
geared to the individual’s specific problems 
with eating, the type of approach should 
prove amenable to extension to dealing with 
other types of addictive behaviors. 


METHOD 


Subjects. The Ss for this study were both men a 
women who had answered an advertisement In asl 
Stanford University newspaper, who were at le 
15 Ib. overweight, and who were not being me Two 
treated for overweight or any serious illness. the 
individuals who spontaneously began to ded 
psychiatric problems in a 10-min. interview Wit op 
were not included in the study. From the pom 
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satisfactory Ss, two sections of three men and five 
women each who had the same available time for 
meetings were selected, along with a random group 
of three men and five women from the remaining 
potential Ss to serve as a control group. 

Control group. The members of the control group 
were truthfully told that the experimental groups 
would not be meeting during their available hours. 
They were then asked if they would attempt to lose 
weight on their own and if they would consent to 
be reweighed at the conclusion of the experiment. 
All Ss agreed. They were then weighed without shoes 
on a commercial bathroom scale, to the nearest half 
pound, given a calorie chart, and reminded that the 
only way to lose weight was to change their eating 
habits permanently. 

Treatment procedure. The two experimental sec- 
tions of eight Ss each met with E twice a week for 
approximately 2 mo. until the end of the school 
quarter. At the beginning of every meeting Ss were 
weighed individually without shoes on a commercial 
bathroom scale to the nearest half pound. The Ss 
were all given calorie charts and asked to keep a 
record of their normal eating habits for 1 wk. and 
of their daily intake, including place and time, for 
the next several months. At the first meeting and 
approximately every 3 wk. thereafter Ss completed 
a brief questionnaire consisting of eight rating scales. 
Four of the scales concerned their levels of tension 
and depression both generally and with specific 
reference to eating; three scales concerned tempta- 
tions to eat rich foods, large meals, and snacks; and 
one concerned time thinking about food. One purpose 
of this questionnaire was to ascertain whether any 
mood changes would accompany or result from the 
process of weight reduction, as is suggested by such 
theories as “obesity as a depressive equivalent” 
(Simon, 1963). Another purpose of the rating scale 
was to see whether changes in the temptation to eat 
would accompany changes in eating behavior. 

Three very general aspects of the program were 
discussed thoroughly with Ss. It was stressed that 
the program was designed to enable each individual 
to develop permanent eating habits he could main- 
tain indefinitely. For this reason no arbitrary re- 
strictions were placed on diet, and the individual 
was urged to consider what type of permanent eating 
pattern he would like to maintain. Along with the 
emphasis on establishing permanent habits there was 
an emphasis on awareness of what one was eating, 
especially caloric content and food value, and of the 
reasons for eating and the situation in which the 
eating occurred. A third general aspect of the treat- 
ment program was its emphasis on making only 
gradual changes. It was felt that making drastic 
changes, although perhaps producing greater im- 
mediate loss, would make it too difficult to maintain 
the changes indefinitely. The Ss were urged to make 
only small changes which they were sure they could 
maintain and which were in the direction of the 
permanent eating pattern they desired to establish. 

Techniques. Three general types of techniques for 
approaching their goal of altered eating habits were 


recommended to Ss during the first 2 mo. of the 
program. First, Ss were told about the concepts of 
positive and negative reinforcement, and it was 
explained to them how eating behavior was con- 
trolled by immediate positive reinforcement and only 
very delayed negative reinforcement. All Ss were 
asked to consider the positive outcomes maintaining 
their eating and to hand in a list of the aversive 
consequences causing their desire to lose weight. In 
group discussions suggestions were made as to 
methods useful for making reward for refraining 
from eating more immediate and for making aver- 
sive consequences come immediately after begin- 
ning to eat; among the suggestions were the reciting 
of the list of reasons for losing weight when tempted 
to overeat, the viewing of an unattractive picture of 
oneself in a bathing suit when tempted, self-re- 
wards of money, movies, clothes, etc. for every day 
or week of good eating habits or for every pound 
lost. It was emphasized that food should not be used 
as a reward. 

The second group of techniques concerned the 
stimulus control of eating behavior and attempted 
to limit the situations which were associated with 
eating. The Ss were asked to analyze the stimuli 
governing their eating behavior and were given 
suggestions about limiting the number of situations 
in which they eat and of stimuli connected with 
eating. Among these suggestions were many concern- 
ing limitations of place and time of eating—for ex- 
ample, try to eat sitting in only one chair at the 
same table or in as few places as possible; never eat 
while standing or in the office, bedroom, living room, 
etc.; try to eat meals and planned snacks only at 
certain definite times during the day. Other sug- 
gestions concerned the limitation of the availability 
of other reinforcements for eating and of tempting 
foods—for example, don’t eat while reading, watch- 
ing television, studying, etc.; try to avoid walking 
past candy machines, snack bars, etc. The Ss were 
also informed of some Schachter’s research (Schach- 
ter, 1967) demonstrating the control of eating be- 
havior by external rather than internal stimuli in 
overweight people. 

The third general group of suggestions dealt with 
the actual behaviors involved in eating. It was 
pointed out that many overweight people, particu- 
larly people on diets, eat much more rapidly than 
people of normal weight, and that it takes at least 
15 min. after beginning to eat before one begins to 
feel the effects of the food. It was thus suggested 
that Ss try to break up the chain of eating by not 
putting any more food on the fork until they had 
finished chewing and swallowing the last bite, that 
they chew their food very slowly and with great 
attention to taste and texture like a gourmet, that 
they practice taking short breaks of not eating 
during a meal, and that they begin to try out all of 
these suggestions only near the end of a meal when 
they are not so hungry that these behaviors would 
become aversive. It was also recommended that 
they attempt to practice leaving a little bit of every 
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food on their plate, rather than automatically eating 
it all. 

Throughout the presentation and discussion of all 
these techniques for controlling eating habits, it was 
emphasized that these were all methods for self- 
control of these behaviors, Although each technique 
should prove to be useful in some cases, it was 
recognized that certain techniques would be differen- 
tially useful for certain individuals; for this reason, 
each S was asked to try out all techniques which he 
felt might be appropriate for regulating his own be- 
haviors and to try to commit himself to one small 
change at a time in the direction of his permanent 
eating habits. After the above meetings were com- 
pleted, so that Ss had learned principles and tech- 
niques of reinforcement and self-control of eating 
behaviors, they were given a paper summarizing 
these principles and techniques. A list of low calorie 
foods and beverages suggested for cooking and eating 
was also given to them. 

Two additional aspects of the treatment procedure 
were the discussion of nutrition and training in 
relaxation. A nutritionist spoke to Ss about diet in 
general, answered many questions about the nutri- 
tional and caloric values of various foods, and made 
brief verbal and written comments to Ss about the 
nutritional adequacy of their individual eating rec- 
ords. The Ss were also given training in relaxation 
exercises by listening to a tape and reading a manual 
designed by E to accompany the tape. The relaxa- 
tion exercises were presented as an alternative to 
eating in dealing with tension, depression, and bore- 
‘dom. 

Differential treatment of subgroups. After a period 
of approximately 23 mo. and following a change of 
schedule for those Ss who were university students, 
each of the two earlier experimental sections was 
subdivided into an aversive conditioning subgroup 
and a continuation subgroup. As two participants 
had withdrawn from the program, only the remain- 
ing 14 Ss were subdivided. The division was made 
into two subgroups such that each had 4 from one 
original section and 3 from the other, each had 
three males and four females, three good friends 
were put in one subgroup, each had about the same 
number of those who showed regular attendance in 
earlier meetings, and each had approximately the 
same mean weight loss for men and for women up to 
that time. A random selection was then made as to 
which of the two subgroups would be designated 
the continuation subgroup and which the aversive 
counterconditioning subgroup. Two hours per week 
were chosen at times when the continuation Ss were 
free, and these Ss were told that they could attend 
group meetings at one or both of these times. At 
these meetings, discussion centered around current 
problems Ss were having with eating and included 
suggestions and support from the other group mem- 
bers. The aversive counterconditioning Ss, on the 
other hand, did not have further group meetings, 
although a few of these Ss attended one or two of 
the continuation subgroup meetings. Instead, the 
aversive counterconditioning was carried out in indi- 
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vidual meetings with E, which were held approxi- 
mately weekly. 

$ Aversive counterconditioning. The procedures used 
in conditioning were based on a report by Cautela 
(1966). The Ss who were not able to relax suffi- 
ciently after the group instructions in relaxation 
were given further individual instruction until they 
were able to experience a moderate degree of relaxa- 
tion. Each E was then given a thorough explanation 
of and rationale for aversive counterconditioning in- 
cluding the fact that the response was completely 
under his control and that the nausea conditioning 
would only be applied to certain foods in certain 
situations in which he was sure he did not want to 
eat. As work with two Ss in a preliminary study 
had demonstrated that nausea conditioning could 
also be effectively applied to reduce the positive 
valence of such behaviors as eating all the food on 
ones plate or eating more than one cookie, these 
were also mentioned as possibilities. Each S then 
chose a specific food in a specific situation where he 
desired to refrain from eating and was asked to 
describe the situation to E in great detail. The S 
was also asked to recall the last time he felt nauseous 
and vomited and to describe to E the sensations 
associated with nausea. Surprisingly, these varied a 
great deal among Ss. 

During the actual nausea conditioning session, S 
sat comíortably in a chair with eyes closed and 
tried to visualize and imagine as vividly as possible 
the scene E described to him. The E then described 
in detail the situation S had chosen and described 
S's eating the food, feeling more and more nauseous 
and sick with every bite, and then vomiting. On 
alternate trials, S was described as about to eat the 
food, realizing how he didn't really want it, deciding 
not to eat the food, and then relaxing and feeling 
very happy. All but one S reported that their 
imagery and feelings of nausea were extremely 
vivid. One S (who had lost 20 Ib.) reported at the 
next meeting that he did not want to undergo 
nausea conditioning again, because he had found it 
unpleasant. All the other Ss reported that they felt 
the experience was very effective. Usually two aver- 
sive and two relief trials were given in each session. 
Some Ss chose different situations for each aversive 
counterconditioning session; some practiced aversive 
counterconditioning at home; some did neither OF 
both. Due to the approaching end of the academic 
year and because some Ss required lengthier instruc- 
tion in relaxation exercises than others and some 
were unable to meet with Æ each week, each S had 
only one to three sessions of aversive countercondi- 
tioning. 

Final weighing. A final meeting was held app’ 
mately 4 mo. after the study began in order to get 
a final weight for all experimental and control 5s 
All Ss who could not attend the meeting were oat 
tacted and final weights were obtained for a uw 
the two Ss who had previously stopped participating 
and one control S who repeatedly broke appoint 
ments and finally could not be located. At the time 
of the final weighing, the experimental program wae 
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discussed with control Ss, and they were given copies 
of the paper on suggestions for weight reduction, 
the list of low calorie foods, and the manual of 
relaxation exercises. 


RESULTS 


The weights for individual experimental 
and control Ss are reported in Table 1. All 
14 Ss who remained in the study as well as 
the 2 who dropped out lost weight, with the 
difference between pretest and _ posttest 
weights being highly significant (£ = 5.19, p 
< .001). These losses are represented graphi- 
cally in Figure 1. 

Tables 2 and 3 show the results of analyses 
of variance for pounds lost and percentage 
weight loss, respectively, for all experimental 
Ss versus the controls. The effect of the ex- 
perimental treatment is highly significant us- 


ing both measures (p < .001), and the effect 
of sex is significant at the p < .05 level for 
pounds lost and at the .05 < p < .10 level for 
percentage weight loss. Clearly, the sample is 
too small to allow definite conclusions to be 
reached about whether men are likely to lose 
more weight than women independent of origi- 
nal weight. A ż test on the difference between 
the additional weight losses of the aversive 
conditioning and continuation Ss during the 
last month and a half, when aversive condi- 
tioning Ss were receiving the extra treatment, 
revealed no significant differences (t = .201) 
between the two experimental subgroups. 
The questionnaire data generally did not 
show large changes in the mood indexes over 
time. Only the data from those 12 Ss who had 
taken the questionnaire both in late January 


TABLE 1 
WEIGHT LOSSES FOR INDIVIDUAL EXPERIMENTAL AND CONTROL SUBJECTS 
Original weight | Loss at 24mo. | Final weight | Total change | Percentage of 
dress counterconditioning 
su ‘ou 
Male kon 196.5 =15.5 176.0 —20.5 —10 
Male 197.0 —20.5 171.0 —26.0 -13 
Male 198.0 3.5 194.5 —3.5 -2 
Female 135.0 =3.5 124.5 —10.5 —8 
Female 143.0 —2.5 137.5 —5.5 —4 
Female 159.0 —6.0 152.0 =—7.0 —4 
Female 135.0 —5.0 130.0 —5.0 —4 
Mean for males 197.2 —13.2 180.5 —16.7 —8 
Mean for females 143.0 —43 136.0 —1.0 m 
ean for subgroup 166.2 —84 155.1 —11.1 - 
ontinuati 
Mae E 190.0 —80 1750 -145 -8 
Male 218.5 —11.5 208.0 —10.5 —5 
Male 214.0 —14.0 193.0 —21.0 —10 
Female 155.5 =7.5 146.5 —9.0 ave 
Female 143.0 +.5 137.0 pee a 
Female 152.0 —7.0 145.5 —6. i 
Female 129.5 —=2.5 127.5 KA E 
Mean for males 207.5 11.2 192.2 -153 =} 
ean for females 145.0 —44 n ES Ss 
ean for subgroup 171.7 —69 161. ire p 
ean for treatment males 202.4 —12.2 mis TE ze 
ean for treatment females 144.0 —4.3 2.5 —105 By 
ean for treatment group 169.0 —7.5 158. s 
Control Ss —15 i 
Male 181.0 179.5 . Tr 
Male 178.5 166.5 —12.0 
Male 238.0 254.5 16.5 7 
Female 149.0 144.0 78 3 
Female 147.0 159.0 Tn : 
Female 143.0 155.0 20 à 
Female 146.5 Is 10 n 
Mean for control males 199.2 820 o 4 
ean for control females moa 1726 36 2 


ean for control group 
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Fic. 1. Mean weights for experimental and 
control Ss. 


and in mid April were analyzed, as fewer Ss 
completed questionnaires in May and June. 
General level of tension or depression as well 
as tension or depression specifically related to 
eating showed only nonsignificant decreases 
from beginning to end of the experiment and 
no relationship to amount of weight loss. Both 
the temptation to eat large meals (¢ = 2.34, 
p < .05) and a general index of temptation 
to overeat (t = 2.22, p < .05) showed signifi- 
cant decreases over the time course of the 
experiment, but the correlation between per- 
centage weight loss and decrease in temptation 
was not significant (r= .186). The study 
thus provides no evidence that dieting neces- 
sarily causes large mood changes, although 
there is some evidence that this program can 


TABLE 2 


SUMMARY OF ANALYsIS OF VARIANCE 
FoR Pounps Lost 


Source SS df MS F 
Sex 322.32 1 322.32 5.51* 
Treatment 938.15 1 938.15 | 16.04** 
Interaction 27.86 1 27.86 | «1 
Within 994.41 17 58.49 

Total 2282.74 20 
* p « .05. 
a3 < .001. 
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TABLE 3 


Summary OF ANALYSIS OF VARIANCE 
FOR PERCENTAGE WEIGHT Loss 


Source SS df MS F 
Sex 67.06 1 67.06 4.03* 
Treatment 304.02 1 304.02 18.25 
Interaction 58 1 58 | <1 
Within 283.29 17 16.66 

Total 654.95 20 

*.10 .05, 

Drr hd 


reduce the temptation to overeat over a 3-mo. 
period. 


Discussion 


The minimal requirement for a weight 
reduction routine is that it enable people to 
lose weight and keep it off for a moderate 
amount of time without causing serious dis- 
ruption of the individual’s daily life or health. 
More desirable attributes would be the pro- 
duction of permanent weight loss with mini- 
mal expense of time or money and with the 
new eating pattern becoming both habitual 
and satisfying, so that the individual is not 
constantly troubled by desires to eat. It is 
clear that the program described in this study 
does satisfy the minimal requirements. All Ss 
showed a decrease in weight, a loss which was 
significantly greater than zero and than the 
slight gain shown by control Ss. Expense in 
money was nil and time spent attending meet- 
ings varied from about 10 to 20 hr. 

Several of the less essential but desirable 
features are also found in this program. All 
Ss expressed satisfaction with the program 
and stated that they had found it extremely 
helpful. The low dropout rate of 12.5% com- 
pares very favorably with those reported in 
the literature, such as 39% not returning 
after one visit (Stunkard, 1959c), 33% not 
answering questionnaires after 6 mo. (Fran ; 
lin & Rynearson, 1960), 66% dropping OU 
within 1 yr. (Silverstone & Solomon, 1908) 
27% after one meeting (Shipman & e 
1963). Of the Ss who received aversive cont 
tioning, only one was not enthusiastic a 
the technique. This was the only instance A 
any S objecting to any aspect of the Pa 
cedure; part of the success of the prog" 
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may have been that Ss were free to choose the 
techniques and eating patterns which suited 
them best rather than being forced to con- 
form to rigid rules. 

The questionnaire data show no significant 
mood changes as a result of the program, al- 
though the changes are in the direction of 
greater happiness and relaxation. The only 
changes which were significant were those in- 
dicating a lesser desire to overeat at meals and 
a decrease in general temptation to overeat at 
meals, eat sweets, and eat between meals. It 
seems likely that 3 mo. is not a sufficient 
time for large changes in affective values of 
eating to occur. It is also possible that ques- 
tions dealing with eating behaviors rather 
than attitudes would reveal more significant 
changes. 

The data appear to indicate that the pro- 
gram may be more effective for males than 
females, a difference also reported in a study 
by Stunkard (1959c). Although the possibil- 
ity of physiological differences is not ruled 
out, the variables of awareness and motiva- 
tion probably contribute to the sex differences 
found. In general, the men participating in 
this study were less aware of the caloric and 
nutritional values of various foods and of 
what their own eating habits were than were 
the women; moreover it is possible that the 
commitment to a group weight-reduction pro- 
Bram represents a greater degree of motiva- 
tion for men than for women, who often have 
4 long history of going on diets with "the 
girls.” A larger study will be necessary to 
discover the extent and causes of these sex 
differences, which were not quite significant 
When percentage weight lost rather than 
pounds lost was used as the measure. 

The most crucial test of a weight reduction 
Program or indeed of any therapy program is 
the permanence of its changes. Although long- 
term data are very difficult to obtain on 
members of a fluctuating university popula- 
tion, some data are available on Ss who par- 
ticipated in a preliminary study of many of 
the techniques used in this final program. 
"Twelve female Ss originally began the pilot 
study; two dropped out very early, one for 
Medical reasons and pregnancy. Eight of the 
Ss met with E in two groups; two of the Ss 
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met individually with E and received aversive 
conditioning. All Ss learned about stimulus 
control procedures. Each S kept daily eating 
records before and during the period of the 
experiment. Most of the Ss met regularly with 
E for only 2 to 3 mo., although follow-up 
data were gathered on all available Ss about 
6-7 mo. after the start of the study. All but 
one S lost weight after 2-3 mo. with the 
mean short-term and also long-term loss for 
all pilot Ss being 6 Ib. Thus it does appear, 
for this preliminary study, that some of the 
effects of this program do last for at least 6 
or 7 mo. Other long-term data are provided 
by E, who lost about 27 Ib., regained about 
12 Ib. and has lost about 7 of those, 12 mo. 
after the beginning of the original study. 
However, in view of the extremely poor long- 
term results of other weight reduction pro- 
grams, it is clear that no conclusion can be 
drawn about the permanence of these changes. 

It is possible that several variables may 
have contributed to the success of this par- 
ticular study besides the planned experimental 
procedures. The modeling effect of E, who 
went from fat to moderate with the pretest Ss 
and from moderate to thin with Ss in the final 
study, was commented upon by many of the 
Ss. Many of the Ss stated that participating 
in a research project where data they pro- 
vided were important increased their motiva- 
tion. The majority of these Ss, moreover, were 
college educated, which may have made it 
easier for them to assimilate such concepts as 
reinforcement, stimulus control, and caloric 
values. 

It is also possible that certain of the varia- 
bles in this program, such as the group inter- 
action, might make no additional contribu- 
tion beyond that of the other procedures. It is 
possible that certain procedures, such as eat- 
ing only when sitting down at the table, might 
be effective for all Ss whereas others, such as 
rewarding oneself monetarily for every day of 
good eating habits, might be of no use to any 
S. A much more controlled study in which 
various techniques and combinations of pro- 
cedures are isolated would be necessary to 
discover their differential effects. In this study 
no additional effect of a very few sessions of 
aversive conditioning was found, although 
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most Ss reported that it had reduced or elimi- 
nated their desires for those foods and the 
two pretest Ss given aversive conditioning 
continued to lose weight at least over a pe- 
riod of 6 mo. It is conceivable that aversive 
conditioning, both classical and operant, will 
turn out to be a useful tool for Ss who do 
have cravings for certain foods or for eating 
everything set before them, although it may 
well be useless for Ss whose problem is lack 
of awareness of their eating habits and caloric 
values. One might expect the effects of aver- 
sive counterconditioning used as an adjunct 
to such a program as the one described in 
this study to be much more enduring than the 
effects of aversive counterconditioning ses- 
sions alone. In short, the study was designed 
to see if any program for long-term weight 
reduction through change of eating habits 
could be achieved; future studies will be 
necessary to discover the contributions of 
specific variables, and the permanence of the 
weight losses achieved. 
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ILLUSORY CORRELATION AS AN OBSTACLE TO THE 


USE OF VALID PSYCHODIAGNOSTIC SIGNS? 


LOREN J. CHAPMAN 4x» JEAN P. CHAPMAN 


University of Wisconsin 


Practicing psychodiagnosticians (W = 32), when surveyed, failed to report ob- 
serving Wheeler-Rorschach Signs 7 and 8 as accompanying male homosex- 
uality although research evidence indicates that these are valid signs. They 
instead reported observing Wheeler Signs 4, 5, 16, 19, and 20, which research 
literature indicates are invalid. These popular invalid signs were found to have 
much stronger rated, verbal associative connections to male homosexuality 
than the unpopular valid signs. Six hundred and ninety-three undergraduates 
(divided among 13 conditions) viewed 30 Rorschach cards on each of which 
was arbitrarily designated a patient's response and his two symptoms. The Ss 
"rediscovered" the same invalid Rorschach content signs of homosexuality as 
the clinicians reported observing in their clinical practice, although these rela- 
tionships were absent in the experimental materials. They did so regardless of 
the degree to which the clinically valid signs were valid in the contrived task 


materials. 


The psychodiagnostician who uses psycho- 
logical tests relies usually on his clinical ex- 
perience for interpreting test responses. He 
accumulates observations of the different re- 
Sponses that occur as correlates of various 
personality characteristics of patients. He later 
uses this information to infer, at least tenta- 
tively, the presence of similar characteristics 
in patients giving similar responses. To 
do this, the diagnostician must assume, of 
course, that he is able to observe and remem- 
ber which characteristics of test performance 
Occur as correlates of each characteristic of 
personality. The inference of characteristics of 
personality from the occurrence of isolated 
characteristics of performance, instead of in- 
terpretation in terms of more complex pat- 
lerns of test performance, is often termed the 
“sign” approach. This frequently deprecated, 
yet widely used, approach to projective test 
Interpretation is the focus of the present 
study, 

A large body of research literature demon- 
Strates that many diagnostic test signs lack 


1This study was supported by Research Grant 
MH-07987 from the National Institute of Mental 
Health, United States Public Health Service. The 
authors are indebted to Fred Cosentino for his as- 
sistance in the gathering and analysis of the data, and 
to Richard McFall for his critique of the manuscript. 

? Requests for reprints should be directed to Loren 
; Chapman, Department of Psychology, University 
of Wisconsin, Madison, Wisconsin. 
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the psychological meanings that clinical ob- 
servers have claimed for them. It is not sur- 
prising, then, that other studies have demon- 
strated that psychodiagnosticians are usually 
less able, than they believe, to make valid 
statements about patients on the basis of 
tests. Little and Shneidman (1959) made this 
point very clearly. In an especially well-con- 
trolled study, they found that eminent clini- 
cians performed only slightly above chance. 

The enormous discrepancy between the re- 
ports of clinical observers and the research 
evidence concerning the meanings of test re- 
sponses has long been a puzzle. The conflict 
of evidence has been especially disquieting 
because clinicians usually show substantial 
consensus as to specific meanings of various 
test responses, although these same meanings 
may have been discredited by research evi- 
dence. 

The present writers (Chapman & Chapman, 
1967) have recently suggested a possible 
resolution of this enigma by demonstrating a 
source of massive systematic error in ob- 
servations of correlations between symptom 
statements and features of projective test 
protocols. The projective test studied was the 
Draw-a-Person Test, which has been largely 
discredited as a measure of personality by the 
research evidence (Swensen, 1957). Naive 
undergraduates viewed a series of 45 Draw-a- 
Person Test drawings paired with contrived 
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TABLE 1 
Criticat RATIO (z) VALUES FOR WHEELER SIGNS THAT DISTINGUISHED 
MALE HowosEXUALs FROM HETEROSEXUALS IN THREE STUDIES 
s value 
Study 
Sign 7 Sign 8 Sign 10 Sign 17 Sign 19 Sign 20 
Wheeler 1.64 1.79 
Davids, Joelson, & McArthur 1.70 247 1. 1.80 
Hooker 1.94 1.90 


symptom statements about the alleged pa- 
tients who drew them. In these contrived 
materials there was no correlation between 
the occurrence of any symptom statement 
and any drawing characteristic. The under- 
graduate Ss “rediscovered” the same correla- 
tions between drawing characteristics and 
symptoms that a group of clinicians most 
often reported observing in their clinical prac- 
tice. These relationships are called “illusory 
correlations” because the naive Ss reported 
observing them, although they were actually 
absent in the experimental materials. More- 
over, the popularity of the various illusory 
correlations corresponded to strength of rated, 
verbal associative connection between symp- 
tom and drawing characteristic. These find- 
ings suggest that the popular meanings of 
many test signs, as reported by clinicians, are 
illusory correlations based on verbal associa- 
tive connection of the test sign to the symp- 
tom, rather than on valid observations. 

Not all tests, however, are as completely 
lacking in validity as the Draw-a-Person 
Test. Performance on most psychodiagnostic 
tests has some correlation with personality 
characteristics. In light of the research evi- 
dence indicating that clinicians are only 
moderately successful in the interpretation of 
psychodiagnostic test performance, one must 
wonder whether clinicians’ observations of 
valid signs, when such signs are present, is 
impeded by their proclivity to observe, in- 
stead, associatively based illusory correla- 
tions. This question is the central interest of 
the present paper. 

To investigate this issue, it was first neces- 
sary to choose a symptom and a test for 
which both valid and invalid signs have been 
reported by clinical observers. Male homo- 
sexuality and Rorschach content analysis were 


chosen because they appear to fulfill these 
criteria. Wheeler (1949) offered 20 Rorschach 
signs of male homosexuality. Clinicians com- 
monly report several of these signs as sub- 
stantiated by their own clinical experience, 
but research evidence strongly supports only 
two of the signs. Three studies by different 
investigators (Davids, Joelson, & McArthur, 
1956; Hooker, 1958; Wheeler, 1949) have 
reported statistically interpretable evidence on 
the validity of all 20 Wheeler-Rorschach con- 
tent signs, and these studies show some 
agreement. Table 1 presents the z values for 
those Wheeler signs that distinguished homo- 
sexual and heterosexual groups in one or more 
of the three studies at the .05 level using à 
one-tailed test. For Wheeler’s study, the 2 — 
values were computed by the present writers 
from Wheeler’s published data, using the | 
formula for the critical ratio between uncor- 
related proportions. Hooker obtained her 
values using the comparable formula for cor 
related proportions, after matching pairs of 
Ss by total number of responses. Davids et al. 
did not indicate which formula they used. 

As seen in Table 1, Wheeler Signs 7 and 8 
were both found to distinguish homosexual 
from heterosexual groups in two of the three 
studies. Wheeler Sign 7 is a response on Car 
IV of “human or animal—contorted, mon 
strous, or threatening,” and Wheeler Sign 
is a response on Card V, W or Center D, 
a “human, or humanized animal.” Wheelers 
example of “humans” is “woman dressed 3$ 
a bat” which indicates that the humans a 
animalized humans. Signs 10, 17, 19, and i 
were each found to distinguish the gronini d 
one study but were tested and not found va a 
in the other two studies. A finding by b 
alone of a significant difference for one 3 pe 
out of 20 is not unexpected. Therefore, 


ia tS a) es s—‘“‘—~=CS 


J 
2 


ILLUSORY CORRELATION AND PsYCHODIAGNOSIS 


purposes of this study, Signs 7 and 8 were 
considered the only clinically valid signs. 
This conclusion tends to be supported by 
Reitzell (1949) who reported Signs 7, 8, and 
16 as the most discriminating of the 20 
Wheeler signs. Unfortunately, her data can- 
not be analyzed statistically because she re- 
ported them in terms of number of responses 
of each type given by homosexual and non- 
homosexual groups, rather than the number of 
Ss in each group who gave each sign. 

The hypotheses of this study are: 


1. The “popularity” of signs among prac- 
ticing clinicians has little relationship to the 
objective clinical validity of the signs, as in- 
dicated by research evidence. 

2. The most popular signs among practic- 
ing clinicians are the ones that have the 
Strongest verbal associative connection to 
male homosexuality. 

3. Naive observers, when presented with 
contrived Rorschach responses arbitrarily 
paired with statements of symptoms of the 
patient who gave each response, erroneously 
report observing that these same associatively 
based invalid signs occur as correlates of 
homosexuality. 

4. The naive observers report these associa- 
tively based illusory correlations even when 
the materials are contrived so that other 
(clinically) valid correlations are present. 


Part I 


Experiment 1: Clinical Observations by 
Practicing Psychodiagnosticians 


It was first necessary to learn the kinds of 
Rorschach content that psychodiagnosticians 
believe they have observed most often in the 
Protocols of male homosexuals. A question- 
naire was prepared for circulation among 
practicing clinicians. The questionnaire was 
anonymous, but it asked the clinician to list 
his academic degrees and the years he ob- 
fained them, as well as the number of years 
of his psychodiagnostic experience. 


Instructions 


Although the more traditional clinical use of the 
Orschach is the interpretation of Rorschach de- 
terminants, many practicing clinicians in recent years 
ave observed that the content of Rorschach re- 
Sponses is also related to the patient's emotional 
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problems. Some of these observed relationships have 
been discussed in published reports, while others have 
not. We wish to ask about your observations con- 
cerning the content of Rorschach responses by two 
kinds of patients. 

1. Have you seen the Rorschach protocols of a 
number of men who have problems concerning 
homosexual impulses, either overt or covert? 

2. What kinds of Rorschach content have you 
observed to be prominent in the Rorschach protocols 
of men with problems concerning homosexual im- 
pulses? 

3. If possible, would you list some examples of the 
kinds of responses made by such men. 

Altogether 76 copies of this questionnaire were sent 
to clinicians at internship training centers, academic 
psychology departments, and other leading clinical 
installations. Each of 11 recipients received several 
copies with the request that he circulate them to his 
colleagues. 


Results 


Questionnaires were returned by 42 clini- 
cians 32 of whom said that they had seen the 
Rorschach protocols of a number of men with 
homosexual problems and were willing to list 
the kinds of Rorschach content they had ob- 
served. Of these 32 clinicians, 22 reported 
having the PhD degree and 10 the master’s 
degree. They reported 2-29-yr. psychodiag- 
nostic experience, with a mean of 9.1. 

The clinicians’ most frequent responses are 
listed in Table 2. Only two clinicians listed 
one of the two valid Wheeler signs. Both of 
these were Wheeler Sign 7. Most of the 
clinicians listed one or more of five invalid 
Wheeler signs. Two of these five signs have, 


TABLE 2 


Tue Five WHEELER-RORsCHACH CONTENT SIGNS 
or Mate HowosEXUALITY Most 
FREQUENTLY REPORTED BY THE 


CLINICIANS 
Percentage of 
clinicians re- 
Wheeler sign porting the 
sign 
16 Human or animal anal content 44 
20 Feminine clothing 38 
4 Humans with sex confused 288 
5 Humans with sex uncertain 16" 
19 Male or female genitalia 38 


a The percentages listéd here are for the clinicians’ reports of 
this RUE regardiees of the card on which they observed it. 
Wheeler limited Signs 4 and 5 to Card III, but few of the 
clinicians specified a card. Wheeler described sexual confusion 
on Card I as Sign 2, but did not describe sexual uncertainty on 
that card despite the conceptual similarity of these two classes 


of percepts. 
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as previously discussed, each received support 
in only one of three studies, and the other 
three were supported by none of them. 

In order to test the hypothesis that the 
popularity of the invalid signs of homosexual- 
ity is based on high-strength, verbal associa- 
tive connection with homosexuality, ratings of 
strength of associative connection were ob- 
tained. The following rating item, used for 
“anality,” illustrates the format that was used. 


The tendency for “homosexuality” to call to mind 
“rectum” and “buttocks” is 

a, Very strong. 

b. Strong. 

c. Moderate. 

d. Slight. 

e. Very slight. 

f. No tendency at all. 


Rating items were used for both of the two 
unpopular valid signs as well as for the five 
popular invalid signs that are listed in Table 
2. The wordings of the signs were in some 
cases rephrased, as in the example above, to 
facilitate understanding by the raters. These 
wordings are listed in Table 3. Because of 
their conceptual similarity, “sexual uncer- 
tainty” and “sexual confusion" were repre- 
sented in the ratings by a single item of “part 
man—part woman." In addition, there was 
one item each for “food” and “maps,” which 
were used as filler items in the studies that 
follow. 

The eight items concerned with homosex- 
uality were intermixed with 80 other items in 
the same format, most of which were not con- 
cerned with sexual material. The questionnaire 
was given to a group of 34 undergraduate 
students who did not participate in the other 
studies reported in this paper. 

The six associative ratings from a to f were 
assigned values from 6 to 1 and a mean was 
computed for each item, Table 3 reports the 
mean rated associative strength between 
homosexuality and each of the eight cate- 
gories. 

As seen in Table 3, the popular invalid 
signs have a much stronger associative con- 
nection to homosexuality than either the un- 
popular clinically valid signs or the two cate- 
gories of filler items. There is no overlap be- 
tween the mean rated strength of the popular 
invalid signs and that of the other signs and 
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TABLE 3 


MEAN RATED STRENGTH OF ASSOCIATIVE 
CONNECTION BETWEEN HOMOSEXUALITY 
AND CONTENT AREAS 


Content area 


strength. 
Popular invalid signs 1 
Rectum and buttocks 4.38 1 
Part man—part woman 3.58 ] 
Feminine clothing 342 
Sexual organs 447 
Unpopular valid signs i 
Part animal—part human 1.93 
Monsters 1.68 
Filler items 
Food 1.09 
Maps 1.09 


categories of filler items. The rated associa- 
tive strength to homosexuality of the lowest 
rated popular invalid sign (feminine clothing) ` 
was compared with that of the highest rated 
unpopular valid sign (part animal—part hu- 
man). The difference was significant as indi- - 
cated by a two-tailed ¢ test (£ = 3.52, P< 
.001). These data suggest that whether or not 
a clinician reports a given category of per- 
cepts as a correlate of homosexuality is de- 
termined primarily by the strength of its 
verbal associative connection to the symptom, 
rather than by objective reality. This in- 
ference is congruent with earlier findings 
(Chapman & Chapman, 1967) concerning the 
Draw-a-Person Test. 


Part II 


Laboratory studies were designed to de- 
termine whether naive observers, present 
with contrived statements of patients’ symp 
toms and their Rorschach responses, WO 
make the same errors of observation that 
clinicians appear to have made in their o! 
servational reports. The demonstration 1m i 
laboratory of these systematic errors of 0 
servation based on verbal associative e 
tion would lend strong additional sup, 
to the contention that the clinicians' es 
reflect illusory correlation based on ass 
tive connection. 


bject. 
Subjects sd undi 


The Ss in the laboratory studies were test ositiom 
13 conditions which, for convenience of en iments 
were divided into three experiments (Enn intro 
II, IIT, and IV). All Ss were students in 4! 
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ductory psychology course. The number of Ss in 
Conditions 1-13 were 60, 43, 53, 37, 39, 52, 49, 59, 
54, 61, 60, 60, and 66, respectively. No S served in 
more than one condition. All Ss were asked on a 
questionnaire if they had any familiarity or experi- 
ence with principles of Rorschach interpretation. The 
occasional S who indicated any such familiarity was 
not included in the sample. 


Experiment II (Conditions 1-5): Illusory 
Correlation in the Absence of Valid 
Relationships 


Experiment II was designed to determine 
if the invalid signs that were found to be 
popular with the clinicians would also be re- 
ported by naive observers when no valid rela- 
tionship is present between any category of 
percepts and the symptom of male homo- 
sexuality. This study contained five condi- 
tions, each of which was designed to explain, 
on the basis of illusory correlation, the popu- 
larity of the report of one of the five popular 
invalid signs listed in Table 2. 


Method 


Clinical materials were fabricated to be shown to 
naive observers. The materials consisted of 30 Ror- 
schach cards, on each of which one percept (or 
response) was paired with two statements of the 
emotional problems of the purported patient who 
was alleged to have given the response. The cards 
were covered with transparent plastic. Rorschach 
Percepts were indicated by circling an area of the 
card and pasting on it a typed statement of the 
verbalization. For example, for one of the 30 Ror- 
schach responses, the center area of Card V—or area 
D-7 (Beck, 1961)—was circled and labeled “Bugs 
Bunny.” In a corner of the card appeared the state- 
Ment: 


The man who said this 


1. has sexual feelings toward other men. 
2. feels sad and depressed much of the time. 


The 30 percepts were chosen so that six fell into each 
of five categories, which were: 

A. One popular invalid sign 

B. Wheeler Sign 7: Human or animal—con- 
torted, monstrous, or threatening on Card 
IV (a clinically valid sign) 

C. Wheeler Sign 8: Humanized animal or an 
animalized human on Card V (a clinically 
valid sign) 

D. Geographic features (a filler category) 

E. Food (a filler category) 


There were five conditions in Experiment II, 
Which differed only as to which invalid sign was 
Used. In all other respects, both the procedure and 
materials in all five conditions were identical. The 
invalid signs for the five conditions were: Condition 
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1—human or animal with anal content; Condition 2 
—feminine clothing; Condition 3—humans on Cards 
I or III with sex confused; Condition 4—humans on 
Cards I or III with sex uncertain; and Condition 5 
—male or female genitalia. 

The percepts used for each category are listed be- 
low. Locations of percepts are indicated using Beck's 
system. The percepts chosen for the seven Wheeler 
signs were Wheeler's (1949) examples, and others 
modeled after his examples. 

Wheeler Sign 7. Human or animal, contorted 
monstrous or threatening, All were on Card IV, W: 
(a) a horrid beast, (b) Frankenstein, (c) a headless 
monster about to step on me, (d) man looking back 
through legs, (e) a giant with shrunken arms, and 
(f) a deformed man doing a back bend. 

Wheeler Sign 8. Humanized animal or animalized 
human, on Card V. Four were W responses: (a) 
woman with butterfly wings, (b) an alligator in a 
fur coat, (c) man dressed like a bat, (d) a pigeon 
wearing mittens. Two were D-7: (e) a dog wearing 
clothes, and (f) Bugs Bunny. 

Human or animal anal content. (a) Card I, Dd-22, 
rectum, (b) Card VII (inverted), Dd-25, anus, (c) 
Card IV, W, anal opening of man bending over, 
(d) Card VI, W (without D-3), horse's rear end, 
(e) Card IX, both D-1, woman's buttocks, and (f) 
Card II, D-4, anal opening. 

Feminine clothing. (a) Card III, D-10, woman's 
high-heeled shoe, (b) Card II, D-2, woman's hat, 
(c) Card X, D-9, fur stole, (d) Card VIII, D-5, 
woman's laced corset, (e) Card III, D-7, woman's 
bra, and (f) Card VI, D-1 (inverted), woman's fur 
cape. 

Sexual confusion. 'Three of these percepts were on 
Card I, D-4: (a) it looks like a woman but with 
broad shoulders, (b) upper part is male and the 
bottom is female, (c) this has shoulders like a man 
but breasts and hips like a woman. Three were on 
Card III, D-9: (d) looks like a man below the 
waist but like a woman above, (e) a man but with 
breasts, (f) a woman, here are her breasts, but has 
the feet of a man. 

Sexual uncertainty. Three of these percepts were 
on Card I, D-4: (a) I guess this is a man or maybe 
it’s a woman, (6) a person can't tell if it's a man or 
a woman, (c) a human, could be masculine or 
feminine. Three were on Card III, D-9: (d) a per- 
son, might be male or it might be female, (e) could 
either be a man or a woman, and (f) two people, 
but I can't tell what sex they are. 

Male or female genitalia, (a) Card I, Dd-22, 
testicles, (b) Card II, D-3, female genital organ, 
(c) Card III, Dd-26, penis, (d) Card X, D-11, male 
genitalia, (e) Card VI, D-2, penis, and (f) Card 
III, D-8, vagina. 

Geographic features. (a) Card VII (inverted), 
D-2, map of North and South America, (b) Card 
VII, D-3, map of Spain, (c) Card X, D-9, map of 
California, (d) Card II (inverted), D-2, map of 
South America, (e) Card X, D-13, New Zealand, and 
(f) Card VI (inverted), Dd-25, map of California. 

Food. (a) Card VII, D-4, a loaf of rye bread 
split in half, (b) Card IX, D-6, raspberry sherbet, 
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(c) Card VIII, D-7, jello, (d) Card VI, D-6, aspara- 
gus tips, (e) Card X, D-2, scrambled eggs, and (f) 
Card III, D-1, bowl of fruit. 

The two statements of emotional problems or 
symptoms listed on the cards were drawn from a 
pool of four such statements. These were: 


1. He has sexual feelings towards other men. 

2. He believes other people are plotting against 
him. 

3. He feels sad and depressed much of the time. 

4. He has strong feelings of inferiority. 


The statements of symptoms and Rorschach per- 
cepts were paired on the 30 cards so that each of 
the four symptom statements appeared 15 times. 
Each symptom statement was paired with three of 
the six percepts from each of the five categories of 
percepts. Thus there was no relationship between 
the occurrence of any one of the four symptoms and 
any one of the five categories of response. 

The prediction was that despite this lack of true 
correlation in the experimental materials, Ss who 
viewed the series of 30 cards in each of the five 
conditions would believe that they observed that the 
symptom statement, ^He has sexual feelings toward 
other men," had appeared more often with the as- 
sociatively based invalid sign than with any of the 
other four categories of content. 

The testing was done in groups of 30 or fewer. 
The Ss were given some brief introductory informa- 
tion as to the nature of the Rorschach. They were 
told that Rorschach responses indicate personality 
functioning. However, they were given no informa- 
tion about categories of either content or determi- 
nants, They were then told: 


I am going to show you a series of inkblots, 
one at a time. On each inkblot you will find a 
typed statement of what one patient saw on this 
blot and also what his two chief emotional prob- 
lems are. Each of these 30 cards represents a 
different patient. You will see what 30 different 
patients said they saw on a card. Now let me tell 
you what I want you to do. Please carefully study 
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PERCENTAGE OF NAIVE OBSERVERS REPORTING EacH or Five CATEGORIES OF PERCEPT 
As Most OFTEN ACCOMPANYING THE PROBLEM OF HOMOSEXUALITY FOR EACH 
OF THE Five CONDITIONS IN EXPERIMENT II 
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tient said that he saw in it. Also study the state- 
ment of the patient's two severe emotional prob- 
lems. When everyone has looked at all of the 
cards, I’m going to give you a questionnaire in 
which I will ask you about the kinds of things seen 
by patients with each kind of problem. 


each inkblot and the statement of what the pa. | 


The cards were circulated in a prearranged pat- 
tern so that each S saw each of the 30 cards for 60 
sec. After S had seen the 30 cards, he was given a 
questionnaire which presented four items (one for 
each of the four symptom statements) in the follow- 
ing format. 


Some of the things in the inkblots were seen by 
men who have the following problem: 
He has sexual feelings toward other men. 


Did you notice any general kind of thing that was 
seen most often by men with this problem? 


Yes __ No __.. If your answer is yes, name that 
kind of thing, and give one example of that kind 

Kind of thing 

Example 
Two forms of the questionnaire were used with dif- 
Results 

Since the responses to the homosexual item 
were of primary interest in the present paper, 
the analysis of the results focused on this 
item. Table 4 shows the correlates reported 
for homosexuality by Ss in each of the five 
conditions. As seen there, very few Ss. indi- 
cated that they could find no relationship be 
tween the percepts and the symptoms. In € 
condition, Ss, as predicted, reported that they 
observed the hypothesized illusory correlate 
as accompanying homosexual problems more 


Condition 
Correlate reported 
1Anality | 2Feminine 3 Sexual 4 Sexual 5 Genitalia 
clothing confusion uncertainty ponpe oOo 

Predicted illusory correlate 58 40 45 24 S 
Clinically valid signs 3 

Human or animal, monstrous 8 14 13 10 0 

Part animal—part human 3 14 6 8 
Filler categories 13 

Geographic features 12 5 13 16 B 

Food 8 16 15 18 8 
Other correlates 0 0 2 10 13 
No correlate reported 10 12 6 13 


of thing. 
ferent orders of the four items. 
ach 
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often than any other category of percept. 
Chi-square analysis indicated that the dis- 
tributions of responses among the five cate- 
gories departed from chance (p< .01) for 
each condition except Condition 4, in which 
the hypothesized illusory correlate was sexual 
uncertainty. 

The frequent report of the invalid Wheeler 
signs as illusory correlates of homosexuality 
cannot be attributed to a higher frequency 
of report for all symptom statements. The 
mean percentage report of the five signs for 
the other three symptoms were as follows: 
anality, 4%; feminine clothing, 13%; sexual 
confusion, 9%; sexual uncertainty, 8%; and 
genitalia, 10%. In none of the five conditions 
was the illusory correlate of homosexuality 
the most frequently reported correlate of any 
one of the other three symptoms. 

One may conclude from the data of Table 
4 that the observation of popular invalid signs 
of male homosexuality were reproduced in the 
laboratory as illusory correlates. On the other 
hand, the clinically valid signs of homosex- 
uality were not reported as illusory correlates 
of that symptom any more often than were 
the two filler categories. 


Experiment III (Conditions 6-11): Illusory 
Correlation in the Presence of Valid 
Relationships 


Experiment III was undertaken to de- 
termine if the associatively based illusory cor- 
relates are reported by naive observers even 
if the clinically valid signs have contrived 
validity in the experimental task materials. 
To build in contrived validity, the clinically 
valid Wheeler Signs 7 and 8 were paired with 
the symptom statement of homosexuality more 
often than with the other classes of percepts. 


Method 


Only two of the five popular invalid signs were 
chosen to be used in Experiment IIL They were 
anality and sexual confusion. 

The task materials were almost identical to those 
used in Experiment II. The instructions and answer 
Sheets were unchanged. The percepts, as well as the 
Pool of symptom statements were the same as those 
Previously used in Conditions 2 and 4 in which 
anality and sexual confusion were used as popular 
invalid signs. As in the earlier conditions, each S 
Saw 30 cards, six presenting a percept of a popular 
invalid sign, six for each of the two clinically valid 
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signs (Wheeler Signs 7 and 8), and six for each of 
the two filler categories (geography and food). The 
task materials differed from Experiment II only in 
the percentage of clinically valid percepts that were 
accompanied by the symptom statement of homo- 
sexuality. Three degrees of contrived validity were 
investigated: one in which the symptom statement 
of homosexuality occurred with two-thirds of the 
percepts of each of the two valid signs, one in 
which it occurred with five-sixths of them, and one 
in which it occurred with all of them. After being 
rounded off to the nearest percentage point the 
values of the three degrees of contrived validity 
were 67%, 83%, and 100%, respectively. The com- 
bination of three degrees of contrived validity with 
each of two popular invalid signs yielded six con- 
ditions altogether. In Conditions 6, 7, and 8, the 
popular invalid sign was sexual confusion, with con- 
trived validity for Signs 7 and 8 of 67%, 83%, and 
100%, respectively. In Conditions 9, 10, and 11, the 
popular invalid sign was anality with the same three 
degrees of contrived validity for Signs 7 and 8. As 
in Experiment II, 50% of each of the other three 
categories of percepts were accompanied by the 
symptom statement of homosexuality, as well as by 
the other three symptom statements. 


Results 


Table 5 shows the percentage of Ss in each 
condition who reported the co-occurrence with 
homosexuality of each of the five categories 
of percepts. The most striking feature of 
these data is the degree to which the illusory 
correlates based on associative connection are 
impervious to the contrary influence of valid 
correlations. In all six conditions, the clinically 
popular invalid sign was the most frequently 
reported correlate of the symptom statement. 

For each of the six conditions, the report 
of the homosexual symptom statement was 
distributed among the various categories of 
percepts significantly different from chance, 
as shown by chi-square analyses (p < .01 in 
every case). The number of Ss who chose the 
invalid sign as a correlate was compared with 
the number who chose one or the other of 
the two valid signs. A goodness-of-fit chi- 
square was used with the expected values 
being one-third of the Ss for the invalid sign, 
and two-thirds of the Ss for the two valid 
signs combined. In five of the six comparisons 
the difference was significant (p< .01 in 
each case). The one nonsignificant difference 
was for anality when the clinically valid signs 
had 83% contrived validity. The difference 
for this condition fell short of significance 
(x? = 3.03, df = 1, p < 10). These findings 
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TABLE 5 


PERCENTAGE, OF Ss IN EXPERIMENT III REPORTING 
THE OBSERVATION OF EAcH CATEGORY OF 
PERCEPT AS A CORRELATE OF 


HOMOSEXUALITY 
Percentage contrived 
validity for Signs 
7 and 8 
Category 
67% | 83% | 100% 


Conditions 6, 7, and 8 


Invalid signs 


Sexual confusion 50 | 41 | 46 
Valid signs 

Human or animal—monstrous | 12 | 27 | 20 

Part animal—part human 10 6 | 14 
Filler categories 

Geographic features 2 12 5 

Food 8 0 2 
Other correlates 10 6 8 
No correlate reported 10 8 5 


Conditions 9, 10, and 11 


Invalid sign 
Anality 54 | 34 55 
Valid signs 
Human or animal—monstrous | 16 16 15 
Part animal—part human 13; | 23 13 


Filler categories 


Geographic features 6 | 13 5 
Food 2 3 2 
Other correlates 0 3 2 
No correlate reported 9 7 8 


indicate that the invalid signs were reported 
as correlates of homosexuality more often 
than were the valid signs, which had objec- 
tively true correlations with the symptom. 
Another way to view these data is in terms 
of a comparison of the number of Ss who re- 
ported the popular invalid signs, and the 
number who reported the valid signs, as cor- 
relates of homosexuality at each of the four 
levels of contrived validity of the two valid 
signs. (The four levels are the 50% co-oc- 
currence of the clinically valid sign with the 
symptom statement—from Experiment II— 
plus the 67%, 83%, and 100% levels of con- 
trived validity of Experiment IIT.) When the 
popular invalid sign was sexual confusion, 
these four conditions did not differ on fre- 
quency of report of the invalid sign (x? = .84, 
df = 3, 80<~<.90). They also did not 
differ on number of Ss reporting one or the 
other of the two valid signs (x? = 4.92, df 
=3, .10 < p< .20). When the popular in- 
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valid sign was anality, the four conditions of 
contrived validity differed in both the number 
of Ss reporting the popular invalid sign (y? 
= 8.47, df = 3, p < .05) and in the number 
reporting one or the other of the two valid 
signs (x? = 13.05, df = 3, p < .01). Inspec- 
tion of the data for the three anality condi- 
tions indicates that Ss achieved highest ac- 
curacy by both criteria in the 83% validity 
condition. These findings indicate that in- 
creasing the contrived validity had a small 
effect in reducing the report of one popular 
invalid sign as a correlate of homosexuality, 
but not the other. The validity was accurately 
perceived to a slight degree. 

Further analyses were performed to seek 
evidence on the accuracy of perception of the 
contrived validity. For the three conditions 
for which sexual confusion was the popular 
invalid sign, more Ss chose the valid sign than 
the filler items for the 100% validity condition 
(9 = 9.38, df —1, p< 01) but this dif- 
ference did not emerge on either the 67% or 
83% validity conditions. For the three condi- 
tions for which anality was the popular in- 
valid sign, Ss reported the valid signs more 
often than the filler categories for all three 
levels of contrived validity (p < .05 in all 
three cases). These findings again show that 
the contrived validity was to some degree ac- 
curately perceived, despite the fact that its 
presence did little to reduce the frequency of 
report of the illusory correlates. 


Experiment IV: Accuracy of Report in the 
Absence of Popular Invalid Signs— 
Condition 12 


The most striking findings of Experiment 
III were the frequency of report of the il 
lusory correlates, and the low accuracy of 
Ss observational reports of the valid signs. 
These findings might lead one to wonder } 
the infrequent detection of valid signs 0 
these tasks is due to the distracting influence 
of illusory correlates or whether the valid 
signs are inherently difficult to discover. Fur- 
ther, one might wonder whether associatively 
based illusory correlation occurs only when 
the detection of valid signs is very difficult. 

In defense of the tasks used in Experimett 
III, one might point out that they surely 
present the observer with a less difficult JO 
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of information processing than does conven- 
tional clinical practice. In clinical practice, 
the symptoms are more numerous and more 
ambiguous, and many percepts are given by 
each patient. Also, the clinician encounters 
the patients over a long period of time, so 
that retrospective falsification of observations 
should occur more often. Nevertheless, it is 
of interest to determine if accuracy is higher 
on a task that is comparable to those of 
Experiment III, but in which associatively 
based popular invalid correlates are absent. 
The first part of Experiment IV (Condition 
12) was designed to give evidence on this 
question. 


Method 


The stimulus materials were almost identical to 
those of the two 83% validity conditions (Conditions 
7 and 10) of Experiment III. The symptom state- 
ment of homosexuality accompanied 83% of the 
percepts of the two clinically valid signs (Signs 7 
and 8) and 50% of the other categories of per- 
cepts. The only change from Conditions 7 and 10 
was that in Condition 12, nonsexual body parts 
were substituted for a popular invalid sign. The 
percepts for the category of nonsexual body parts 
were: (a) Card II, D-4, hands, (b) Card VII, D-1, 
gray hair, (c) Card IX, Dd-31, nose, (d) Card I, 
Dds-30, eyes, (e) Card II, D-2, foot, and (f) Card 
VI, Dd-25, toe. Thus, each S saw five classes of 
percepts: nonsexual body parts, humanized animals, 
monsters, food, and geographical features, The mode 
of administration of the task was identical to that 
of Experiment III. 


Results 


Table 6 shows the percentage of Ss who re- 
ported each category of percept as a correlate 
of homosexuality. The distribution of Ss 
among the five categories differed from chance 
(x? = 26.23, df= 4, p< .001). As seen in 
Table 6, the two clinically valid signs were 
the most frequently reported categories, and 
6576 of the Ss reported one or the other of 
them. This value is almost double the per- 
centage reported in the comparable conditions 
9f Experiment III, and the rise in accuracy 
Was significant (x? = 11.6, df=1,p< .001). 


Condition 13 


: The second part of Experiment IV was de- 
Signed to eliminate one possible source of 
doubt concerning the meaning of the findings 
ìn Condition 12. One might speculate that the 


TABLE 6 


PRECENTAGE OF Ss REPORTING EACH CATEGORY OF 
PERCEPT AS A CORRELATE OF HOMOSEXUALITY 
IN THE Two CONDITIONS OF 
ExPERIMENT IV 


Percentage of Ss re- 
porting each category 


of perce] 
Category Sn 
Condition | Condition 
12* 13b 

Clinically valid signs 

Part animal—part human 27 17 

Human or animal—monstrous 38 18. 
Filler categories 

Body parts 15 23 

Food 3 24 

Geographic features 8 9 
Other correlates 2 4 
No correlate reported 7 4 


^ Eighty-three percent contrived validity for Signs 7 and 8, 
50% for the other three categories. 
b Fifty per cent contrived validity for all five categories, 


greater accuracy in Condition 12 than in 
Conditions 7 and 10 is not attributable to 
accurate observation of the valid correlates, 
but to illusory correlation. Table 3 showed 
that the clinically valid signs have a slightly 
stronger verbal associative connection to 
homosexuality than do the two filler cate- 
gories. This associative connection might pro- 
duce illusory correlation when stronger verbal 
associates are absent. 


Method 


In Condition 13 the percepts were identical to 
those used in Condition 12, but each class of percept 
appeared equally often with each symptom state- 
ment. If the increased accuracy of report of valid 
correlates in Condition 12 was an artifact of illusory 
correlation, it should remain in this condition. If 
the increased accuracy of report reflected increased 
accuracy of obseryation, it should disappear. 


Results 


The results appear in Table 6. The distribu- 
tion of people reporting the five categories of 
percepts did not differ from chance (x? = 
5.17, df =4, p > .05), and the two clinically 
valid signs were reported at almost exactly 
the mean frequencies of the other three cate- 
gories of correlates. This finding demon- 
strates that the increased accuracy of report 
in Condition 12 was due to increased accuracy 
of observation, and was not attributable to 
illusory correlation, 
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Discussion 


There was a marked congruence between 
the reports of the clinicians as to their ob- 
servations in clinical practice and the reports 
of the naive observers in the contrived ex- 
perimental situations. Almost none of the 32 
practicing psychodiagnosticians reported ob- 
serving either of the two signs of male homo- 
sexuality that research findings indicate are 
clinically valid. Instead, they tended to agree 
with one another in reporting several signs 
that appear invalid in published research, but 
which have a high strength verbal associative 
connection to the symptom of homosexuality. 

The naive observers presented with con- 
trived clinical materials reported similar er- 
roneous observations. They reported the as- 
sociatively based signs as illusory correlates 
of homosexuality both when there were no 
valid relationships present and when the two 
clinically valid signs had a contrived validity 
in the task materials. 

One of the most striking findings of these 
studies is the persistence of illusory correla- 
tion in the face of contradictory reality. Even 
in the two conditions in which all of the 
percepts of the two valid signs were paired 
with homosexuality, these two valid signs 
were reported as correlates of the symptom 
less often than the associatively based in- 
valid signs. In addition, the low accuracy of 
report of the valid signs cannot be attributed 
entirely to there being too much information 
for the observer to process. Instead, the low 
accuracy is, to a considerable degree, a result 
of Ss’ susceptibility to the illusory correla- 
tion. Experiment IV showed that, when the 
percepts that have a high associative con- 
nection to the symptom were not presented, 
the observers almost doubled their accuracy 
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of report of valid correlates. The illusory cor- 
relates blind the observer to the presence of 
valid correlates of the symptom. 

Associatively based illusory correlation is 
a powerful bias in the observational report of 
correlations between classes of events. Yet its 
influence is so unapparent that many prac- 
ticing psychodiagnosticians have overlooked 
it, and have substituted illusory correlates for 
valid correlates in their diagnostic practice. 
One possible solution to this problem might 
be to demonstrate to graduate students, as 
part of their training, their own propensities 
toward illusory correlation. 
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THINKING DISORDERS IN PARENTS OF YOUNG 
PSYCHOTIC CHILDREN ' 


ERIC SCHOPLER ax» JULIE LOFTIN 


University of North Carolina School of Medicine 


In this study parents of young psychotic children were tested on the Object 
Sorting Test. The sample differed from the replicated studies in that parents of 
adult schizophrenics were tested in those studies. Since childhood psychosis 
has substantial differences from adult schizophrenia, it was anticipated that 
parents of psychotic children would not show the same extent of thought im- 
pairment found in parents of adult schizophrenics. Parents of psychotic chil- 
dren, however, showed more thought impairment than parents of normal 
children. Mothers of psychotic children showed more impairment than fathers. 
Findings were interpreted to suggest circumscribed test anxiety in association 
with a psychotic child rather than representing a formal thought disorder. 


The interaction between schizophrenics and 
their parents has received considerable atten- 
tion during the past 10 yr. These parents have 
been found to be rejecting and emotionally 
cold (Kanner, 1943). They involve their 
children in a “double bind” (Ruesch & Bate- 
son, 1951) in which the child can make no 
correct response; they engage in pseudomu- 
tuality (Wynne & Singer, 1963) where inter- 
personal communication is garbled; and they 
have been found to suffer from “parental per- 
plexity” (Goldfarb, 1958), including attitudes 
of uncertainty, lack of spontaneity, and lack 
of appropriate empathy and awareness of the 
child’s needs. Bettelheim (1950) has been 
most articulate about linking parental pathol- 
ogy with the etiology of the extreme stress 
which produces the schizophrenic develop- 
ment in children. His treatment resolution has 
been to conduct a parentectomy and to pro- 
vide residential treatment for the child which 
Prevents his return to the toxic parental in- 
fluence. 

The above literature systematically deletes 
any consideration of schizophrenia as based 
on biogenetic impairments. No consideration 
is given to family members as being reactive 
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to the schizophrenics’ elusive deficiencies. 
Rimland (1964) has argued that parents of 
autistic children have been held responsible 
for being the primary cause of their child's 
autism, although the origins of the condition 
were to be found in genetic and organic 
sources, Frank's (1965) review of the litera- 
ture also challenged the role of parental pa- 
thology in the formation of childhood pathol- 
ogy. He argued that research evidence on 
parental contribution to childhood disorders 
was not consistent with the emphasis given 
by psychodynamic theories and formulations. 
This etiological emphasis has face validity, 
especially when considering disordered think- 
ing in parents of young, psychotic children. 
These parents most often function within 
normal limits and also have other normal 
children. 

The process of disordered thinking has been 
considered a central mechanism by which 
pathology is transmitted from parent to child. 
Thought disorders which may manifest them- 
selves in illogical concepts, fragmented or 
blurred attention, inability to interpret and 
communicate the meaning of objects and 
events, have been for many investigators the 
cardinal characteristics of schizophrenia. Lidz, 


Cornelison, Terry, and Fleck (1958) pre-, 


sented lucid clinical examples of how the 
parents of schizophrenics maintained self- 
concepts and thinking styles which were in- 
compatible with reality. As the parents dis- 
torted reality to suit their own needs, their 
children received continuous training in irra- 
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tionality. Nine of the 15 patients had at least 
one parent who was considered schizophrenic, 
paranoid, or ambulatory schizophrenic. None 
of these parents, however, had ever been hos- 
pitalized in a mental hospital. They all be- 
longed to the middle or upper middle class 
and were functioning adequately socially. 
Since the finding of schizophrenogenic thought 
disorders in the parents was based on sophisti- 
cated clinical observation and analysis, it was 
not at once obvious that explicit and objec- 
tive measures of thought disorder would sup- 
port this clinical finding. 

Lovibond (1954) developed such an ob- 
jective scoring system for measuring impair- 
ment in conceptual thinking. He defined con- 
ceptual thinking in Pavlovian terms. A con- 
cept was considered a cortical representation 
of a generalized relationship between the ob- 
jects, events, and processes of the external 
world. He regarded conceptual thinking as 
the highest or third stage in the process of 
acquiring subjective representations of the 
real world. The first stage was simple un- 
analyzed sensation, followed by perception, in 
which the formation of complex systems of 
cortical linkages permitted the registration of 
objects. At this stage S was still bound to 
immediate experience, until the secondary sig- 
naling system, based on language, freed the 
individual from the particularities of concrete 
experience. 

Lovibond used the Object Sorting Test 
(OST) as described by Rapaport (1945) for 
measuring impairments in conceptual think- 
ing. He developed an objective scoring method 
for rating the type of impairment in S's re- 
sponse. Both administration of the OST and 
the scoring system were described by Lovi- 
bond (1954). In his pilot study, Lovibond 
tested 32 schizophrenics hospitalized for elec- 
troconvulsive therapy (ECT) or insulin treat- 
ment and correlated their test scores with 
psychiatric ratings of thinking disorder. He 
found agreement between test results and 
clinical ratings. Both showed significantly 
higher impairment scores for schizophrenics 
than for a normal control group. McConaghy 
(1959) extended the use of the OST to the 
parents of schizophrenics. He selected 20 
schizophrenics who showed clear indications 
of thought disorder. In testing their parents, 
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he found they obtained significantly higher 
impairment scores than did a randomly se- 
lected control group. At least one parent of 
each schizophrenic S reached the impairment 
level. McConaghy suggested that this find- 
ing demonstrated a genetic transmission of 
thought disorder, specifically in the dominant 
rather than the recessive mode. 

The McConaghy experiment was replicated 
by Lidz, Wild, Schafer, Rosman, and Fleck 
(1963) with improved controls. Hospitalized 
schizophrenics who showed disordered think- 
ing clinically and on projective tests were 
tested with the OST. Parents of nonschizo- 
phrenics were tested to control for the effect 
of parenthood itself, and the vocabulary sub- 
test of the Wechsler Adult Intelligence Scale 
(WAIS) was given to control for the effects of 
intelligence. Only 5 of the 20 parents in the 
experimental group, in comparison with 12 out 
of 20 in the McConaghy series, had impair- 
ment scores. Hence the McConaghy findings 
were not confirmed. When the protocols were 
rescored by Lovibond, however, it turned out 
that he tended to assign higher scores for 
faulty grouping. When a new set of raters 
used the corrected Lovibond scoring system, 
the McConaghy findings were confirmed. His 
interpretation of a dominant genetic trans- 
mission was maintained. i 

Because of the great potential value of this 
technique for studying the etiology of psy- 
chosis and also for epidemiological investiga- 
tions, a second replication was conducted by 
Rosman, Wild, Ricci, Fleck, and Lidz (1964). 
Using a larger sample than before, they in- 
vestigated the relationships of sex of parent, 
age, intelligence, and social class to the test 
measures of thought disorder. A higher fre- 
quency of pathological scores was found 
among mothers of patients. On the other hand, 
there was nò difference between fathers of 
patients and control fathers or mothers. The 
age of the parents did not affect their im- 
pairment scores. Their intelligence, however 
did. At the lower levels of intelligence, parents 
in both groups obtained higher impairment 
scores and could not be distinguished from 
each other. Only in the higher levels of in- 
telligence, education, and occupation coul 
the patient-parent group be differentiate 
from the control group. The evidence to date 
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is clear that a greater incidence of illogical 
and inappropriate conceptual thinking is 
found among parents of psychotic patients 
than among parents of nonpsychotic patients, 
Rosman et al. (1964) correctly inferred that 
the findings neither confirm nor deny genetic 
sources of schizophrenia. 

In the previous studies the experimental 
groups were parents of adult schizophrenics. 
The present study is a replication, using par- 
ents of children diagnosed as psychotic or 
autistic. There is growing evidence in the 
literature that early childhood psychosis may 
not be the antecedant condition to schizo- 
phrenia in adulthood. Brain damage (Pollack, 
1967) and retardation (Gittelman & Birch, 
1967) may play a greater determining role 
in childhood psychosis. According to Rimland 
(1964) neurological and biochemical factors 
are the major determinants of infantile au- 
tism. If this is true, it might be expected 
that parents of psychotic children will not 
show the same impairment in conceptual 
thinking shown by parents of adult schizo- 
phrenics. Although parents of psychotic chil- 
dren have often been characterized by their 
rejecting, perplexed, and other pathological 
child-rearing attitudes, these may be primarily 
parental stress reactions to an unresponsive 
child. In such cases the parents’ formal think- 
ing processes may be within normal limits 
even though they have deviant responses to 
their child. 


TABLE 1 


AGE, SOCIAL Crass, YEARS OF EDUCATION, AND 
IQ or THE THREE PARENT GROUPS 


P: ts’ Parents’ 


Experimental groups | Parents’! Socjal | years of] Parents 
class | “tion 
Parents of psychotics 
(N = 34) 
ange 26-51 | 1-4 | 10-21] 79-134 
P 36 2 14 111 
arents of retardates 
= 22) 
nue 24-49 | 1-5 | 8-21| 80-125 
P 36 3 14 110 
arents of normals 
(N = 42) 
ange 24-48 | 1-3 | 12-25 |102-131 


34 1 18 121 
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This proposition was assessed by measur- 
ing impairment of conceptual thinking in 
parents of psychotic children, parents of re- 
tarded children, and parents of normal chil- 
dren, using the OST. If this proposition is 
true, it could be anticipated that these three 
groups of parents would not differ signifi- 
cantly in their manifestation of disordered 
thinking. The frequency with which at least 
one parent of a schizophrenic showed im- 
paired thinking in the McConaghy study 
(1959) should not be applicable to the parents 
of young psychotic children, Where the moth- 
ers showed greater thinking impairment than 
the fathers (Rosman et al, 1964), such sex 
differences were not expected in the present 
sample. 


METHOD 


Subjects. The experimental group consisted of 
parents of 17 children who had been diagnosed as 
autistic or psychotic. These children had been in a 
clinic program for long-term group therapy (Speers 
& Lansing, 1965) for a period from 3 to 5 yr. Al- 
though most of these children had improved during 
their course of therapy, they had all shown at least 
four of the nine points (Creak, 1961) defining child- 
hood psychosis at some time during their treatment, 
Their parents had been involved in conjoint group 
therapy. The families were primarily in the middle 
class, with above average intelligence, and had other 
nonpsychotic children in the family. 

Two control groups were used. The first included 
parents of 11 retarded children. This group was se- 
lected to be comparable to the experimental group 
with respect to the age of the impaired child, and the 
social class status (Hollingshead, 1957) of the parents. 
The second control group was selected from parents 
of children enrolled in the public schools or kinder- 
garten, who were of comparable age to those in the 
experimental group. All children were under 12 yr, 
of age, with the mean age for each group at 6 yr, 
6 mo. The parents of 21 children volunteered to 
participate. The characteristics of the three groups 
are summarized in Table 1. 

Procedures and measures. The OST materials and 
method of administration were the same as de- 
scribed by Lovibond (1954), McConaghy (1959), 
and Rosman et al. (1964). The S is presented with 
an array of 33 common subjects such as a bicycle 
bell, a hammer, and a screwdriver. In Part I of the 
test, E selects 1 object in a standard sequence and 
tells S to place with it the other objects which be- 
long with it. After each sorting S is asked, “Why do 
these belong together?” In Part IT of the test, E 
presents S with different groups of objects, each 
group sharing a common characteristic such as red- 
ness or roundness. For each grouping S is asked, 
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TABLE 2 
Comparison oF OBJECT SORTING Test SCORES FOR THE THREE PARENT GROUPS 


Impairment scores 


Median split» 


Experimental groups 


0-6 7 or higher «Mdn >Mdn 
A. Parents of psychotics (V = 34) 20 (59%) 14 (41%) 10 (29%) 24 (71%) 
B. Parents of retardates (V = 22) 15 (68%) 7 (32%) 9 (41%) 13 (59%) 
C. Parents of normals (N = 42) 37 (88%) 5 (12%) 23 (55%) 19 (45%) 
^ i d C: x? = 8.59, p < 01. 
erga peces E TEE 


“Why do all of these belong together?” and his 
responses are recorded verbatim. 

The Shipley-Hartford Intelligence test (Shipley & 
Burlingame, 1941) was also administered to all Ss. 
This test correlates well with other intelligence tests, 
and provides a quick and reliable estimate of Ss’ 
intelligence. 

Test scoring. The scoring system for the OST has 
been described in detail by Lovibond (1954). Scores 
are given for eight types of impaired responses which 
display concept formation based on inappropriate, 
irrelevant, and illogical relationships among objects. 
These response types are categorized as syncretistic, 
fabulatory, chain, symbolic, arbitrary, personal 
association, irrelevant, and impersonal association. 
Responses are given weighted scores from 1 to 3, de- 
pending on the extent of the conceptual impair- 
ment. A S's total score is the sum of all his impair- 
ment scores. Correct responses and failures to give a 
response receive no score. Lovibond (1954) deter- 
mined empirically that a score of 7 or higher corre- 
lated with the clinical rating of disordered thinking 
in schizophrenic patients and distinguished them from 
normal controls, The criterion of a score of 7 or 
higher as an impairment indicator was used by 
McConaghy (1959), Rosman et al. (1964), and the 
present study. 

All protocols were given code numbers and scored 
independently by a second E who did not know 
which tests were those of parents of psychotic chil- 
dren, parents of retarded children, or parents of 
normal children. A Pearson correlation between the 
scores of both Es was calculated and a scoring reli- 
ability of r=.93 was achieved. 


RESULTS 


The three groups were compared for both 
the frequency and magnitude of their impair- 
ment scores, Lovibond (1954) used an em- 
pirically determined cutoff score of greater 
than 7. In addition to this cutting score, a 
median split is reported. Either high or low 
scores could be expected to be a random dis- 
tance from the three groups’ median (Mdn = 
3). The magnitude of this distance offers a 
more sensitive measure of relative impairment 


than the cutting score. The impairment scores 
of each group are presented in Table 2. 

Among the parents of psychotics, 41% at- 
tained an impairment score greater than 7 
while among the parents of normals 12% of 
the individuals scored higher than 7. This 
difference was significant at the .01 level. 
Comparing the same two groups, 71% of the 
psychotics’ parents scored equal to or above 
the median, while only 45% of the normals’ 
parents attained scores of this magnitude (x° 
= 491, p < .05). (All x? analyses were made 
using two-tailed test). None of the other com- 
parisons between parent groups showed any 
significant difference for either of the two 
analyses. Since these findings were consistent 
with those reported by Rosman et al. (1964) 
and McConaghy (1959), they did not offer a 
basis for differentiating parents of adult 
schizophrenics from parents of young psy- 
chotic children. 

Table 3 presents the frequency of parent 
couples in each of the three groups with at 
least one high-scoring member and at least 
one member with a score greater than the 
median. 

At least one parent scoring 7 or more Wes 
found in 59% of the parents of psychotics, 
while among the parents of normals, 2476 had 
at least one high-scoring parent (y? = 4.81, $ 
< .05). A parallel difference was found for the 
same two groups for scoring higher than the 
median (y? = 6.66, p < .01). There was no 
significant difference in any other group com- 
parisons in Table 3. Clearly, the expected 
similarities among the three groups did not 
obtain. At least one parent showed thought 
impairment more often in the psychotics’ par- 
ents than in the control couples, similar t0 
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TABLE 3 
FREQUENCY OF AT LEAST ONE HIGH-SCORING MEMBER AMONG COUPLES 
OF THE THREE PARENT GROUPS 
Impairment scores Median split? 
Experimental groups 
0-6 7 or higher <Mdn >Mdn 
A. Parents of psychotics (V = 17) 7 (41%) 10 (59%) 5 (29% 12 (71 
B. Parents of retardates (N = 11) 5 (45%) AEA à 07) 2 (64%) 
C. Parents of normals (N — 21) 16 (76%) 5 (24%) 15 (71%) 6 (29%) 


s Comparison between Groups A and C: x? = 4.81, p <.05. 
P <.01; Mdn = 3. 


b Comparison between Groups A and C: x? = 6.66, 


the findings reported by McConaghy (1959) 
and Rosman et al. (1964). 

The relationship between sex of parents 
and thought impairment is shown in Table 4. 
This table shows comparative findings for 
mothers and fathers in the three groups. 

l Fifty-three percent of mothers of psychot- 
ics had an impairment score of more than 7, 


while 19% of mothers of normals and 5% of 
fathers of normals achieved such a score. 
These are both significant differences. In the 
same comparison for scores higher than the 
median, the mothers of psychotics were higher 
(p < .001). Twenty-nine percent of the psy- 
chotics’ fathers scored 7 or higher, while 19% 
of normals’ mothers and only 5% of the nor- 
mals’ fathers had such a high score. The dif- 
ference between the fathers of psychotics and 
fathers of normals was significant at the .05 
level, while there was no difference between 
fathers of psychotics and mothers of normals. 
In the comparison of scores greater than the 
median, there were no differences among fath- 
ers of psychotics, fathers of normals, and 


mothers of normals. Clearly, there is more 
thinking impairment in the mothers of psy- 
chotics than in the fathers of psychotics. This 
conclusion is further supported by comparing 
mothers and fathers of psychotics. Although 
these mothers have more impairment scores 
than their husbands, this difference was not 
significant. However, the mothers of psy- 
chotics scored higher than the median in 94% 
of the cases, while their husbands scored above 
the median in only 47% of the cases (x° = 
9.06, p < .01). The mothers of psychotics 
also had higher scores than mothers of re- 
tardates, These findings show more indication 
of thought impairment for mothers of psy- 
chotics than any other parent group. This is 
consistent with the findings of the Rosman 
group (1964). The mothers of psychotic chil- 
dren show more thought impairment than 
fathers whether they are mothers of adult 
schizophrenics or mothers of young psychotic 
children. 

Six out of eight significant differences re- 
ported in Table 4 involved the high-scoring 


TABLE 4 


Comparison OF MOTHERS’ AND FATHE! 


rs’ OBJECT SORTING Test SCORES 


FOR THE THREE GROUPS 


Impairment score® 


Median split» 


Experimental groups 


0-6 7 or higher <Mdn zMdn 
An. Psychotics’ mothers (N = 17 8 (47%) 9 (53%) 1 (6%) 16 (94%) 
AG Paychigpice hes = 17) / 12 FHA 5 (29%) 9 (53%) 8 (a1) 
f Bm. Retardates’ mothers (N = 11) 9 (82%) 2 (18%) 6 (54%) m 
Eimesues-D | sos | H5 3D id 
m. Is’ mothers (V = 21 h 
CE NOHARIS how = 21) 20 (95%) 1 6%) 11 (52%) 10 (48%) 


è Significant comparisons: Am and Cm. x* Yd 
j i 5 E 
| ecco 


$ «.01; Amand Bm, x? = 843, p € 


< .05; Ar and Cr, xè = 4.31, $ < .05; Brand Cr, x? = 7.86, p < .0 


01. 
01; Amand Cm, x? = 10.99, p < 001 
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mothers of psychotics, while one involved the 
fathers of psychotics. It is of interest to note 
that the only other significant difference was 
between the fathers of retardates and the 
fathers of normals. Forty-six percent of the 
retardates’ fathers obtained an impairment 
score of greater than 7, while only 5% of the 
normals’ fathers did so (x? = 7.86, p < .01). 
No such evidence of impairment was found 
for mothers of retardates. 

No significant correlations were found be- 
tween impairment scores, IQ, and social class. 
This was not consistent with the findings re- 
ported by the Rosman et al. (1964) study in 
which the parents of psychotics and their con- 
trols could only be distinguished at the higher 
intellectual and educational levels and not at 
the lower levels. The absence of confirmation 
for the Rosman findings in the present study 
may well be accounted for by the relative 
homogeneity in higher social class and intel- 
ligence of all three groups in this study. 


Discussion 


This study was begun with the assumption 
that early childhood psychosis is not pri- 
marily the precursor to the schizophrenia of 
adulthood, and that the thought impairment 
reported for parents of adult schizophrenics 
would not show in parents of young psychotic 
children. Instead, the authors found that the 
parents of psychotics showed greater signs of 
impaired thinking than did the parents of 
normals. There were more couples with at 
least one high-scoring parent in the group of 
parents of psychotics than in the other two 
groups. The impaired thinking was more prev- 
alent for the mothers of psychotics than for 
any other same-sexed parent group: The 
greater degree of thought disorder in mothers 
of psychotics than in the fathers may reflect 
their greater exposure and involvement with 
their impaired children. This explanation, 
however, should also apply to the parents of 
retardates in which the opposite tendency 
was found. The fathers showed a greater 
tendency to high-impairment signs than did 
mothers. These findings may be explained by 
the main differences characterizing childhood 
psychosis and mental retardation. With psy- 
chosis, the child does not relate appropriately, 
involving impairment in both affect and 
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thinking, while with retardation the child 
often relates quite adequately but is incapable 
of normal coping and school achievement, 
According to Parson’s sex-role definition, the 
mother’s role is more involved in expressive 
and emotional adaptations, while the father’s 
role is more oriented toward task achievement, 
The psychotic child’s relationship impairment 
may have a greater impact on the mother, 
while the failure in task achievement may 
have a more adverse effect on the father-child 
relationship. Perhaps such adverse responses 
to a child’s impairment can affect the parents’ 
conceptual thinking. A study testing this 
possibility is described further in the follow- 
ing paragraphs. 

The major issue in this study was that the 
three hypotheses were not confirmed. Instead, 
the findings were in substantial agreement 
with those of previous studies based on the 
parents of adult schizophrenics. Do these re- 
sults suggest that no significant difference ex- 
ists between parents of early childhood psy- 
chotics and parents of adult schizophrenics, 
that the two disorders are closely linked with 
parental thought impairment? Although this 
possibility is open, another explanation for 
the findings must be considered first. The 
parents of the psychotics were in regular at- 
tendance with their children in the clinic in 
which the study was conducted. A similar 
psychiatric involvement was true for the ex- 
perimental groups reported in the previous 
studies. Perhaps the experimental groups 
clinical involvement confounded the parents 
concern with being evaluated and their guilt 
ior having produced a sick child—in short, 
test anxiety may have been confounded with 
a formal thought disorder to be found in all 
areas of the Ss’ thinking. Whether these im- 
pairment scores reflect an impairment circum- 
scribed to the context of a psychiatrically ill 
child or whether they are an attribute of the 
Ss’ formal thinking process in any situation 
was tested in a subsequent study. 

Such a study was initiated. Another Po 
of parents of psychotic children was teste 
on the OST. Instead of conducting the study 
within the context of the psychotic child’s 
psychiatric treatment program, this secon 
group of parents was interviewed about one 0 
their normal children. The purpose of M 


interview was to elicit parental characteristics 
associated with successful child rearing before 
the OST was administered. Less thought im- 
pairment was hypothesized for the parents in 
this second group than was found in the first 
(Schopler & Loftin, 1969). 
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PERCEPTUAL SELECTIVITY IN HALLUCINATORY 


SCHIZOPHRENICS * 
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A manual for categorizing themes present in hallucinations was constructed 
using a representative 10 males and 7 females from 49 actively hallucinating 
patients interviewed. The authors agreed on the major theme for 30 of 32 
remaining Ss, indicating significant interjudge reliability (p < .001, Cohen's k 
= .93). An hypothesis based on cognitive theory and sensory deprivation re- 
search that an hallucinatory theme is congruent with selective recall was sup- 
ported. Twenty-seven available Ss heard a story with themes from the hal- 
lucinatory theme classification manual. In 18 of the 23 Ss on whose dominant 
recall theme two new judges agreed, the major hallucinatory theme category 
predicted the dominant recall theme. This significant relationship (Cohen's 
k=.74, p< 001) appeared meaningful, rather than spurious, based on in- 
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ternal and external criterion measures. 


In clinical work with psychotic patients 
there is little doubt that some patients do, in 
fact, hallucinate. Further, the principle of 
psychic determinism would indicate that this 
phenomenon which, in most formulations, is 
analogous to dreams, should be a source of 
potentially meaningful information which 
could aid in understanding the patient’s per- 
sonality dynamics. There has, however, been 
little systematic scientific research on halluci- 
nations. Consequently, most theories about 
hallucinations have been developed from post 
hoc interpretation of clinical data in the 
framework of psychoanalysis (e.g., Arieti, 
1955, 1959). 

Hebb (1949), a pioneer in the study of 
cognition and brain functioning, has more 
recently (Hebb, 1960) issued a challenge to 
behavioral scientists. More specifically, he 
suggested that contemporary research in sen- 
sory deprivation has brought the scientific 
study of phenomena such as hallucinations 
within methodological reach. The approach 
which Hebb suggests is based in part on the 
fact that some of the Ss who have been sub- 
jected to reduced sensory variation have re- 


+The authors wish to express their appreciation 
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ported experiences which resemble hallucina- 
tions. For a more complete review of these 
studies see Zuckerman and Cohen (1964). 
While the exact relationship of these sensory 
phenomena to clinical hallucinations has not 
been established, there seems to be general 
agreement that the human brain needs varied 
sensory stimulation for organized, logical, 
goal-directed thinking. If the sensory varia- 
tion is not sufficient, a confusion between 
external and internal sources of stimulation 
can result. An hallucination is a specific 
instance of this kind of confusion in which 
the individual attributes an external source 
for stimulation which, in fact, originated 
internally. 

In sensory deprivation studies, E reduced 
the sensory variation by making the environ- 
ment more homogeneous, whereas, it is here 
postulated that the sensory variation of the 
hallucinatory schizophrenic is reduced by 
some form of central inhibition of perceptual 
processes. Recent formulations of brain an 
perceptual functioning (e.g., Miller, Galantet, 
& Pribram, 1960; Pribram, 1963) state that 
only stimuli for which cognitive models are 
available can be perceived; that is, there is 
selective gating of environmental information. 

It is further postulated that by comparison 
with normal individuals, hallucinating schizo- 
phrenics have limited cognitive models. TM 
fore, the hallucinating schizophrenic gate 
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very little information. The hallucination 
occurs in the context of the patient’s reduc- 
tion in sensory variation which causes con- 
fusion between external and internal sources 
of information. Since, in the hallucination, an 
external source is attributed to stimulation 
which originates internally, it follows logically 
that hallucinatory themes must be a valid 
index of the patient’s cognitive models. 

In view of the foregoing theoretical orienta- 
tion the present study was undertaken for 
two reasons: (a) to develop a reliable scoring 
system for categorizing the dominant theme 
present in the hallucinations of schizophrenic 
Ss, as they describe them, and (5) to conduct 
an initial test of the hypothesis that the 
accentuation in perceptual selectivity will be 
congruent with the patient's hallucinatory 
themes expressed in these categories. 


METHOD 
Selection of Subjects 


Staff psychologists in a psychiatric hospital sub- 
mitted a list of patients who, in their judgment, 
met two criteria: (a) were currently hallucinating 
and (b) had been hallucinating for at least 3 mo. 
A total of 32 males and 22 females was obtained. 

The first author interviewed these Ss individually 
with the primary purpose of soliciting from each 
patient a description of his hallucinations. The tape- 
recorded interviews never exceeded 45 min. The hal- 
lucinatory themes were determined for 29 males and 
20 females (3 males and 2 females were eliminated 
because they denied the existence of hallucinations 
or because they were incoherent). 


Procedure 


Reliability study of the scoring manual for hal- 
lucinations. The taped interviews of the 49 Ss were 
transcribed. One-third of these cases (10 males and 
7 females), representative with respect to social his- 
tory and distribution of hallucinatory themes, were 
Selected to construct a scoring manual. A shortened 
Version of the manual is presented in Table 1. 

Both investigators used the manual to score inde- 
Pendently the major theme present in the hallucina- 
tion described in the interview for the 19 males and 
13 females remaining. A summary of the social 
history of these 32 Ss is available from the authors. 

Preliminary predictive validity study. Following 
the establishment of the interrater reliability of the 
Scoring, predictions were made about the direction 
of selectivity and distortion in the perception of the 
Patients on whose hallucinatory theme categories the 
two authors had agreed. 

To test these predictions, a male and a female 
Story were constructed containing themes matching 
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TABLE 1 
SAMPLES OF SCORING CATEGORIES FOR THEMES OF 
HALLUCINATIONS 


Conscience disturbance (male 6%, female 67%) 


4 Examples would be voices say: “you are a tramp,” 
‘you have cancer” (implies you are rotten inside), “you 
are evil,” or “repent your sins,” etc. 


Compensatory grandiosity (male 17%, female 0%) 


Ego ideal (grandiose delusions, etc.). Examples: 
“you are superman, God, etc.,” “you are the best person 
on earth,” “you will live forever,” “no one can hurt 
you," or “you will win, no one can stop you,” etc. 


Level of maturity: Coping with problems (male 56%, 
female 17%) 


Examples for women: patient hears children’s voices 
crying to her for help, or hears children begging her for 
food, etc. 

Examples for men: voices say “stick up for your 
rights," hears voices say ‘‘find a job and get married,” 
etc. 


Impulse control problems (male 17%, female 0%) 


Examples: voices verbalize to eat, drink, and be 
merry (or make Harry or Mary). Voices say “go out 
with no clothes on,” “De licivious," “take a shit," etc. 


Denial of frustrated needs, e.g., dependent needs (male 
0%, female 0%) 


Regressive fantasy content. Examples: voices say 
“you are nice and sweet and the doctor and nurses are 
nice and sweet,” “your mother will take you home 
soon," etc. 


Conflicting preoccupations (male 0%, female 0%) 


Example for women: ‘‘be licivious—you are evil.” 
Example for men: “you are superman—grow up.” 


Impersonal hallucinations (male 6%, female 17%) 


Patient does not see the hallucination as related to 
him. Patient hears *'voices of a crowd cheering,” “cars 
being driven noisily,” or patient sees “just clouds and 
spaces," etc. 


Note:—The percentages shown indicate the proportion of 
males and females whose hallucinatory themes were in that 
category. N —18 males and 12 females. 


six of the hallucinatory theme categories in the 
scoring manual (the impersonal hallucinatory themes 
had to be omitted since they could not be made 
part of the story's content). Each story was tape- 
recorded by both a male and a female who had no 
other connection with the current study. 

The patients were brought in individually for a 
second time. The males heard the male story and the 
females heard the female story. The examiners were 
two students, a male and a fmale, who divided the 
cases equally. The sex of the voice presenting the 
story and the sex of the examiner were counter- 
balanced in a Latin-square design. The S was then 
asked to retell the story immediately after he had 
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heard it. His reproduction was tape-recorded. The 
male and female stories are also available from the 
authors. 

The prediction was that the stories would be dis- 
torted, the theme remembered, or emphasized being 
congruent. with the dominant theme of the patient's 
hallucination. Only 25 cases (out of the 30 on which 
the two investigators agreed) were used for the valid- 
ity study since 3 cases (two female and one male) 
had impersonal hallucinations and two females had 
been discharged before the data could be collected. 

The 25 auditory reproductions were transcribed 
and two new students rated them independently for 
the dominant theme in the recall. Following the es- 
tablishment of the interrater reliability of the audi- 
tory reproductions, the strength of the relationship 
between the major theme of the hallucination of each 
patient and the obtained emphasis in reproduction of 
his story was assessed. 

To guard against the possibility that the distortions 
in the recall of the hallucinating Ss were spuriously 
related to hallucinatory content, the story recalls 
were also elicited from the control group of 25 (17 
males and 8 females) nonhallucinating psychiatric 
patients. A comparison of the dominant recall themes 
of these two groups was made. 


RESULTS AND DISCUSSION 
Reliability Study of the Scoring Manual 


The scoring manual defined the seven hal- 
lucinatory theme categories, The two authors, 
both clinical psychologists, who independently 
rated the patients’ hallucination descriptions 
for the main theme present, agreed on 30 of 
the 32 cases, yielding a binomial z of 13 
(p < .001). 

To express this finding as a reliability co- 
efficient, Cohen’s (1960) Kappa (k) coeff- 
cient of agreement for nominal scales (a non- 
parametric statistic) was used. The & value 
was .93 indicating that, with this sample of 
Ss, the scoring based on the manual showed 
high interrater reliability. 


Preliminary Predictive Validity Study 


The two students agreed on the dominant 
themes present in the reproduced auditory 
story for 23 of the 25 cases, Again, the bi- 
nomial test was computed and the z value was 
9.6 (p< .001). The A coefficient was .90. 
Hence, scoring of the auditory reproduction 
data also had good interrater reliability. 

Finally, as a preliminary predictive validity 
measure, the agreement between the predic- 
tions from the dominant theme of the pa- 
tient’s hallucinations and the dominant theme 
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of his story recall was assessed for the 23 
cases on whose recall theme the students 
agreed. For 18 of the 23 Ss, the predicted and 
actual dominant recall themes agreed, yield. ' 
ing a binomial z of 7.6 (p < .001) and a k 
coefficient of .74. 

Predictions, based on cognitive theory and 
the empirical results of research on sensory 


deprivation, were supported in the results of 
the present study. The possibility remained 
that the obtained relationship between the 
patients" hallucinatory themes and their selec- 
tivity in recall was spurious. 

Support for the interpretation that the 
relationship between hallucinations and per- 
ceptual selectivity was meaningful came from 
two sources. First, if the congruence were the 
result of an artifact, then most of the suc- 
cessful predictions and a very small portion 
of the unsuccessful ones should be found in 
the high-frequency hallucinatory themes (i.e. 
immaturity for males and conscience for 
females), with the converse holding for the 
infrequent themes. In the data from males, 
there was little indication of a disproportion- 
ately high success to failure ratio in the im- 
maturity category, since 75% of the successes 
and 67% of the failures came from this domi- 
nant hallucinatory theme category. In addi- 
tion, there were no unsuccessful predictions 
in the most infrequently used hallucinatory 
theme categories (ie, compensatory grandi- 
osity and impulse control). For the females, 
50% of the prediction successes came from 
the conscience category but so did the only 
prediction failure. Thus, for both sexes, there 
seemed little indication of the disproportionalt 
success to failure ratio that would be expected 
if the relationships between hallucinatory 
themes and selective recall were due to a? 
artifact. 

Table 2 indicates that the dominant story 
recall themes of the hallucinating pail 
differed significantly from those of RE 
group nonhallucinating patients. When 
males of these two groups were compared m 
the basis of dominant recall theme of im 
maturity versus all other, the x* of 610, 
df = 1, p < .02 indicated that the groups a 
fered significantly i in dominant recall sf 
When the female hallucinating and non es : 


lucinating patients were compared with CO" 
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science related dominant recall themes versus 
all other themes, a Fisher’s exact test was 
used because the small number of cases in- 
volved resulted in expected cell frequencies so 
small that the use of the x? statistic was not 
appropriate. The Fisher’s exact p was .05 
indicating that for the females as well as for 
the males, the dominant recall themes of the 
hallucinating and nonhallucinating patients 
differed significantly. When the probabilities 
for males and females were combined using 
the Lindquist (1940) method, the combined 
probabilities x? was 23.41, df = 4, p < .001. 
From the standpoint of both internal (pre- 
diction of the success to failure ratio within 
the hallucinatory theme categories) and exter- 
nal (comparison of the dominant story recall 
themes of the hallucinatory and nonhallucina- 
tory patients) criteria, there was support for 
a meaningful, rather than a spurious, con- 
gruence between the dominant hallucination 
theme and the dominant story recall theme. 
The finding that the dominant theme in the 
hallucinations of males was different from the 
one for females seemed noteworthy. Arrested 
maturity themes predominated in the males’ 
hallucinations, while conscience related themes 
were most frequent for females. The pre- 
occupation with what they have not done in 
the hallucinations of the males and the 
females heightened concern in their halluci- 
nations with what they have “done” parallels 
findings that the behavior pathology of male 
schizophrenics tends to center around exces- 
Sive withdrawal and passivity, while that of 
the females usually is related to overly aggres- 
sive and dominant behavior (e.g. Cheek, 
1964; Gross, 1959; Lorr, O'Connor, & Staf- 
ford, 1960). It might also be noted that the 
Specific dominant themes in the hallucinations 
of the males and females center about their 
apparent inability to conform to the culturally 
defined appropriate sex roles. 
1f the theoretical model used to predict the 
Perceptual selectivity of the hallucinatory 
Schizophrenic patients in the present study 
gains additional empirical support, it would 
have many potential implications for the in- 
terpretation of schizophrenia. Viewing schizo- 
Phrenic behavior as resulting primarily from 
_ à deficiency in cognitive models (or schemas) 
for coding incoming stimuli, it would follow 
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TABLE 2 


DOMINANT OR ACCENTED THEMES IN THE STORY RECALL 
or HALLUCINATING AND NONHALLUCINATING 
PSYCHIATRIC PATIENTS 


Males Females 
Patients Con- 
Imma- All All 
turity | other Miser other 
themes | themes | themes | themes 
Hallucinating 14 3 6 3 
Nonhallucinating 7 10 2 6 


Note.—For males x*=6.10, df=1, p<.02; for females 
Fisher's exact p =.05, 


that schizophrenic patients lack models for 
many, and in some cases, most of the stimuli 
they encounter.’ Thus, they cannot use much 
of the information potentially available to 
them, with resultant reduction of sensory 
variation below the levels needed for distin- 
guishing between external and internal sources 
of stimuli and for maintaining organized, 
goal-directed perception and thinking. Using 
the above orientation, the poor judgment, the 
inadequate contact with reality, and the pri- 
mary process thinking, all of which are diag- 
nostic of schizophrenia, would be explained on 
the basis of a deficiency in cognitive models. 
In the current situation, as is the case in 
almost every situation, no single theoretical 
system can claim exclusive rights to a par- 
ticular prediction. However, the cognitive- 
theory-based prediction of the congruence be- 
tween the dominant theme of a patient’s hal- 
lucination and the dominant theme of his 
story recall, would not seem to follow as di- 
rectly from current perceptual and personality 
theories, or from the projective test “theory,” 
that is, the basic assumptions underlying 
projective testing. s 
Current perceptual, personality, or projec- 
tive theory might be extended to cover hallu- 
cinations and psychotic patients and to pre- 
dict possible relationships between themes in 
patients’ hallucinations and selectivity in 
their perception and recall of stories. It is not 
enough to talk simply about projection. It is 


3 This formulation does not imply that the cogni- 
tive deficit is necessarily permanent, nor does it have 
any direct bearing on the issue of whether either 
the disease process or the cognitive deficit is cause 


or effect. 
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Fic, 1, Comparison of stress and nonstress groups responses on word association test. 


dard deviations for the control group were 
37.73 © 11.18, 36.53 + 5.14, 14.73 + 5.43, 
and 11.00 + 9.74 most common, common, un- 
usual, and unique, respectively. A £ test dem- 
onstrated that the stress and nonstress groups 
differed significantly on both the most com- 
mon and the unusual categories of response— 
the stress group yielding significantly less 
most common responses at an .05 level and 
significantly more unusual responses at an 
-001 level. However, there was no significant 
difference in the frequency of common and 
unique responses between the two groups, the 
stress group averaging 36 common Tesponses, 
the nonstress group 37. The mean number of 
unique responses was identical at 11 for the 
two groups. Using Cochran’s test of Homo- 


geneity of Variance a trend toward less vari- 
ability was noted throughout the four cate- 
gories in the stress group, the unique cate- 
gory yielding significantly less variablity at 
a .005 level. 


DISCUSSION AND CONCLUSION 


It appears that certain classes of responses 
to the word association test are affected by 
actual stress while others are not. For the 
practicing clinician it is important to UK 
what these classes are, for such differentia 
responsiveness to situational stress and anx- 
leties must be taken into Arena when 
interpreting diagnostic materials. 

The esene which was demon 
strated to be unaffected in frequency by act! 
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stress, is of primary importance to the diag- 
nostician. That these responses were unaf- 
fected by temporary stress would indicate that 
they may be manifestations of deep seated 
more permanent emotional trends. The unique 
response of “hate,” for example, to the stimu- 
lus word MOTHER is usually more illuminating 
to the therapist than would be the response 
“father,” which is given by 70% of the norm 
group. Thus the demonstrated stability of 
these unique responses indicates their impor- 
tance within projective testing, and serves as 
a validating aspect of the word association 
test. 

Saltz (1961) obtained similar results in a 
study of the effects of experimentally induced 
stress on the word association test. His find- 
ings indicated a highly significant decrement 
in common responses, yet no tendency toward 
"novel" responses as a function of his opera- 
tionally defined stress. This similarity of re- 
sults would seem to indicate the efficacy of 
experimentally induced stress in such studies. 

Saltzs purpose was to determine whether 
stress should be considered a stimulus or a 
drive. He ascertained that if a stimulus, stress 
should “produce characteristic stress re- 
sponses which may compete with the common 
free associations." However, if a drive, stress 
should “multiply the strength of both common 
free associations and their competing stress 
tesponses. Consequently, the drive character- 
istics of stress should strengthen the strong 
common free associations disproportionately 
more than the weak common associations. 
Thus, the strong common associations should 
suffer less of a decrement than the weak com- 
mon associations.” The author’s work, al- 
though based on actual rather than experi- 
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mentally induced stress, supports Saltz’s find- 
ing that the former condition was demon- 
strated to be the case, indicating the stimulus 
rather than drive qualities of stress. 

Tn addition, this experiment would seem to 
offer some resolution to the Goldstein and 
Sarason discussion. These results, due to the 
four-category system, seem to offer a resolu- 
tion for the argument, demonstrating that the 
lower commonality score may not be due to 
a greater frequency of rare or unique re- 
sponses, but to the particular classification 
system used to categorize responses, Thus, 
using this system of classification demon- 
strated that Sarason’s lower commonality 
scores might be attributed to a lowered fre- 
quency of most common responses and a 
greater frequency of unusual responses in the 
stress situation. 

The lowered variability of responding in 
the experimental group seems to indicate 
an increase of rigidity precipitated by the 
stressful situation. 
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SOCIAL ADJUSTMENT AND FAMILIAL SCHEMA + 
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University of California, Santa Barbara 


Two groups of 25 male undergraduates with good and poor social adjustment 
as determined by a questionnaire freely placed figures of a father, a mother, a 
son, and a daughter on a field. As predicted, Ss with poor social adjustment 
placed the son closer to the father, while Ss with good social adjustment 
placed the son closer to the mother (p < .01). Both groups placed the daugh- 
ter closer to the mother (p < .01). The results parallel those obtained from 
emotionally disturbed versus normal boys. 


Kuethe (1962a, 1962b, 1964) has demon- 
strated the existence of common social 
schemata, that is, fundamental organizing 
principles in social perception. For example, 
when male undergraduates are asked to place 
felt figures of a man, a woman, and a boy on 
a field, they more frequently place the boy 
closer to the woman. Weinstein (1965) has 
recently employed the same technique to in- 
vestigate the familial schemata of emotionally 
disturbed boys. Like the majority of Kuethe’s 
undergraduates, Weinstein’s normal control 
boys placed the son closer to the mother. The 
disturbed boys, on the other hand, placed the 
son closer to the father. Weinstein interpreted 
these data as reflecting a distant and negative 
mother-child relationship on the part of the 
disturbed boys. The present study sought to 
account for such atypical familial schemata 
within the “normal” range of functioning. It 
was hypothesized that whereas well-adjusted 
normal males exhibit a familial schema in 
which the son is closer to the mother, less 
well-adjusted normal males display a familial 
schema in which the son is closer to the father, 


METHOD 


One hundred male undergraduates who had lived 
their entire lives (until entering college) with their 
biological parents as an intact family unit served 
as Ss. Adjustment was assessed by a self-administered 
questionnaire stressing social-sexual interests and 
activities in high school and college such as friend- 
ship patterns, group membership, dating behavior, 
and so forth (Goodman, 1968). This “social adjust- 


1This research was supported by a faculty re- 
search grant from the University of California to 
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ment scale,” which was designed for use with college 
normals, correlates highly with a premorbid adjust- 
ment scale (Phillips, 1953) widely used in research 
in schizophrenia (Garmezy, 1968; Rodnick, 1967). 
Goodman (1968) administered the scale to 29 schizo- 
phrenics whose premorbid adjustment scores were 
known; a product-moment correlation between the 
scale scores yielded a coefficient of .75 (p < .0005). 
Possible scores on the scale range from 28 (poor 
adjustment) to 88 (good adjustment). Extreme 
quartiles were selected as the poor social adjustment 
(PSA) and good social adjustment (GSA) groups 
with mean scale scores of 55.7 and 78.8, respectively. 
The PSA and GSA groups were similar in age and 
socioeconomic status. 

Kuethe’s (1962a) Felt Figure Technique was used 
to determine familial schema. The S was instructed 
to place four red felt figures (Figure 1) consisting 
of a man (father), a woman (mother), a boy (son), 
and a girl (daughter) “in any manner you wish” 
on a 6 X3.5 ft. vertically mounted green felt field. 
The figures were handed to S in a pile, the positon 
of the figures in the pile being randomly varied. 
Extensive testing in this laboratory has shown that 
Ss invariably identify the sex and relative age of 
the figures correctly. The figures were presented 


Fic. 1. Felt figures. 
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TABLE 1 
PLACEMENT OF FELT FIGURES 
Interfigure distance (in.) Interventions 
Group | Total group | Female sibling(s) | No female sibling | Total group Female sibling(s) | No female sibling 
Mother- | Father- | Mother- | Father- | Mother- | Father- | Mother- | Father- | Mother- | Father- | Mother- | Father- 
son son son son son son son son son son son son 
PSA 
N 25 17 8 25 17 8 
M | 1222 | 10.32 | 14.16 | 12.07 | 8.12 6.61 84 .56 94 71 .62 .25 
ore 8.76 | 624 | 9.78 | 6.73 | 3.43 228 | .61 -70 .64 15 .49 | 44 
GS. 
N 25 18 7 25 18 7 
M 718 | 9.38 7.00 | 9.04 | 7.62 | 10.25 -36 .80 338 78 A3 .86 
SD | 3.14 | 3,93 3.53 | 418 | 1.67 3.04 | .56 .69 57 .62 .50 .83 


together to S, rather than a pair at a time as in 
Weinstein's (1965) study, in order to enhance the 
"family group" percept. The distance between the 
centers of the heads of the figures was measured to 
the nearest à in. in the S's absence. The S’s familial 
schema was obtained prior to administration of the 
social adjustment questionnaire. 


RESULTS 


As may be seen from the left half of 
Table 1 (total group column), the PSA Ss 
placed the son figure closer to the father than 
to the mother figure, while the GSA Ss placed 
the son figure closer to the mother than to the 
father figure—Groups X Parents interaction, 
F=7.29, df= 1/48, p<.01. In terms of 
relative frequency, 76% of the GSA group 
placed the son figure closer to the mother 
than to'the father figure, whereas only 40% 
of the PSA group displayed this schema 
(x? = 5.25, df = 1, p < .025). Had the “poor 
adjustment" end of the assessment scale been 
better represented, it seems probable that the 
contrast between the groups would have been 
even sharper. 

It should be noted that the above differ- 
ences in familial schema between the PSA and 
GSA groups were not merely child-parent dif- 
ferences, but were sex specific for the child 
figure of the same sex as Ss, that is, the son 
figure. The combined groups placed the 
daughter figure closer to the mother than to 
the father figure—parents effect, F = 16.39, 
df = 1/48, p < 01, with the Groups X Par- 
E interaction not approaching significance 
FUKT 

“Closeness” between son and a given parent 


figure may be viewed not only in terms of 
interfigure distance, but also in terms of the 
number of other figures (i.e., other parent 
and daughter) placed between the son and 
that parent figure. The right half of Table 1 
(total group column) reveals that the PSA 
group placed a greater number of such inter- 
vening figures between the son and mother 
figures than between the son and father 
figures, whereas the GSA group more fre- 
quently placed interventions between the son 
and father figures than between the son and 
mother figures—Groups X Parents interaction 
F = 7.20, df = 1/48, p < .01.? This was true 
whether the intervening figure was that of 
the other parent—Groups X Parents interac- 
tion F = 4.50, df = 1/48, p< .05, or the 
daughter—Groups X Parents interaction F = 
4.04, df = 1/48, p = .05. The same trend was 
reflected in the proportions of PSA and GSA 
Ss placing more interventions between son 
and mother figures, no interventions between 
son and either parent figure, and more inter- 
ventions between son and father figures 
(à = 9.16, df = 2, p < .02). Again, these dif- 
ferences in schemata proved sex specific for 
the son figure, since the combined groups 
placed fewer interventions between the daugh- 
ter and mother figures than between the 
daughter and father figures—parents effect 
F = 23.44, df = 1/48, p < .01, the Groups X 
Parents interaction falling far short of signifi- 
cance—F < 1. 


3 Keuthe (1962a) also reported that the schema 
in which the man was placed between the woman 
and boy was relatively rare. 
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Since a daughter figure was used with all 
Ss, subanalyses were conducted with respect 
to whether or not the families of the Ss actu- 
ally contained a female sibling (Table 1). 
While all such analyses yielded results which 
lay in the predicted direction, only that which 
treated interfigure distance for Ss with female 
siblings reached the .05 level of significance, 
probably owing to the reduction in JV. 


Discussion 


The results indicate that normal males who 
differ in the adequacy of their social adjust- 
ment also differ in their familial schema. The 
familial schema of well-adjusted males in- 
cludes greater proximity of the male child to 
the mother, with fewer family members per- 
mitted to come between mother and son. 
Relatively poorly adjusted males, however, 
possess a familial schema of greater distance 
and more interventions between mother and 
son. The relative position of the male child 
in the poorly adjusted male’s schema repre- 
sents a sex-specific deviation, since both 
poorly and well-adjusted males maintain 
close proximity between female child and 
mother. Inasmuch as “The schema that places 
a child with a woman . . . is probably one 
of the first specific social schemata developed 
by most children [Kuethe, 1964, p. 253],” 
and insofar as the physical distance placed 
between human figures may be equated with 
emotional distance (Weinstein, 1965), the re- 
sults may reflect a negatively toned maternal 
relationship in the early lives of males whose 
social-sexual adjustment is poor—at least as 
perceived by the child. 

An alternative explanation of the results 
might argue that the obtained data reflect 
identification with the father on the part of 
the PSA Ss, and with the mother by the GSA 
Ss. However, when asked which parent they 
“feel most like" (perceived similarity), there 
was a nonsignificant tendency for the PSA 
group (32%) to report “mother” more fre- 
quently than the GSA group (16%). Further, 
from the very nature of the adjustment ques- 
tionnaire, in which items deal with social and 
sexual activity and assertiveness, it is ap- 
parent that the sex typing of the GSA group 
is more masculine than that of the PSA group. 
Although these counterarguments do not defin- 
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itively refute an identification hypothesis, they 
do make unlikely any simple explanation of 
the PSA and GSA schemata on the basis of 
identification. 

A final problem relates to possible influ- 
ences of a social desirability set (Crowne 
& Marlowe, 1964). However, when the 
Marlowe-Crowne  Social-Desirability Scale 
was correlated with the social adjustment 
scale in this laboratory, the resulting 
coefficient was small (r= —.31, df =22, 
p> 10). Further, evidence of the distorting 
effects of social schemata on judgment indi- 
cates that the schemata are not “merely the 
result of subjects attempting to respond the 
way they believed was ‘correct’ or the way 
‘most people’ would respond [Kuethe, 1962b, 
p. 74].” Thus, explanation of the results in 
terms of social desirability would seem some- 
what improbable. 

Using the same social adjustment scale as 
the present study in a two-choice decision 
situation, Goodman (1968) found that PSA 
Ss responded more slowly when influenced by 
a maternal than by a paternal figure, while 
GSA Ss responded with more nearly equal 
latencies to maternal and paternal figures.“ 
Since long latencies have been reported to be 
indexes of conflict (Rotter, 1951), Goodman’s 
results appear consonant with those of the 
present study. The present results also paral- 
lel those of Weinstein (1965) obtained from 
emotionally disturbed boys. This parallelism 
is of importance not only in the fact that 
similar differences in familial schemata hold 
at different age levels, but also that their 
appearance may be predicted within the 
normal population as well as between normal 
and pathological groups. Familial schemata 
are apparently sensitive to relatively subtle 
variations in adjustment and, by inference, 
early parent-child relationships. 

4 The caption under Goodman's Figure 3 (p. 509) 


is incorrect and should be replaced by the caption 
under Figure 4 (p, 510). 
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APPROACH AND AVOIDANCE RESPONSES OF THE HYSTERICAL 


PERSONALITY TO SEXUAL STIMULI* 


MARION O'NEILL ? anp BERNHARD KEMPLER € 


Emory University 


Thirty-two female hysterical personalities and 32 nonhysterical personalities 
performed a paired-associates learning task and a visual recognition task. Both 
tasks included neutral and sexual material. Half the Ss in each group per- 
formed under sexually “seductive” conditions, the other half under sexually 
neutral conditions. Results showed that (a) hysterics learned sexual material 
quickly under neutral conditions but relatively slowly under sexual conditions; 
(b) hysterics had a relatively low visual recognition threshold for sexual words 
under neutral conditions and a relatively high threshold under sexual condi- 
tions; and (c) these threshold differences were maximized when sexual stimuli 
referred to the self. It was concluded that the characteristic behavior of the 
hysteric was consistent with an approach-avoidance conflict involving sexual 


motives and sexual behavior. 


The earliest references to the “hysterical 
personality” (e.g., Jaspers and Janet as 
quoted by Muncie, 1939) were descriptions 
of the personality characteristics of individu- 
als with conversion symptoms. More recently 
several writers have advanced the concept of 
hysteria as a personality dimension, even for 
individuals who show no evidence of conver- 
sion symptoms (Chodoff & Lyons, 1958; 
Fidern, 1940; Fitzgerald, 1948; Schilder, 
1939; Thornton, 1948). With the exception of 
psychoanalytically oriented case studies and a 
few studies testing the hypothesis of the 
greater suggestibility of conversion hysterics, 
the literature on the hysterical personality 
has not been extensive. The absence of an 
objective definition, as well as the difficulties 
in testing the psychoanalytic hypotheses con- 
cerning fixation at different psychosexual 
stages, has contributed to the lack of empirical 
work in this area. 

The present study is concerned with the 
frequently observed combination in the female 


1 This investigation was supported in part by a 
Public Health Service Fellowship No. 5-FI-MH-24, 
454-02 (MTLH) from the National Institute of 
Mental Health and is based on an unpublished doc- 
toral dissertation carried out by Marion O'Neill 
under the direction of Bernhard Kempler. 

? Now at the Georgia Mental Health Institute. 

3 Requests for reprints should be directed to 
Bernhard Kempler, who is now at the Department 
of Psychology, Georgia State College, Atlanta, 
Georgia 30303. 


hysteric* of sexually “provocative” behavior 
with the denial of sexuality. Chodoff and 
Lyons (1958) in their review of the literature 
on the hysterical personality found frequent 
mention, on the one hand, of “lasciviousness, 
sexualization of all non-sexual relations, 
coquetry, and provocativeness,” and, on the 
other hand, “sexual frigidity, intense fear of 
sexuality and failure of the sex impulse to 
develop toward its goal.” According to 
Lewis (1956), hysterics are both “coquettish 
and frigid.” Laing (1962), writing from 
an existentialist viewpoint, stated that "sexu- 
ality is very here and now and so the 
hysteric flirts with it as she flirts with life 
but without ever being able to face it.” 
Starr (1953) mentioned as one of the 
hysteric’s characteristics the “persistence in 
sexualizing all non-sexual relations,” and 
Marmour (1953), arguing for an oral-fixation 
interpretation, stated that the “hysteric is dis- 
tressed when her sexuality brings on advances; 
as she is really asking to be loved as a child 
and not as a woman.” : 

These interpretations, in spite of their 
varied theoretical perspectives, appear to be 
descriptive of a sexual approach-avoidance 
conflict. When relatively far from the sexual 
goal, the behavior of the hysteric appears to 
be sexually motivated and to have socially 


^ The status of the hysterical personality in n 
male is, as yet, ambiguous and requires à T 
extended discussion than is called for in this pap 
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defined sexual significance. When close to the 
sexual goal, her behavior is characterized by 
avoidance of sexual stimuli and denial of 
sexual feelings and motives. It can be further 
assumed that this approach-avoidance con- 
flict is evident not only in the interpersonal 
behavior of the hysteric but on the level of 
perceptual and cognitive processes as well. 
Thus, under relatively neutral (ie., non- 
sexual) conditions, the hysteric should be 
particularly sensitive to and observant of 
stimuli which have sexual implications. How- 
ever, under sexually *provocative" conditions 
she should be selectively unaware of sexual 
stimuli, particularly if a relationship of these 
stimuli to her is implied. 

The present study tested these hypotheses. 
Two groups of female Ss, hysterics and non- 
hysterics, were required to perform a paired- 
associates learning task and a visual recogni- 
tion threshold task. Half of the Ss in each 
group worked under sexually neutral condi- 
tions, the other half under sexually *provoca- 
tive” conditions. Both tasks involved neutral 
and sexual verbal material. In addition, the 
manner of presentation of the stimuli in the 
recognition threshold task encouraged either 
à set toward self-reference or a set toward 
Other-reference. The specific predictions were 
(a) under sexually neutral conditions hys- 
terics, compared with nonhysterics, would 
make relatively fewer errors in paired- 
associate learning of sexual words and would 
have a lower recognition threshold for sexual 
Words, and (5) under sexually “provocative” 
Conditions hysterics, compared with nonhys- 
terics, would make relatively more errors in 
paired-associates learning of sexual words and 
Would have a higher: threshold for sexual 
Words, particularly when self-reference of 
these words is implied. 


METHOD 
Subjects 


, Female undergraduate students (W = 150), enrolled 
in a physical education course, were administered 
the Edwards Personal Preference Scale (EPPS) 
and the Guilford-Zimmerman Temperament Survey 


(G-Z). Previous work by O’Neill® has shown that, 
— 


5M. O'Neill, unpublished master’s thesis, “Validity 
see sterical Personality as a Clinical Category,” 
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TABLE 1 
Four CATEGORIES OF STEM AND RESPONSE 
Worps 
Category 
H-H H-L L-L L-H 
Kiss C ck |Caress B_st | Book R.te |Write L. ve 
Penis B_lls | Petting W rk| Walk N.te |Swim D_ck 
Screw C me | Embrace P. ge | Study M. re | RunningS. ck 
Rape F_ck |Breast J.st | Read Bick |Desk C_nt 


Note.—H-H = sexual stem-sexual response; H-L = sexual 
stem-nonsexual response; L-L - nonsexual stem-nonsexual 
response; L-H = nonsexual stem-sexual response. 


using a discriminant-function analysis, the combined 
discriminant weights of six scales (Abasement, Ag- 
gression, and Change from the EPPS; General 
Activity, Personal Relations, and Masculinity-Femi- 
ninity from the G-Z) yielded the best discrimination 
between hysterics and nonhysterics as diagnosed by 
psychotherapists. In a follow-up validity study 
O’Neill® showed that female nonpatient groups, 
scoring high on this hysterical personality measure 
were judged by clinicians in interviews to have 
hysterical personality traits, while Ss scoring low 
were judged to have significantly fewer hysterical 
personality traits. 

Next, 56 Ss were selected from S pool, 28 defined 
as hysterics and 28 as nonhysterics, based on their 
extreme scores on O'Neill's hysterical personality 
measure (HP). The two groups were also equated for 
verbal learning ability on a 5-word nonsense-syllable 
task. One-half of the hysterics and one-half of the 
nonhysterics were assigned to the experimental 
(sexually “seductive”) group and the remaining Ss to 
the control (sexually neutral) group. 


Procedure 


All Ss were tested individually by a male exam- 
iner? Each S was administered a paired-associates 
learning task (PAL) and a visual recognition thresh- 
old (VRT) task. One-half of the Ss received the 
PAL first while the other half received the VRT first. 

PAL. Stimuli consisted of 16 stem words paired 
with 16 incomplete response words. Eight words were 
selected for their sexual meaning; the other 8 were 
selected for activity connotation which was not 
sexual, The incomplete response words which had 
previously been rated by college females for their 
degree of sexual connotation were words of four and 
five letters with the second letter omitted. Four 
sexual words were randomly paired with 4 sexual 
response items; the other 4 sexual words were 
randomly paired with nonsexual items. The same 
procedure was followed for the nonsexual stems and 


* M. O'Neill, unpublished study entitled, “Valida- 
tion Study for Hysterical Personality Scales,” 1966. 

T The authors are indebted to William Ewald and 
Brian Jordan for their help in this research. 
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Fic. 1. Sample of pronoun and verb lettering size. 


response items. Table 1 shows the four categories 
of stem and response items. 

Stimuli were projected onto a 4X 5 ft. standard 
movie screen by a Kodak Carousel No. 800 projector 
equipped with an automatic timing device. The pro- 
jector was mounted on a table located behind and to 
the left of S; the screen was located 3 ít. from S. 
Each stimulus was presented for 5 sec. The study- 
test method of presentation was used, each S re- 
ceiving a training trial followed by a test trial. 
The training trial showed each stem word alone 
followed by presentation of the stem word paired 
with the correct response item. The test trial pre- 
sented the stem word only. It was S's task to spell 
out loud the letters of the incomplete response word 
correctly paired with the stem word. Any error on 
the test trial necessitated a complete training trial. 
A 2-sec. interval separated the training from the test 
trial. Criterion of mastery was 100% correct recall 
of the response list (ie. one errorless trial). 

VRT. The stimuli were 12 two-word phrases, con- 
sisting of six verbs, each one combined with the 
pronoun, J (self-reference), or with the pronoun, 
They (other-reference). Three verbs were sexual in 
content (rape, kiss, screw) and three were neutral 
(read, swim, study). All verbs appeared in the PAL 
task. Procedure and method of presentation were 
similar to those reported by Kempler and Wiener 
(1964). The pronouns were printed with a thick 
pen point (Leroy No. 4) and the verbs with a thin 
pen point (Leroy No. 0). When presented in the 
tachistoscope the pronouns were more distinct and 
easier to recognize than the verbs. Figure 1 shows 
two sample phrases. 

A two-channel, Dodge-type, electronically triggered 
tachistoscope, No. 800 F was used. Recognition 
threshold was established by the method of ascend- 
ing limits, The first exposure for each S began at 
the lowest Vernier-scale setting and was increased 
by half-unit steps thereafter until S correctly identi- 
fied the stimuli twice. The 12 phrases were presented 
in random order to each S. 


Experimental and Control Conditions 


All Ss were tested in the evening. They were met 
by a male examiner in a waiting room and taken 
by him to an adjoining experimental room. This 
room was in semidarkness when S entered and 
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remained so throughout the experiment. It was di- 
vided into two sections by a partition behind which 
sat a female assistant who recorded S's responses, 

Two male examiners were used, each one testing 
half of the Ss in the hysteric group and half in the 
nonhysteric group. The Es had no knowledge of the 
diagnostic classification of the Ss. Both men were 
tall, in their 20s, and generally considered to be 
handsome. The experimental and control conditions 
were defined by E's behavior toward S. In the 
experimental (sexually "seductive") condition the 
examiner (a) made personal and flattering comments 
to S (eg., “I like your dress"), (b) made at least 
one physical contact with S prior to the beginning 
of the experiment (e.g., taking her arm while leading 
her to the experimental room), and (c) upon enter- 
ing the experimental room locked the door in order 
to “insure privacy.” In addition both examiners, by 
their tone of voice and other subtle methods, behaved 
“seductively” but not in a crude or offensive manner. 
In the control (sexually neutral) condition the 
examiner was friendly but impersonal and business- 
like in his comments to S, made no physical contact 
with her, and did not lock the door of the experi- 
mental room. 


RESULTS 
Paired-Associates Learning Task 


The measures obtained for the PAL were 
errors to criterion, A three-factor with re- 
peated measures analysis of variance (Winer, 
1962) was computed on these measures, the 
three factors being personality type, experi- 
mental condition, and category of stem- 
response item combination. Results of the 
analysis are shown in Table 2. d. 

The main effect of S groups was not signifi- 


TABLE 2 


PAIRED-ASSOCIATE LEARNING ANALYSIS OF VARIANCE 
FOR NUMBER OF ERRORS ON Four 
CATEGORIES OF WORDS 


Source af | MS F 
Between 55 
Groups 1| 11.161 EU. 
Conditions 1 | 412.572 20.600 
Groups X Conditions 1 | 261.446 | 13. 
Error 52| 20.019 
Within 168 
Gategones 3 91.503 190% 
Groups by Category 3 x 01). 
E X Category 3| 27.654 | 5945 
[nd Conditions X 3 | 18.458 3.968" 
Error 156| 4.65 
*p <01 
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cant, indicating that, other things being 
equal, hysterics and nonhysterics do not differ 
in paired-associates learning ability. The main 
effects of experimental conditions (p < .001) 
and of word categories (p < .01) were signifi- 
cant, as were the interactions of groups with 
conditions (? < .001) and groups with verbal 
categories (5 < .01). These effects are most 
clearly revealed by examining the significant 
(p< .01) triple interaction shown in Fig- 
ure 2. It is clear that the “seductive” behav- 
ior of the examiner had little effect on the 
nonhysterics, but had a marked effect on the 
hysterics. Furthermore, this effect is related 
to the degree of sexual meaning contained in 
the verbal categories. 


Visual Recognition Threshold 


VRT was defined as the number of constant 
increments of exposure time required for S 
to correctly report twice each of the 12 stimu- 
lus phrases. The threshold data were trans- 
formed into difference scores (D scores) by 
subtracting each S's J sexual threshold scores 
from her They sexual threshold scores, and 
her J neutral from her They neutral threshold 
scores. Table 3 shows the results of a three- 
factor analysis of variance with repeated 
Measures (Winer, 1962). The only significant 
effects were the main effects of word cate- 
Bories (p < .05), and the triple interaction 
(p < .05). Comparison of means (Scheffé, 
1959) showed that the mean D score for 
sexual words under experimental conditions 


TABLE 3 


Visvar~Recocnrrion Task ANALYsIS OF VARIANCE 
FOR SEXUAL AND NONSEXUAL WORDS 


Source df MS F 
| Between 55 
TOups 1| 650.893 | 1.21 
Conditions 1| 282.893 
Groups X Conditions 1| 641286 | 1.19 
tor 52| 538.09 
- Within 56 
Verb category 1 | 1,586,286 | 4.91* 
roups X Verbs 1| 408.83 | 1.35 
Onditions X Verbs 1| 282.893 
meee X Conditions X Verbs} 1 | 1,856.571 | 6.14* 
tror 52| 302.56 


*d «05, 


BOE entes 
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nonsexual sexual nonsexual M 


Fic. 2. Paired-associate learning task. (Errors to 
criterion under experimental—sexually seductive— 
and control—sexually neutral—conditions for four 
categories of word stem-response combinations.) 


was significantly higher (p< .01) for the 
hysterics than any other group (see Table 4). 


Discussion 


The results of this study are in full agree- 
ment with the hypotheses, and offer support 
for the interpretation of at least some aspects 
of the behavior of the hysterical personal- 
ity as a sexual approach-avoidance conflict. 
Under nonsexual conditions the hysteric is 
sensitized to stimuli with possible sexual con- 
notations. Conversely, under sexually seduc- 
tive conditions she avoids or selectively 
inattends to stimuli with sexual connations. 

The dimension of “distance from sexual 
goal" is, no doubt, a complex one. The pro- 
cedures of this study permitted the differentia- 
tion of two meanings of this concept: (a) the 
objective distance, that is, the culturally de- 
fined sexual implications of the physical 
setting and of interpersonal behaviors, and 
(5) the subjective distance, that is, the aware- 
ness and acceptance of one's own sexual 
motives. The latter meaning of distance was 
represented in the VRT by the self-reference 
and other-reference pronouns, J and They, 
respectively. The data indicate that these 
two aspects of distance may interact; that is, 
the hysteric is aware and accepting of sexual 
self-reference when the environmental condi- 
tions are sexually irrelevant or neutral. How- 
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TABLE 4 


VISUAL RECOGNITION THRESHOLD MEAN TRIAL 
D-SconE For SEXUAL AND NONSEXUAL 
Worps 


Threshold value 


Group 
Sexual words |Nonsexual words 
Hysterical Ss 10.92 —11.50 
Hysterical controls —84 — 845 
Nonhysterical Ss —1.00 1.50 
Nonhysterical controls 4.57 —10.72 


ever, when the environmental conditions are 
rich in sexual implications she denies and 
avoids sexual self-reference, while at the same 
time becoming highly aware of sexual motives 
and behaviors in others. 

Previous studies in the personality-percep- 
tion area with emotion or affect arousing 
words have been fraught with conceptual and 
methodological difficulties (e.g., Goldiamond 
& Hawkins, 1958; Goldstein, 1962; Howes & 
Solomon, 1951; Solomon & Postman, 1952). 
While the results of this study demonstrate 
that learning and recognition threshold behav- 
ior of hysterics are affected by sexually 
“seductive” conditions, they do not offer an 
unequivocal basis for the description of the 
processes responsible for this effect. For ex- 
ample, the interpretation of response suppres- 
sion (i.e. “seeing” but not “saying”) has 
sometimes been offered for studies using 
taboo or “dirty” words (Howes & Solomon, 
1950). The affect-arousing words in this 
study, while clearly sexual in meaning, were 
not of the "dirty" variety; nevertheless, the 
response-suppression interpretation could still 
be relevant here, particularly in view of the 
female Ss and the male Es. 

The position taken here is that when the 
learning and recognition of classes of words 
are predictably affected in different Ss by 
verbal (e.g, Z and They) as well as inter- 
personal context, such data are relevant for 
personality whether one invokes special and 
unusual processes (e.g., subception, perceptual 
defense) or makes use of more mundane 
explanations, such as response suppression 
due to embarrassment or prudishness. One 
need only assume that with specific cues 
differences in personality will show up as 
different response characteristics in the pres- 
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ence of such cues (Kempler & Wiener, 1963, 
1964), and furthermore, that response charac- 
teristics are affected by a variety of condi- 
tions—physical, social, and interpersonal. 

Such an orientation would not be expected 
to yield much information about the intra- 
psychic characteristics of different personali- 
ties, but it would be productive of statements 
concerning reactions of certain personalities 
to classes of cues under particular conditions, 
The hysterical personality, then, is one who, 
under sexually neutral conditions, sensitizes 
to a large variety of sexual cues, and who, 
under sexually “provocative” conditions, 
selectively inattends to the same sexual cues, 
particularly when self-reference is implied. 

Other characteristics of the hysterical per- 
sonality, leading to a more comprehensive 
description, can be investigated by other com- 
binations of cues in different (physical, inter- 
personal, etc.) settings. 
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SIZE ESTIMATION IN SCHIZOPHRENIA AS A 
FUNCTION OF CENSURE, DIAGNOSIS, 
PREMORBID ADJUSTMENT, AND 
CHRONICITY 


PETER A. MAGARO 1 


Northern Illinois University 


Schizophrenics divided in terms of chronicity, diagnosis, and premorbid ad- 
justment estimated the size of a line embedded in different affective and neutral 
content. The main result was that premorbid adjustment and chronicity in- 
teracted in size-estimation performance with the good, premorbid-adjustment 
group exhibiting a smaller size estimation in the chronic conditon than in 
the acute, whle the poor, premorbid-adjustment group exhibited the opposite 
effect. Although a Premorbid Adjustment X Chronicity interaction was pre- 
dicted in previous work, the specific over- or underestimation tendency found 
for each schizophrenic subgroup in the present study differed from those 
previously predicted. The differences in the present results and predictions de- 
rived from a review of the literature were discussed in terms of possible 
difficulties inherent in making inferences from studies which did not directly 
control the relevant subject dimensions. The results indicated little support for 
the social-censure hypothesis and suggested the greater effect of organismic 
variables relative to stimulus variables in predicting schizophrenic task per- 


formance. 


Rodnick and Garmezy (1957) have pro- 
posed a social-motivational model which has 
emphasized the motivational qualities of 
specific stimulus conditions in the determina- 
tion of the schizophrenic-performance deficit. 
It is proposed that the schizophrenic has 
undergone early censorious child-rearing ex- 
periences which have sensitized him to cues 
depicting censure. The deficit is due to 
censorious stimuli eliciting anxiety to which 
there has been learned a characteristic with- 
drawal response which interferes with task 
performance (Dunham, 1959; Magaro, 1967). 
Other investigators have not found a relation- 
ship between censure and performance (Klein, 
Cicchetti, & Spohn, 1967), and others have 
considered the deficit to be related to any 
affective stimulus (Buss & Lang, 1965). 

Silverman (1964) has emphasized the rela- 
tionship between the schizophrenic perform- 
ance deficit and particular subject dimensions. 
The deficit is considered to be related to the 
attention-response disposition characteristic of 
each schizophrenic subgroup rather than to 
the effects of particular stimuli. The attention- 

1 Requests for reprints should be directed to 


Peter A. Magaro, Department of Psychology, Uni- 
versity of Maine, Orono, Maine 04473. 
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response disposition of interest to the present 
paper is that of scanning which is considered 
to be more characteristic of certain schizo- 
phrenic subgroups than others. The domi- 
nance of the scanning disposition for a pat- 
ticular subgroup would determine the over- 
or underestimation tendency in a size-esti- 
mation task. Stimulus content such as censure 
is considered to be anxiety inducing as in 
the above social-motivational model, but it is 
considered to exaggerate the response dis- 
position of a schizophrenic subgroup rather 
than elicit a competing response of with- 
drawal. Silverman’s hypothesis regarding the 
scanning disposition of schizophrenic sub- 
groups dichotomized on the basis of pre- 
morbid adjustment and chronicity has bee 
derived from the results of separate studies 
using different combinations of the two SU» 
ject dimensions. These studies differed in 
the tasks used, the means of categorizing the 
two dimensions, and the information available 
regarding the presence of the chronic and pre 
morbid subject dimensions as well as other 
possible confounding subject dimensions su 

as that of diagnosis. This lack of specification 
of what subject variables are operating 1n ii 
experimental situation makes inferenc 


es CON | 
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cerning the effects of premorbid adjustment 
and chronicity fairly speculative. The present 
study examined the effects of chronicity and 
permorbid adjustment on a size-estimation 
task using both paranoid and nonparanoid 
groups. From the inferences of Silverman, 
an interaction between chronicity and pre- 
morbid adjustment would be predicted. Acute 
poors and chronic goods should overestimate, 
and chronic poors and acute goods should 
underestimate. Paranoids in each group 
should exhibit greater underestimations than 
nonparanoids. It would also be predicted that 
the under- or overestimation tendency of 
each group will be exaggerated under censure 
conditions. 

The general purpose of the present study 
was to contrast aspects of the social-censure 
model of Rodnick and Garmezy (1957) with 
what is considered the organismic model of 
Silverman (1964). Although both positions 
predict that censorious cues will have drive 
properties, the social-censure model considers 
differences in drive to be mainly responsible 
for schizophrenic subgroup differences. The 
presentation of specific censorious cues in- 
creases drive and consequently the response 
tendency of withdrawal. Therefore, differences 
between schizophrenic subgroups would be 
predicted only under specified stimulus con- 
ditions. 

The organismic model views the schizo- 
phrenic-performance deficit as being mainly 
telated to the habit system. Differences in 
subgroup performance are due to differences 
in the habit strength of the response disposi- 
tion of scanning. In a size-estimation task, 
the relative performance of two schizophrenic 
Subgroups depends upon the dominance of 
Scanning in the response hierarchy. More 
Specifically, the organismic model would pre- 
dict differences between groups rather than 
between stimuli. 

An attempt was also made to assess the 
deficit in size estimation in terms of a re- 
Sponse to an immediate presentation of the 
Stimulus versus memory of the stimulus. 
Vates (1966) suggested that the schizophrenic 
deficit was mainly a short-term memory func- 
tion rather than a distortion in the reception 
of the stimulus. Previous studies utilized a 
Short-term memory procedure (Harris, 1957; 
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Zahn, 1959), and, therefore, it seemed ad- 
visable to compare performance on a direct, 
size-estimation procedure to one involving size 
estimation in a memory procedure. 


METHOD 
Subjects 


The sample consisted of 72 hospitalized, schizo- 
phrenic males, mostly veterans, ranging in age 
20-60 yr. An attempt was made to equate for race, 
social class of father (Hollingshead, 1960), educa- 
tion, urban versus rural, age, and drug intake. All 
drugs were phenothiazine derivatives with the vast 
majority of patients being given Thorazine. All Ss 
lived with their mothers at least until the age of 16 
and had been diagnosed as schizophrenic without 
complications of organicity, alcoholism, or physical 
handicaps. The patients meeting the above criteria 
were divided into equal groups of chronic (5 yr. or 
more of total hospitalization) and acute (3 yr. or 
less). These groups were subdivided into good (a 
score of 14 or less) and poor (15 or more) levels 
of premorbid adjustment as derived from the Phillips 
Scale of Premorbid Adjustment (Phillips, 1953). 
Each subgroup of 18 Ss was further divided into 
6 paranoids and 12 nonparanoids according to both 
the hospital’s admitting and current diagnosis. Twelve 
control Ss were composed of aides and middle-class 
staff members. 


Materials 


The relevant cue was a 228-mm. vertical black 
line embedded in different contextual stimulus scenes. 
Two X two transparencies of the stimulus scenes 
were projected from a Kodak Carousel slide pro- 
jector to a glass-beaded screen. The size of the 
drawing paper was 20 X 25 in. 

The practice stimulus consisted of a right triangle 
with the vertical line emphasized by being black 
and heavy in contrast to the base and its hy- 
potenuse. The neutral-content stimuli were a single, 
neutral, black line and a single, vertical, black line 
between two trees. A stimulus denoting positive 
affect was a single, vertical, black line between a 
mother and son in which the mother is giving her 
son some milk from a pitcher. A censure stimulus 
was a single, vertical, black line between a son and 
mother in which the mother is scolding her son. 
The same scenes without the line were those used 
by Harris (1957). 


Procedure 


The procedure was as follows. Each S was seated 
at a small table with paper and pencil before him 
and the screen 10 ft. to the front. The S was al- 
lowed at least 5 min. to acclimate to the room and 
the equipment. The practice stimulus was presented 
with the following directions: “See this triangle? 
Now look at this line; it is darker and heavier 
than the others. Draw it on the paper in front of 
you the same length as you see it here. Let me 
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TABLE 1 
Means AND STANDARD DEVIATIONS FOR CONTROLLED AND VARIED SUBJECT VARIABLES 
Subgroup 
Variable 
AG AP CG CP N 
Ape (i 
a Xen 36.2 35.6 48.3 43.8 394 
SD 17.37 5.87 11.06 10.27 14.04 
% urban 100 92 83 83 83 
% Classes IV & V (lower class) 100 83 83 67 75 
Education (ii 
M Mae 11.60 9.67 9.91 10.50 12.8 
SD 1.88 448 2.60 244 3.73 
Phillips score 
a 11.0 22.6 8.9 20.8 
SD 2.08 3.51 3.75 4.80 
"Total hospitalization (in years) 
M 1.56 1.45 11.22 11.25 
SD 1.34 81 8.30 6.06 
Medication (mgs. per day) (phe- 
nothiazine derivatives) 
M 223.92 216.58 208.75 311.08 
SD 227.07 152.42 247.00 322.62 
Note.—AG = acute goods; AP = acute poors; CG = chronic goods; CP = chronic poors; and N = normals. 
know when you are finished. Now I am going to  sentation was randomized except for the line 


hold it up again, but this time I will show it to you 
for 5 sec, then I will turn it over. After I turn it over, 
draw a line from memory." If S indicated that he 
understood the task by his performance, E then 
went on to the first experimental trial which was 
a single black line, If S seemed to be having diffi- 
culty, the practice trial was repeated. 

The directions for the experimental scenes were as 
follows: *Now I am going to show you a picture 
of a black line on this screen in front of you. 
When the picture goes on, draw a line on this paper 
the same size as the one on the screen." Then, with- 
out comment, E provided a fresh sheet of paper and 
gave instructions from memory. After the presen- 
tation of the line stimulus, the other content stimuli 
were presented with the following directions: “Now 
I am going to show you a picture of a black line 
between two trees, draw the line the same size as 
you see it.” (Then instructions were given for 
reproduction from memory.) “Now I am going to 
show you a picture of a black line between a 
mother who is scolding her son, she is really bawling 
him out. Draw the black line the same size as you 
see it.” (Then Ss were instructed to reproduce from 
memory.) “Now I am going to show you a picture 
of a black line between a mother who is feeding 
her son. She is giving him a glass of milk.” (Then 
instructions were given for reproduction from 
memory.) 

Within each S group, the order of stimulus pre- 


stimulus which was always first. GSR and base rate 
were also recorded at this time. These results 
be reported in a separate paper. 


RESULTS 


Table 1 presents the means and standard 
deviations for controlled and varied subject 
variables. The acute-chronic and poor-good 
grouping separated Ss as intended. The only 
other variable which showed a significant 
difference was medication (x? = 2847, df = 
3, P « 01). Chronic poors exhibited the 
greatest difference by being at a high level of 
medication. The correlation between amount 
of medication and size estimation was not 
significant for the total group of patients no? 
for any single group. When those Ss who were 
not receiving any medication at all were 
eliminated from the analysis, there was à 
significant negative correlation between size 
estimation and amount of medication for the 
nonparanoid Ss (r = — 44, df =32, p € 01). 
This effect was mainly due to the nope 
noids in the chronic good (r = —.86, 4 5 z 
< .01) and acute good (r= 91, df=% 
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FIG.1 PLOTS OF THE MEAN SIZE ESTIMATION FOR EACH SCHIZOPHRENIC SUBGROUP 
AND NORMALS FOR CONTENT AND FOR TRIALS (SEQUENCE) 


b < .01) subgroups; for these Ss the smaller 
the amount of medication, the larger the size 
estimation. The total amount of medication 
taken by paranoids and nonparanoids did not 
differ. 

The first analysis was a repeated measure 
four-way (poor-good, chronic-acute, stimulus 
content, and memory-reproduction) analysis 
of variance (Winer, 1962). The significant 
effects were stimulus content (F = 6.59, df = 
3/204, p<.01) and the Poor-Good x 
Chronic-Acute interaction (F = 4.52, df= 
1/68, p < .05). The insignificant memory- 
reproduction effect indicated that presenta- 
tion and memory procedures do not have 
differential effects and only seem to act as 
additional trials for each stimulus. In exam- 
ining the content effect, it seemed that the 
Main difference between stimuli was due to 
4 small size estimation on the first-presented 
Nonrandomized stimulus (black line) and a 
larger but similar size estimation on the 
Other randomized stimuli, (trees, feeding, 
Scolding). This suggested that the content 
effect was due to order of presentation 
(sequence). Consequently, a three-way (poor- 
good, chronic-acute, stimulus content) analy- 


sis of variance was performed collapsing over 
the insignificant memory-reproduction factor 
and excluding the first-presented nonrandom- 
ized line stimulus. The Poor-Good X Chronic- 
Acute interaction was again significant, but 
content was not significant (p > .05). This 
result indicated that the three content scenes 
(trees, feeding, scolding) had the same effect 
on all groups. The lower size estimation evi- 
dent on the first-presented stimulus, the black 
line, was probably due to its order in the 
sequence. However, the present design does 
not completely rule out the possibility that 
the simple content of the line scene did not 
create the differential stimulus effect. In 
order to examine the effect of sequence irre- 
spective of content, a repeated measure, three- 
way (poor-good, chronic-acute, sequence) 
analysis of variance (Winer, 1962) was per- 
formed including all presented stimuli and 
collapsing over the insignificant memory- 
reproduction factor. The sequence factor 
was highly significant (F = 6.02, df = 7/476, 
p < 01) and indicated a gradual increase in 
size estimation over trials regardless of stimu- 
lus content. The sequence factor was also 
examined in another analysis which excluded 
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FIG.2 THE POOR-GOOD x CHRONIC-ACUTE INTERACTION COLLAPSED 
OVER CONTENT AND MEMORY-REPROOUCTION CONDITIONS. 


the performance on the nonrandomized line 
stimulus. Sequence was again significant 
(F = 4.64, df = 5/340, p< .01). Figure 1 
presents the performance of each subgroup 
by trials with content randomized as well as 
the performance of each group by content 
with sequence randomized. Although content 
was randomized in order of presentation for 
each group, the censure stimulus did not pro- 
duce the poor-good difference as predicted by 
the social-censure hypothesis. There was a 
suggestion of the censure stimulus differen- 
tially affecting the poor-good groups in that 
all groups followed the same pattern of in- 
creasing or decreasing their size estimation on 
each scene except on the censure scene where 
both groups increased their size estimation 
and both good groups remained constant 
(see Figure 1). However, such a suggestion 
must be quite tenuous in light of not finding a 
significant Premorbid Adjustment X Content 
interaction. 

The significant Chronicity X Premorbid Ad- 
justment interaction is presented in Figure 2 
and is due to the good underestimating in the 
chronic condition and overestimating in the 
acute condition, while the opposite occurred 
for the poors. The consistent relative position 
of each subgroup on each stimulus scene indi- 
cates that the Poor-Good x Chronic-Acute 
interaction was consistent across stimuli. The 
significant interaction supports our main hy- 
pothesis that chronicity and premorbidity do 
interact on a size-estimation task. The specific 
deviation tendencies of each group, however, 
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do not confirm the predictions derived from 
the inferences of Silverman (1964). 

The significant individual group compari- 
sons (Studentized Range Statistic, Winer, 
1962) on each stimulus scene are presented 
in Table 2 and indicate that the chronic goods 
significantly differed from the acute-good and 
chronic-poor groups as well as normals on all 
stimuli. As can be seen in Figure 2, the 
chronic goods were the most deviant group 
by exhibiting the smallest size estimations, 
while the acute goods and chronic poors 
exhibited the largest size estimations. The 
acute-poor group was intermediate as were 
the normals. Since all groups including 
normals underestimated in relation to the 
standard, the specification of under- or 
overestimation tendencies of a particular 
schizophrenic subgroup was made in relation 
to the size-estimation performance of the 
normals. Poors tended to exhibit a larger size 
estimation than goods on all stimuli; however, 
the higher size estimations of the poors as 
compared with the goods were mainly due to 
the high size estimations of the chronic poors 
and the low size estimations of the chronic 
goods. There were no chronic-acute differ- 
ences collapsed over premorbid adjustment 


TABLE 2 


SIGNIFICANT “q” Tests BETWEEN THE SCHIZOPHRENIC 
Suscrours oF Acure Poors, Acure GOODS, 


Curonic Poors, Cmgowic Goops, AND 
Norats on EACH or THE FOUR 
SrIMUvLUS SCENES 
Stimulus Subgroup E 
Line AG > CG 4,98** 
CP > CG 3.74% 
N >CG 6.68 
N >AP 3.99* 
Tree AG « CG 5.03** 
CP > CG 5.72 
AG > AP 4.04" 
N >CG 4.16 
Feedin; AG > CG 5.30** 
he GP Š CG Sar 
N $66 3.93* 
i 6.20** 
Scolding eu 3 on S50 
N> CG, 4.03* 
#5 Soi 
Note,—Significant ''q" tests (k, df 4/68). See note to 
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(p> .05), although acutes tended to exhibit 
larger size estimations than chronics. The dif- 
ference between chronics and acutes tended to 
be greater for goods than poors. 

An analysis of the paranoid-nonparanoid 
dimension compared the paranoids and non- 
paranoids in each subgroup and collapsed over 
all subgroups. Results indicated no significant 
differences (5 > .05) in size estimation be- 
tween the paranoids and nonparanoids. There 
was a trend for the paranoids in the acute- 
poor and chronic-good groups (the two groups 
which underestimated) to display larger size 
estimations than the nonparanoids in these 
groups. This suggests that the more extreme 
underestimations were due to nonparanoids, 


DISCUSSION 


The results indicated that schizophrenic 
task performance was determined more by 
subject dimensions than stimulus conditions. 
Stimulus content did not exhibit any signifi- 
cant effects. The interaction of premorbid ad- 
justment and chronicity was significant and 
remained constant across stimuli which did 
not allow any significant content effects. The 
individual comparisons indicated that the 
greatest between-group differences were exhib- 
ited by the large size estimation of acute 
goods and chronic poors as compared to the 
smaller size estimations of the chronic goods. 
The acute poors slightly underestimated as 
compared with normals, but were not signifi- 
cantly different from normals or chronic goods. 

The motivational quality of maternal cen- 
Sure which was considered to increase the 
anxiety level or drive of the poors relative 
to the goods and normals and consequently 
interfere with task performance (Rodnick & 
Garmezy, 1957) was not supported in the 
present results. There was a slight tendency 
for censure to increase the size estimation of 
the poors while the goods remained relatively 
constant; however, this tendency did not re- 
ceive any statistical support. It is possible 
that the present design was not able to suf- 
ficiently separate the effects of stimulus con- 
tent from the effects of subject variables. The 
Censorious cues used in the present study may 
not have been intense enough to increase 
drive and affect performance beyond the effect. 
of the organismic variables. Also, the medi- 
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cated state of the present Ss may have 
diminished the effect of the censure stimuli 
(Goldstein, Acker, Crockett, & Riddle, 1966) 
although there was no evidence of a relation- 
ship between medication and size estimation 
in any of the major subject groupings. 

The interaction of the subject variables of 
chronicity and premorbid adjustment, the sig- 
nificant sequence effect, and the consistency 
of subgroup differences across stimulus condi- 
tions lent support to the organismic model 
proposed by Silverman (1964). In regard to 
the effect of sequence, Rodnick and Garmezy 
(1957) suggested that in a repeated-measure 
design comparing different stimulus content, 
the censure scene should be placed last to 
avoid producing a carry-over effect to other 
stimuli. The significant sequence effect found 
in the present study suggests that order of 
presentation may produce a more potent 
carry-over effect than stimulus content. 

Although a Chronicity X Premorbid Adjust- 
ment interaction was found in the present 
results, the specific predictions concerning the 
performance of each subgroup were not con- 
firmed. In fact, present subgroup performance 
tended to be exactly the opposite to what 
was predicted by Silverman. It is proposed 
that this difference was due to Silverman's 
means of inferring the scanning disposition of 
each subgroup, and that an interpretation of 
the significant findings and trends in the 
present data as well as a reinterpretation of 
previous work would allow a speculative but 
more consistent view of the size-estimation 
performance of the different schizophrenic 
subgroups. In inferring the tendency for the 
chronic goods to overestimate, Silverman 
relied mainly on the data of Zahn (1959) who 
found in comparing goods and poors that 
poors equaled normals and goods overesti- 
mated. Zahn considered this result due to two 
main factors: First, the factor of chronicity 
in that mean hospitalization for goods was 
6.00. yr, although the sample could have ranged 
from very acute to very chronic; second, the 
factor of anxiety in that the goods who were 
responsible for the size overestimation were 
only those who were not on medication. In 
short, the overestimation could have been due 
to the acute goods in the sample, or the goods 
who were at a high-anxiety level, or both 
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The present results found for nonparanoids in 
both acute- and chronic-good groups that size 
estimation increased with a decrease in amount 
of medication, and in effect, supported Zahn’s 
observation. Also the present acute goods 
tended to overestimate in comparison with 
normals, Therefore, Zahn’s results which 
Silverman considered to represent the per- 
formance of chronic goods may in fact be 
more appropriate to acute goods. The infer- 
ences concerning the effects of chronicity on 
good, premorbid-adjustment schizophrenics 
from ad hoc analysis of Zahn’s results may 
have been inappropriate since chronicity was 
not controlled and anxiety seemed to have 
such a determining effect. It is even pos- 
sible that the medicated goods who performed 
similarly to normals in the Zahn study were 
actually a combination of the present low- 
estimating chronic goods and the high-esti- 
mating acute goods who when combined would 
appear similar to normals. 

The inferences concerning the overestima- 
tion tendency of the acute poors were drawn 
from a study by Harris (1957). The same dif- 
ficulty resides in this study as in the Zahn 
study. The average hospitalization of the 
overestimating poors was approximately 3 
yr. which again could represent a range of 
chronicity. Inferences from the performance 
of this group as being representative of acute 
poors would be extremely hazardous. The 
average chronicity of 3 yr. corresponded 
to the upper limit of the present acute group. 
The performance of the Harris poors corre- 
sponds to the present chronic poors who ex- 
hibited the greatest overestimation. Since the 
present acute poors did not underestimate 
extensively from normals (especially when 
performance over trials was considered), the 
overestimation result of Harris’ poors may 
mainly reflect an averaging of the perform- 
ance of chronic poors and acute poors. There- 
fore, Silverman’s inference of the size-estima- 
tion tendencies of acute poors may again have 
been premature and instead more relevant to 
chronic poors. The goods in the Harris study 
who did not differ from normals and were 
also considered to be acute demonstrated a 
slight tendency to underestimate. Due to his 
3-yr. average, there possibly were more 
acute than chronic goods in his good group. 
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Their performance would, therefore, be more 
similar to a combination of the present chronic 
and acute goods with the performance level 
of the acute goods being more heavily 
weighted because of their greater number. 
Such a combination of the present good 
groups would result in a performance similar 
to the goods of Harris. 

The present results would suggest a modi- 
fication in Silverman's classification of which 
subgroups are characterized by over- or 
underestimation. The chronic goods under- 
estimate and are most clearly different from 
all other groups including normals. The acute 
poors tend to underestimate. The chronic 
poors and acute goods overestimate and are 
not significantly different from one another. 

The underestimation of all groups relative 
to the standard may have been due to the 
requirements of the present task. The size- 
estimation task used in previous studies 
(Gardner, Jackson, & Messick, 1960) re- 
quired $ to adjust circles of light with a rela- 
tively small distance between the standard 
and the comparison. In the present task there 
was a much greater distance between the 
S and the stimulus which in itself could 
account for smaller size estimations. Also, the 
content embedded in the stimuli and the draw- 
ing of a line in a field restricted by the size 
of the paper may have tended to decrease 
size estimation. Therefore, there is the pos- 
sibility that although the simpler size-estima- 
tion task used in previous work has been 
found to be a valid measure of the extensive- 
ness of eye movement (Gardner, Holzman, 
Klein, Linton, & Spence, 1959; Gardiner & 
Long, 1962), the distance and stimulus con- 
tent factors introduced in the present study 
may have allowed the admittance of other 
factors besides that of scanning. Also, it could 
be questioned if the traditional psychophysical 
method, a variant of which was used in the 
present study, is capable of detecting sub- 
group differences in size estimation. Price am 
Eriksen (1966) stressed separating the judg- 
mental process from perceptual sensitivity 
analyzing schizophrenic subgroup differences. 
The traditional psychophysical measures, al- 
though supposedly reflecting the operation of 
a perceptual process such as the perception © 
size, may be reflecting the operation of judg- 
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mental processes such as the criterion of the 
observer or the particular manner of using 
judgment categories. Magaro (1966) found 
that the effects of an anchor on size estima- 
tion were more a function of the use of re- 
sponse categories than changes in perception. 
Price and Eriksen (1966) found that no 
paranoid-nonparanoid differences could be 
detected in a traditional psychophysical pro- 
cedure, while differences were obtained when 
the reliability and the S's subjective con- 
fidence in the judgmental responses were 
examined, The latter finding may be especially 
relevant to the inability of the present study 
to find any significant paranoid-nonparanoid 
differences. Although the lack of significance 
was probably due to the small number of 
paranoids in each group (N = 6), another 
interpretation could be that the present size- 
estimation measure was too global and in- 
sensitive to detect the type of paranoid-non- 
paranoid differences which may be detected 
with the use of more analytical dependent 
variables (Price & Eriksen, 1966). 
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Schizophrenic and normal Ss (N=230) solved concept-identification (CI) 
problems varying in stimulus complexity, simultaneously with either a normal 
or a schizophrenic stooge providing relevant or irrelevant cues. Major results 
were (a) CI performance was a negative function of stimulus complexity, 
(b) social cues from a normal stooge had greater effects on problem solving 
than social cues from a schizophrenic stooge, and (c) schizophrenics demon- 
strated a deficit in problem solving connected primarily with the use of social 
cues. Comparisons of these data with expectations based on a mathematical 
model of CI revealed several significant discrepancies, attributable primarily to 
the fact that normals are more sensitive to social cues than are schizophrenics. 


Pishkin and Blanchard (1963) reported a 
preliminary evaluation of the ability of schizo- 
phrenic and normal Ss to use social (i.e., from 
another person) and nonsocial (ie. from 
variations in a physical stimulus) sources of 
information in solving concept-identification 
(CI) problems. Generally, their results indi- 
cated that, in addition to being poorer 
problem solvers, schizophrenics showed a rela- 

- tive inability to profit from solution-relevant 
information provided by another person 
(actually, a confederate of E acting as an- 
other S). These results, and others like them 
(Davis & Harrington, 1957; Johannsen, 
1959), are consistent with Cameron’s (1947) 
“social disarticulation” characterization of 
schizophrenic behavior. Moreover, as shown 
by Pishkin and Blanchard, they are describ- 
able in quantitative detail by a mathematical 
model of CI performance (Bourne & Restle, 
1959). 

The present study is, in several respects, 
an extension of the Pishkin-Blanchard work. 
First, in the earlier experiment, social cues 
(when they were available) were supplied by 
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a peer confederate (C) (i.e., schizophrenic Ss 
were paired with schizophrenic Cs and normal 
Ss with normal Cs). An issue, just as impor- 
tant as the difference between normal and 
schizophrenic ability to use social and non- 
social cues in problem solving (but not 
nearly as thoroughly explored), concerns the 
relative saliency of cues provided by normal 
and schizophrenic partners. Recent findings 
(Pishkin, 1966) indicate that, in general, Ss 
fail to use as much information from a schizo- 
phrenic as from a normal person when making 
perceptual judgments. The present experi- 
ment, studying the four possible dyads formed 
with schizophrenic and normal Ss and Cs, 
sought to determine whether similar perform- 
ance trends are evident in a complex problem- 
solving task. j 

Second, in the preceding study all experi- 
mental problems were characterized by 8 
single level of stimulus complexity. It is well 
known that stimulus complexity not only 
governs the difficulty of a CI task (e; 
Bourne, 1957) but also tends to interact with 
other significant variables (e.g, Bourne & 
Restle, 1959; Pishkin, 1960). The Pishkin- 
Blanchard results might, therefore, be severely 
limited in generality, with greater (lesser OT 
reversed) differences between normal an 
schizophrenic performances existing in prob- 
lems of other complexity levels. Accordingly, 
tasks of three different complexities (the 
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intermediate of which was used by Pishkin 
and Blanchard) were employed in the present 
experiment to study the possible moderating 
influence of this variable on normal-schizo- 
phrenic differences. 

Thus, the major purpose of this work was 
to collect more extensive evidence on the abil- 
ity of schizophrenic and normal Ss to solve 
conceptual problems of varying levels of dif- 
ficulty using information provided by either 
or both social and nonsocial sources. In addi- 
tion, an attempt was made to describe these 
data quantitatively within the framework of 
a mathematical model used by Pishkin and 
Blanchard. The accuracy of that descrip- 
tion reflects one major assumption—that all 
sources of information, social and nonsocial, 
are additive—and thus serves as a minor test 
of the model. 


METHOD 
Subjects 


The Ss were 115 schizophrenics and 115 normals. 

They consisted of male patients and aides from 
Veterans’ Administration Neuropsychiatric Hospitals 
at Oklahoma City, Oklahoma; Little Rock, Arkansas; 
Tuskegee, Alabama; Tomah, Wisconsin, and Central 
State Hospital, Norman, Oklahoma; all patients had 
a current diagnosis of schizophrenic reaction, chronic- 
undifferentiated type. All patients (a) who were 
below 20 or above 45 yr. of age, (b) who had had 
electroshock or insulin therapy or psychosurgery, or 
(c) who were suffering from neurological, auditory, 
or visual disorders as indicated by the admissions’ 
physical examination were not included in the 
sample. None of the patients were on a psychotropic 
drug regimen which exceeded a daily dose of 100 mg. 
of chlorpromazine, The schizophrenic Ss were first- 
Psychiatric-admission patients with an average length 
of hospitalization of 48.14 days. The normal popula- 
tion consisted of male psychiatric aides in corre- 
sponding institutions who were also 20-45 yr. of age. 
All Ss were administered the vocabulary portion of 
the Shipley Institute of Living Scale prior to the 
experiment. 
_ Five schizophrenic and five normal Ss were as- 
signed to each of 46 experimental treatment condi- 
tions, The Ss were matched across all conditions for 
age and intelligence (Shipley vocabulary score). The 
Schizophrenic patients were matched for length of 
hospitalization in each of the experimental conditions. 
All matching was to the extent that no significant 
difference was reached between any pair of groups 
ìn age, intelligence, or hospitalization. 

In those conditions where a C was required, 
chronic-undifferentiated, schizophrenic patients in 
good contact, aides, and technicians were used. The 
Cs were dressed either in Veterans’ Administration 
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aide or patient uniforms as appropriate and were 
strangers to Ss. Aides always wore aide uniforms 
and patients always wore Veterans’ Administration 
patient uniforms. A total of 18 schizophrenic pa- 
tients and 14 aides participated as Cs. 


Procedure 


All Ss were administered a two-choice CI problem 
wherein they were required to learn to match their 
responses with the relevant dimension of geometric 
patterns, The correct and incorrect responses were 
not so defined to Ss, Form of figure was the relevant 
dimension; that is, Ss were required to respond by 
saying “A” or “B” for squares or triangles, respec- 
tively, regardless of the other (irrelevant) character- 
istics of the patterns. Three levels of complexity 
were established by varying one (color), three (color, 
size, and number) or five (color, size, number, vertical 
position, and orientation) irrelevant dimensions of 
the stimuli. The patterns were presented by a male 
E on 3 X 3 cards mounted in a slot above a T-shaped 
structure, Below the stimulus slot at S’s eye level 
were mounted two amber feedback lights which 
were positioned below the blocked letters, A and B. 
The feedback was provided simultaneously to S and 
C immediately following S’s response. The long arm 
of the T structure prevented S and C from viewing 
each other, although both were able to view the 
stimulus and the feedback lights simultaneously. 


Design 


The design was a 5 X 3 X 2 X 2 factorial with five 
stimulus-social cue conditions (two of which had 
either stimulus or social cues only), three degrees of 
problem complexity, two S populations, and two 
types of Cs, All Ss performed on the task for 192 
trials or until 16 consecutively correct responses were 
made. All patterns appeared equally often but with 
the restriction that the same pattern never ap- 
peared on successive trials; otherwise the order of 
presentation was random. In conditions where C 
participated he always responded before S in ac- 
cordance with a sequence of responses programmed 
on a slip of paper. The T structure prevented S from 
viewing the C’s program. The S was introduced to C 
as a patient or an aide, but was not aware that C 
was a stooge. Both S and C were given the same 
instructions, The S was informed that because C 
happened to arrive at the laboratory earlier he would 
always respond first and that the same feedback 
would be provided to C and S. A more detailed 
description of the task and instructions appeared in 
an earlier paper (Pishkin & Blanchard, 1963). The 
five basic conditions used in this experiment were 
designed in order to test the effects of availability of 
stimulus and social cues by combining relevant and 
irrelevant aspects of the two types of information. 
Table 1 summarizes these conditions along with the 
theoretical equations (developed in Pishkin & Blan- 
chard, 1963) for predicting performance within each. 
As noted above, the basic CI conditions were elab- 
orated to include three levels of task complexity. In 
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TABLE 1 


MAIN EXPERIMENTAL CONDITIONS AND THEIR 
CORRESPONDING THEORETICAL EQUATIONS 


ceai ostia: Equation N 
I st [E Ei 4n 30 

II soc 6- E 20 
III st + soc 0-2 REeSP iS 60 
IV st — soc 0- mae 60 
V „soc — st [E EYTPPGS 60 


Note.—st =stimulus information, soc=social information, 
0 =learning rate, k =proportion of available stimulus informa- 
tion used, /=proportion of available social information used, 
R-number of relevant stimulus dimensions, J=number of 
irrelevant stimulus dimensions, B =weight of background cues, 
S 2 weight of social information. 


Conditions (Cond.) II, III, IV, and V, where social 
information was available, either a schizophrenic 
(SC) or a normal (NC) stooge participated, with an 
equal number of schizophrenic and normal Ss paired 
with the two types of C. To summarize, the tasks 
over the five conditions were as follows: 


Condition I. The Ss performed individually on a 
CI problem and C was not present. 

Condition II. CI patterns were not available and 
Ss were required to guess which feedback light would 
go on after C had responded. The maximum number 
of "correct" guesses could be reached only by imi- 
tating C. 

Condition III. A CI problem was presented on 
which C was correct on every trial Thus, correct 
responses could be made by either responding to 
relevant stimulus dimensions or imitating C; both 
the stimulus and C cues were relevant. 

Condition IV. This was a CI problem on which 
C responded randomly. Solution could only be 
reached by ignoring C and by identifying the rele- 
vant CI dimension. 

Condition V. The C was arbitrarily correct on 
every trial and the pattern of guesses called correct 
had no relationship to the CI patterns, so that CI 
patterns would have to be ignored and solution 
could be reached only by imitating C. 


RESULTS AND DISCUSSION 


Analysis of variance was performed on 
errors to criterion and is summarized in Table 
2. It indicates that the main effects of com- 
plexity, C type, and stimulus conditions were 
statistically significant, whereas there was no 
overall influence upon CI performance of S 
type. Consistent with previous results, it was 


VLADIMIR PISHKIN AND Lyre E. Bourne, JR. 


demonstrated that as complexity increased 
the number of errors also increased linearly 
(Pishkin, 1960). The mean number of errors 
when Ss, Cs, and conditions were pooled were 
11.44, 25.45, and 37.82 for problems with 
one, three, and five irrelevant dimensions, re- 
spectively. The main effect of C showed that 
more errors were made when the origin of 
social cues was a schizophrenic stooge, as can 
be seen in Figure 1. This effect, coupled with 
the insignificant variance associated with S 
type, suggests that Pishkin and Blanchard's 
observation of an overall difference in problem 
solving between normals and schizophrenics is 
at least partly attributable to the confounding 
of S type and C type in their experiment. 
Figure 1 also indicates that the main effect of 
stimulus conditions is probably a result of 
fewer errors occurring in Cond. I, II, and III 
than in Cond. IV and V. This result suggests 
that the conflict in social and stimulus cues, 
which arises only in Cond. IV and V, is a 
significant inhibitor of problem solving. 
The C type main effect is further elab- 
orated by certain interactions. First, a sig- 
nificant Complexity x C Type interaction in- 
dicates that as complexity increased from one 
to five irrelevant dimensions comparatively 
poorer performance resulted in the conditions 
where C was schizophrenic as compared to 
normal. The significant S x C Type interac- 


TABLE 2 


ANALYSIS OF VARIANCE OF ERRORS 
TO CRITERION 


Source of variance df F 
Complexity (A) 2 8.34** 
Subject type (B) 1 pec 
Confederate type (C) i 14167. 
Stimulus conditions (D) 4 7.89 
AXB 2 EDS 
AXC 2 350 
AXD 8 
BXC 1 4.08** 
BXD 4 E 
CXD 4 4.05** 
AXBXC 2 3.19% 
BXCXD 4 "m 
AXCXD 8 ree 
A X E XD 8 TR 

XBXCXD 8 
Error 172 (6236.53) 
*F«10. 
b Number in parentheses =error mean square. 
* p <.05. 
M b <.01 
++ p <.001 
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Fic. 1. Mean errors produced by schizophrenic (crosshatched) and normal 
(blank) Ss in the five conditions. 


tion indicates, moreover, that the potency of 
normal Cs was significantly greater than that 
of schizophrenic Cs in facilitating or inhibit- 
ing CI performance of normals; schizophrenic 
Ss, on the other hand, demonstrated a ten- 
dency to be less differentially influenced by 
two types of stooges, regardless of whether 
they provided relevant or irrelevant informa- 
tion. The significant C Type X Stimulus Con- 
dition interaction clearly demonstrates that 
the number of errors committed by both S 
populations was a function of whether C and/ 
or stimulus cues were relevant or irrelevant to 
the solution of the problem; for example, as 
seen in Figure 1, performance in Cond. III, 
where both C and stimulus were relevant, 
was markedly superior to that of the Cond. 
IV where the social cues were irrelevant to 
the solution. 

An interaction involving complexity, S, and 
C types emphasizes the significant trends 


found in the Complexity x C Type and S x 
C Type terms. It is noteworthy that perform- 
ance became progressively poorer when com- 
plexity increased, especially by schizophrenic 
Ss with schizophrenic Cs, whereas perform- 
ance of normal Ss at the lowest level of com- 
plexity (one irrelevant dimension) with a rele- 
vant normal C was superior to all other 
groups. 

In order to evaluate the between-group 
differences with both S populations and C 
types, a series of difference scores plus the 
“criterion difference” (Lindquist, 1953) re- 
quired to reach statistical significance (P < 
.05) was computed for each pair of group 
scores, and these are presented in Table 3. 

With pooled populations it is clear that 
there were no significant differences in errors 
between any pairs of combinations in Cond. 
I, II, and III. However, as seen in Figure 1, a 
significantly greater number of errors was pro- 
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TABLE 3 
COMPARISON OF MEAN Errors 
BETWEEN CONDITIONS 
Condi- Schizophrenic S Normal S 


tion 


A. Difference between schizophrenic C and normal C 
for two S populations, CD* — 6.32 


II 3.8 8.8* 
III 8.9* 2.6 
IV —14.5* 2 

V 24.1* 10.9* 


B. Difference between S populations matched with 
different Cs, CD — 7.84 


II 10.2* 17.2* 
TII 74 1.1 
IV 2.9 —10.3* 

V 30.5* 17.3* 


C. Difference between mean errors of schizophrenic 
and normal Ss, CD — 9.19 


I 4.7 
IL 12.5* 

Ur 4.2 

IV 43 
V 24.1* 


D. Pooled S populations: Difference in mean 
errors, CD — 12.67 


Condition 
I Il Ill IV 
II 7.2 
III 83 11 
=31.1* —32.2* 
2:21:2*. —22.3* 9.9 


IV —23.9* 
V —14.0* 


* Criterion difference, 
* p «.05. 


duced in Cond. IV and V as compared to 
Cond. I, II, and III (Table 3, Section D). 
Section A of Table 3 represents differences 
in mean errors for schizophrenic and normal 
Ss between dyads performing with schizo- 
phrenic and normal Cs. A positive score indi- 
cates a greater number of errors with schizo- 
phrenic Cs, Significant differences for Cond. 
III, IV, and V indicate that the influence of 
schizophrenic Cs on performance of schizo- 
phrenic Ss was significantly less than that of 
the normal Cs. With normal Ss it is also clear 
that schizophrenic Cs had a less facilitating 
influence upon performance than normal Cs. 
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However, it is noteworthy that in Cond. Ill, 
where both stimulus and social cues were 
relevant, there were no differences in the 
performances of normal Ss as a function of C 
type; this finding is most likely due to the 
fact that the problem was sufficiently simple 
for normal Ss that it masked the effect of C 
type. 

Section B of Table 3 represents differences 
between the two S populations due to C type. 
A positive score in this section indicates a 
greater number of errors produced by schizo- 
phrenic Ss than by normal Ss. It is apparent 
that when NCs were used in Cond. II and V, 
a significantly greater number of errors was 
produced by schizophrenic Ss than by normal 
Ss. These data are consistent with other find- 
ings that schizophrenics are not able to gain 
relevant information from a social source to 
the same degree as normal Ss. Poorer per- 
formance by normals was evident in Cond. IV, 
however, where the function of C was to inter- 
fere with solution. 

Section C of Table 3 shows differences be- 
tween the two S populations for each condi- 
tion. The only significant differences appeared 
in Cond. II and V, again demonstrating defi- 
ciency of schizophrenic Ss in using social in- 
formation, regardless of the source. The fact 
that no differences were found between the 
two populations in Cond. III is probably at- 
tributable to the simplicity of the problem. 
In Cond. IV the interference effect of C for 
normal Ss might have been sufficient to over- 
come their typical advantage over schizo- 
phrenics, since normal Cs negatively influ- 
enced performance of normal Ss to a greater 
degree than that of schizophrenic Ss. 


Theoretical Analysis 


Calibration of theoretical parameters in 
Table 1 was accomplished using Equations 1, 
2, and 5 (derived in Pishkin & Blanchard, 
1963). Calibration data were taken from two 
subgroups of Cond. I (one and five irrelevant 
dimensions), Cond. II, and one subgroup 9 
Cond. V (five irrelevant dimensions). These 
parameters were derived by use of simultane- 
ous equations and applied in predicting Pet 
formances for the remaining groups. The cat- 
brated values are summarized in Table 4. Con- 
sistent trends are apparent: The rate 9 


———— —  — 
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stimulus-cue usage, the rate of social-cue usage 
(from a schizophrenic or normal source), as 
well as the weights of social-stooge cues—all 
of which are quantitative parameters, esti- 
mated from the data and required for an ac- 
curate fit between model and data—were 
smaller for schizophrenic than normal Ss. In 
addition, it is clear that the value of normal 
social cues was approximately twice as great 
as the value of schizophrenic social cues for 
both the normal and schizophrenic S popula- 
tions. 

Table 5 shows the predicted mean errors for 
conditions which were not included in the 
calibration of constants. It is apparent that 
the two predictions in conditions with stimu- 
lus cues only were adequate. Also the predic- 
tion of combined and redundant stimulus and 
social cues (Cond. III) was fairly close, at 
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TABLE 4 
CALIBRATION OF THEORETICAL PARAMETERS 
Group k B le Se | In | Su 
Normals 45 | 1.43 | .26 | 1.55 | .39 | 2.65 
Schizophrenics | .37 | 1.30 | .18 | .93 | .15 | 1.85 


Note.—k =proportion of available stimulus information used ; 
B=weight of background cues; /s=proportion of cues from 
schizophrenic stooge used by S; ln — proportion of cues from 
normal stooge used by S; Ss=weight of schizophrenic C; Sn 
—weight of normal C. 
least for normal Ss. However, there were 
severe discrepancies when stimulus and social 
information conflicted (Cond. IV and V), a 
result which reflects primarily the fact that 
normal Ss were significantly more sensitive 
than schizophrenics to social cues. The as- 
sumption of equivalent relative sensitivity 
made by the model is simply inappropriate for 
a description of these data. 


TABLE 5 
OUTCOME or THEORETICAL ANALYSIS TO CRITERION 
Group 
f Schizophrenic Normal 
Ure aat RR Stooges 
Error Error 
0 0 
i Predicted 
anche Predicted Obtained TER Predicted | Obtained 
I. Stimulus onl; 
One d (.111)> = (11.4) (132)^ — (8.8) 
"Three .07 19.8 (19.8) .083 16.2 (16.2, 
I Five (.05)> — (31.6) (.061)^ — (23.6, 
I. Social only 
N S (.074)> — (18.6) (137)^ = (8.4) 
EE N mud = (14.8) (.255)^ — (3.6) 
II. Stimul ial 
One pM S 13 9.0 (4.8) 171 6.0 (5.8) 
N A3 9.0 (3.8) 24 3.8 (3.4) 
"Three S .09 14.6 (14.2) 122 9.6 (7.4) 
N .091 14.3 (6.0) 18 5.5 (6.8) 
Five S .06 24.9 (31.8) .095 13,7 (15.4) 
N 071 19.0 (14.2) 147 7.6 (10.4) 
IV. Sti soci 
dom speal S .087 15.6 (14.6) .09 14.6 (12.8) 
N .072 19.0 Ho 074 19.0 (24.6, 
"Three S 059 26.5 36.2) .06 24.8 re 
N .050 31.5 (41.4) .055 28.0 51.8, 
Five S .045 37.0 (49.4) .05 31.5 (46.4) 
N .04 41.6 (86.2) 044 37.0 (96.4) 
V. Social-sti 
kun nena S 04 41.0 (21.4) (.082)* = (16.4) 
N 1054 270 | (02 | (ize | — (62) 
"Three S .03 60.0 (52.0) .057 27.0 (24.2) 
N .04 41.6 (30.4) 127 9.3 (17.8) 
Five S (.02)b — (98.2) .045 37.0 (37.6) 
N (031) — (56.8) 10 12.7 (21.6) 


* Confederate type; S =schizophrenic, N =normal. 
* Estimated from data. 
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Some general conclusions made from this 
theoretical analysis include (a) normals used 
more of the relevant stimulus information 
than schizophrenics, (5) since the B param- 
eter represented irrelevant, background stim- 
ulation arising from sources other than the 
stimulus pattern, it is apparent that normals 
were more sensitive to background cues than 
were schizophrenic Ss, (c) normals were more 
sensitive to social cues than schizophrenics 
whether those cues came from schizophrenic 
or normal Cs, (d) both normals and schizo- 
phrenics used more cues from a normal C than 
from a schizophrenic C, (e) the performance 
of normals working with both stimulus and 
social cues followed theoretical predictions 
adequately, thus giving some support to the 
additivity of cues assumption of the model, 
though the same conclusion obviously does not 
hold for schizophrenics, (f) discrepancies be- 
tween model and data resulted largely from 
the fact that schizophrenics did not use cues 
from normal Cs as effectively as predicted, 
and (g) the model assumes that schizophrenic 
Ss would use C (normal or schizophrenic) in 
the same way that the normal S used C. The 
significant interaction S X C type indicates 
that this particular assumption is untenable. 
Present findings clearly demonstrate schizo- 
phrenics’ deficit in processing of information 
Írom a social source, especially when such 
source is a peer, schizophrenic patient. 
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EFFECT OF PERCEPTUAL ISOLATION AND AROUSAL ON 
ANXIETY, COMPLEXITY PREFERENCE, AND 
NOVELTY PREFERENCE IN PSYCHOPATHIC 

AND NEUROTIC DELINQUENTS* 
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Psychopathic and neurotic delinquents were subjected to perceptual isolation 
(PI) and arousal (A) and were given pre- and postmeasures of novelty, com- 
plexity preference, and anxiety. The results showed that (a) there was a sig- 
nificant negative correlation between novelty preference and anxiety level and 
between complexity preference and anxiety level; (b) the psychopathic de- 
linquents had significantly lower pretest anxiety and significantly higher novelty 
and complexity preference scores; (c) after PI psychopathic delinquents in- 
creased their complexity preference scores significantly more; and (d) after A 


psychopathic delinquents did not shift scores, while neurotics significantly in- 


creased anxiety scores and decreased complexity preference scores. 


This investigation was concerned with the 
preference behavior of psychopathic and 
neurotic delinquents for novelty and com- 
plexity of stimuli under experimental condi- 
tions of perceptual isolation and arousal. Its 
principal aim was to test the validity of a 
theory concerning the etiology of psychopathy 
proposed by Quay (1965) which holds that 
the psychopathic personality represents patho- 
logical stimulus seeking. Quay (1965) states: 
“We are suggesting that the level and varia- 
bility of sensory inputs which are necessary 
for the maintenance of pleasant affect are 
, much greater for the psychopath than for the 
ordinary individual [p. 181].” In order to 
account for this pathological need for sensory 
input, Quay suggests two possibilities. The 
first is that the psychopath’s basal reactivity 
to stimulation is lower than normal, so that 


1The present paper is based on a dissertation 
submitted in partial fulfillment of the requirements 
for a PhD in psychology at the University of Mli- 
nois, This study was made possible by the kindness 
and cooperation of R. E. Gerard and R. B. Levin- 
son at the National Training School for Boys, Wash- 
ington, D. C. The author wishes to give grateful 
thanks to H. C. Quay and L. P. Ullmann for their 
helpful suggestions and continuing support through- 
Out the planning and execution stages of this study. 
The author also wishes to thank G. L. Paul, J. E. 
McGrath, J. S. Wiggins, and M. M. Ohlsen for their 
helpful assistance. 

? Request for reprints should be sent to the author, 
United States Army Hospital, United States Mili- 
tary Academy, West Point, New York 10996. 
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more sensory input is needed to produce effi- 
cient and pleasurable cortical functioning. 
The second is that there may be a more rapid 
adaptation to stimulation which causes the 
need for stimulus variation to occur more 
rapidly and with greater intensity. 

Quay (1965) concludes that: 


What we have suggested is that the psychopath, 
either due to a lessened basal reactivity or an in- 
creased rate of adaptation, quite frequently finds 
himself in a condition of stimulus deprivation. Since 
this condition is affectively unpleasant he is moti- 
vated to change this affective state by the seeking of 
stimulation. In a highly organized environment such 
as that in which modern man resides this seeking of 
either added intensity or added variability of stimu- 
lation may on occasion involve transgressions of both 
law and moral code. It is these transgressions, moti- 
vated by the search for added or more varied stimu- 
lus input, which most frequently bring the psycho- 
path to public attention [p. 182]. 


Another major aim of this investigation 
was to test the accuracy of a frequently made 
clinical observation concerning a character- 
istic which distinguishes the psychopathic 
from the neurotic personality. This observa- 
tion is that psychopaths are persons suffering 
from affective impoverishment; that is, psy- 
chopaths do not become anxious in situations 
which are highly anxiety eliciting for non- 
psychopathic individuals. Preference for novel 
and complex stimuli and an arousal condition 
therefore were variables in this study, so that 
it was possible to explore the relationship be- 
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tween anxiety and complexity- and novelty- 
preference behavior. Several studies (Berlyne 
& Lewis, 1963; McReynolds & Bryan, 1956; 
O'Leary, 1965) suggest that there is a nega- 
tive relationship between preference for com- 
plexity or novelty and anxiety. However, 
Barron (1953) found that complexity corre- 
lated .37 with the Pd scale of the MMPI, and 
Haywood (1962) found no significant differ- 
ences among three groups of low, moderate, 
and high anxiety Ss on choice of novel stimuli. 
It was hoped that the present study would 
clear up these discrepant results. 

Brief plan of experiment. This investigation 
used a group of psychopathic and neurotic 
juvenile delinquent males. Neurotic delin- 
quents were included because they constitute 
a group which theoretically should be on the 
opposite extreme of an anxiety and impulse 
control continuum from psychopaths, and be- 
cause they could be theoretically character- 
ized as “pathological stimulus avoiders.” They 
thus provided a marked contrast to the psy- 
chopaths. Both groups were administered a 
questionnaire-type anxiety scale, a preference 
for complex stimuli test, and a preference for 
novel stimuli test. Following this, both groups 
were randomly assigned to one of three ex- 
perimental conditions. The first condition con- 
sisted of a brief perceptual isolation experi- 
ence, the second was a relatively intense 
arousal condition, and the third was a “neu- 
tral” control condition. Immediately following 
exposure to these conditions, all Ss were ad- 
ministered a posttest of anxiety, complexity 
preference, and novelty preference. 

It was hypothesized that the psychopathic 
delinquents would be relatively unaffected by 
the arousal condition, but would show signifi- 
cant increases in novelty and complexity pref- 
erence as a result of perceptual isolation, while 
the neurotic delinquents would react in the 
opposite way. Furthermore, the pretest nov- 
elty and complexity scores were expected to 
be significantly higher for the psychopathic 
than for the neurotic delinquents. (For a com- 
plete statement of predictions see Skrzypek, 
1967.) 


METHOD 


From a sample of juvenile delinquent boys at the 
National Training School for Boys (NTS) in Wash- 
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ington, D. C., a group of psychopathic and neurotic 
delinquents were selected on the basis of their scores 
on the psychopathy and neuroticism subscales of ac 
behavior rating checklist developed by Quay and his 
collaborators (Peterson, Quay, & Cameron, 1959; 
Peterson, Quay, & Tiffany, 1961; Quay, 1964). These 
authors identified by factor-analytic methods four 
personality dimensions related to juvenile delin- 
quency. The two dimensions relevant to the present 
investigation were “unsocialized-psychopathic” and 
“disturbed-neurotic.” The psychopathic-delinquency 
factor was interpreted as reflecting overt, aggressive, 
amoral acting-out behavior, coupled with impulsivity, 
a rebellious attitude toward authority and discipline, 
and a relative freedom from family and other inter- 
personal ties. The neurotic factor “reflects a dimen- 
sion of withdrawal, escape, anxiety and repression 
[Quay, 1964]." 

Quay's (1964) behavior rating checklist has been 
shown to measure both these factors. It is com- 
prised of a 46-item checklist drawn from Peterson's 
(1961) Problem Checklist. The behaviors on this 
checklist are easily observable and are rated with 
adequate reliability by cottage personnel and teach- 
ers. Quay et al. (1966) report a correlation of .67 
between parents’ ratings on the neuroticism sub- 
scale, and .78 between parents’ ratings on the psy- _ 
chopathy subscale. h 

The mean for the psychopathy scale of the behav- 
ior rating, calculated on an N of 360 delinquents 
from NTS, was 7.973, SD = 4.98. The mean for the 
neuroticism scale, calculated on the same N, was 
7.932, SD = 4.19, The Ss were selected from a pool 
of 200 boys who were in the institution at time of 
testing. All remaining Ss who attained a psychopathy 
scale score of greater than 8 and a neuroticism scale 
Score of less than 8 were placed in the psychopathic 
group. All Ss who attained a neuroticism scale score 
of greater than 8 and a psychopathy scale score of 
less than 8 were placed in the neurotic group. There. 
were a total of 33 Ss who met this criterion for” 
psychopathy, and a total of 33 who met the criterion 
for neuroticism. 

Dependent variables. There were three main de- 
pendent measures used in the present study: (a) 
The Anxiety Differential (Husek & Alexander, 1963); 
an 18-item test in the form of a semantic differen- 
tial scale. (b) A measure of the degree of preference 
for complex versus simple random shapes (Mun- 
singer & Kessen, 1964), comprising two different sets 
(for pre- and posttest) of 28 pairs of pictures of 
random shapes, Each pair contains one “complex 
and one “simple” shape, complexity and simplicity 
being defined by the number of independent turns 
that each shape possesses. (c) A measure of the 
degree of preference for "novel" versus "familiar 
Pictures (Sieveking, 1965). This test consists of two 
different sets of 12 pairs of pictures, one picture J^ 
each pair being an unusual or incongruous picture: 
the other being a familiar one. A 

Procedure. The Ss were run through the exper 
mental procedures individually. Upon entering the 
experimental room S was introduced to E and given i 
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a brief sketch of what he would be required to do 
in the experiment. The E then informed S in the 
perceptual isolation and control conditions that if 
he participated in the study and cooperated satis- 
factorily, he would receive $2. The Ss in the arousal 
condition were told that they would receive a chance 
to earn up to $2 if they participated and cooperated 
by following instructions. The S was then asked 
whether he desired to participate in the experiment. 
Only one S, who was classified as a psychopath, 
refused to participate. Prior to seeing Ss, the entire 
potential S list was shuffled and coded as to psycho- 
pathic and neurotic classification by another psy- 
chologist working at NTS. Therefore, E was blind 
as to S classification. 

The Ss were first administered the Anxiety Dif- 
ferential test. To insure that Ss understood how to 
fill out the test properly, three examples were worked 
by S with E's help, and E read all the words on the 
test to S, in case S's reading ability was too poor for 
the test. Next the precomplexity-preference and the 
prenovelty-preference tests were administered. The 
order of administration of these two measures was 
counterbalanced. The three experimental conditions, 
perceptual isolation (PI), arousal (A) and control, 
followed. The psychopathic and neurotic delinquents 
were randomly assigned to these conditions, with the 
restriction that there would be 11 Ss in each condi- 
tion. 

The PI condition consisted of requiring S to lie 
quietly on a bed with his head on a foam rubber 
pillow. Each S in this condition wore thick cotton 
gloves, translucent goggles, and earphones through 
which white noise of a sufficient intensity to mask 
all outside sounds was fed. The Ss in PI were in- 
formed that an electrical timer and white-noise gen- 
erator were actually equipment for recording every 
movement they made. They were also told that other 
boys like them did very well in relaxing, and that Z 
wanted to see how well they could do. The Ss were 

ot informed of how long they were to remain in 
this condition. After administering the instructions, 
E turned off the lights, turned on the tape recorder 
which introduced the white noise through the ear- 
phones, and left the room. He quietly looked in 
upon each S at 2-min. intervals to insure that S was 
following instructions. Each S remained exactly 40 
min. in perceptual isolation. 

The Ss in the control condition were told that 
while E was engrossed in working on some numbers, 
they were to keep busy tracking a moving target on 
the pursuit rotor. The E then demonstrated the use 
of the pursuit rotor, and explained the electric timers 
to S. He then set S to tracking, and pretended to 
busy himself with adding up columns of figures. 
Each S completed eight blocks of 3 min. per block 
of tracking, with 2-min. rest periods in between 
blocks. The total time for the control condition was 
thus 40 min. 

The arousal condition consisted of Ss making suc- 
Cessively more difficult auditory discriminations be- 
tween the loudness of two tones. Pure tones of 1,000 
Cps were, used. The intensity of the constant tone 
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was 40 db. SPL. The comparison tone ranged in 
intensity from 53 to 40 db. SPL. At the beginning of 
the actual test trials, the intensity difference between 
the constant and comparison tone was 13 db. SPL. 
From there this difference decreased at a rate of 1 
db. per step, with 2 trials given to S per step. There 
were a total of 32 trials given to each S, with an 
intertrial interval of 10 sec. In the last 6 trials the 
intensity of the constant and comparison tone was 
exactly equal. Two seconds before each trial a 600 
cps warning tone, set at a constant intensity of 35 
db. SPL, was presented. The purpose of this tone 
was to make S aware that the comparison and 
constant tones were about to be presented. The order 
of presentation of the constant and comparison tone 
was randomized throughout all 32 trials, Whenever 
S made a correct discrimination, he was given a 
poker chip'which represented 10€. There was a total 
of 20 poker chips available. Whenever an incorrect 
discrimination was made, S was first administered a 
1-sec. burst of white noise set at an intensity level 
of 60 db. SPL, and then 1 poker chip was taken 
away from the total which he had amassed up to 
that point. 

The Ss in this condition were instructed that the 
discrimination task was a test of how well they could 
hear. They were led to believe that it was a very 
easy test, and that most boys like them had per- 
formed almost perfectly on it in the past. To make 
the discriminations, Ss were instructed to say “one” 
if the first tone was louder than the second, and 
“two” if the second tone was louder than the first. 
Six practice trials were administered prior to the 
actual test trials, in order to insure that Ss clearly 
understood what they were supposed to do, These 
practice trials involved extremely easy discrimina- 
tions, which all Ss who possessed normal hearing 
were expected to make correctly. After explaining 
the contingencies applying to correct and incorrect 
discriminations, the test trials were initiated. In 
order to eliminate possible cues given by E's facial 
expressions or movements, 5s were required to wear 
translucent goggles during the test trials. Since the 
arousal condition took only an average of 10 min., 
30 min. of tracking on the pursuit rotor was given to 
Ss in this condition after administration of the pre- 
test and prior to the discrimination-arousal condi- 
tion. This made the time interval elapsing between 
pre- and posttests equal to that of the other two 
conditions. 

The novelty- and stimulus-complexity-preference 
measures were readministered immediately after the 
three experimental treatments. Again, the order of 
presentation was counterbalanced for each group. 
The posttest Anxiety Differential was administered 
last. (See Skrzypek, 1967, for complete procedure and 
instructions.) 


RESULTS 


Scoring. A scoring method was employed to 
determine the frequency with which each S 
chose the more variable or complex random 
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| TABLE 1 
| MEANS AND STANDARD DEVIATIONS OF DEPENDENT VARIABLES 
Variable 
Ss 
AD pretest AD posttest | zs pretest RS posttest | NP pretest | NP posttest 
Psychopathic delinquents | 

Perceptual isolation 

M 39.18 39.36 14.90 19.90 8.27 9.09 

SD 8.795 10.259 6.378 4.700 2.686 2.300 
Arousal 

M 38.09 38.09 15.72 14.28 8.00 7.81 

SD 8.904 7.867 6.435 4.729 2.529 2.561 
Control 

M 40.00 40.54 17.27 18.00 9.36 8.90 

SD 8.933 7.763 6.619 6.496 2.292 2.343 

Neurotic delinquents 

Perceptual isolation 

M 96.18 96.90 10.72 12.90 4.45 4.54 | 

SD 18.334 18.571 5.497 4.570 2.339 2423 
Arousal 

M 93.90 111.54 13.18 8.90 4.09 3.63 

SD 17.563 17.084 6.438 5.575 2.385 2.292 
Control 

M 104.00 102.63 14.27 15.63 4.27 4.54 

SD 18.681 18.554 5.640 6.888 2.649 2.544 


Note.—AD =Anxiety Differential; RS =random shapes; NP. 7novelty preference, 


shape. Each of the 28 choices in the pre- and 
posttests of preference was given a score of 1 
if S chose the stimulus having greater varia- 
bility and a score of 0 if he chose the stimulus 
| with less variability. A similar scoring pro- 
| cedure was employed to determine the fre- 
| quency of choice of the novel or incongruous 
picture versus the familiar one. The Anxiety 
Differential test was scored according to the 
key given by Husek and Alexander (1963). 
Pre- and posttest means and standard devia- 
tions of the three dependent variables for the 

| psychopathic and neurotic delinquents under 
all three conditions are presented in Table 1. 
Analysis of Anxiety Differential scores. 
Anxiety Differential scores were analyzed by 
a 3 X 2 X 2 repeated measures design analy- 
sis of variance (Winer, 1962). This analysis 
involved three levels of experimental condi- 
tion, two levels of S classification (psycho- 
pathic and neurotic) and two testing times 
(pre- and posttest). The main effect of 
treatments along was nonsignificant. The 
main effect of S classification had a very sig- 
nificant affect, psychopathic delinquents as 
a group scoring low on the Anxiety Differ- 
ential,’ and neurotic delinquents scoring very 
high. The interaction of treatments with clas- 
sification was not significant. The main 


effect of pre- and posttesting, and the inter- 
actions of pre- and posttesting with classifica- 
tion and with treatments were all significant, 
as was the three-way interaction, which 
qualifies the other two significant interactions. 

The large increase in the Anxiety Differ- 
ential by neurotic delinquents from pre- to t 
posttest under the arousal condition was the 
factor responsible for all three significant 
interactions. Components of the sums ol 
squares were calculated for the B(Classifica- 
tion) X C (Testing) interaction under each of 
the three treatments. This resulted in mean 
squares and F ratios for each of these three 
interactions. The B x C interaction under the 
arousal condition was highly significant (F = 
37.62), while the F ratio for the B x C inter- 
action under the control and perceptual isola- 
tion condition was less than one in both cases. 
The ¢ tests for correlated means were also 
computed on the pre- and postdifferences for 
Psychopaths and neurotics under all three 
treatments. The only significant difference 
was for neurotics under the arousal treatment 
(t = 893, df = 10). 

Two one-way analyses of variance Were 
computed on only the pretest scores for the 
psychopathic and neurotic groups in order to 
ascertain whether there were any significant 
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differences among those scores within either 
group, There were no significant differences 
among the pretest Anxiety Differential scores 
within the three psychopathic or three neu- 
rotic delinquent groups. A two-way analysis 
of variance (Treatments X S Classification) 
was also calculated on the pretest scores in 
order to ascertain whether there was a signifi- 
cant overall difference between the pretest 
scores for the psychopathic versus the neurotic 
groups. The main effect of S classification 
was highly significant, the main effect level 
means indicating that the pretest scores were 
low for psychopaths and high for neurotics. 

The following conclusions were drawn from 
the above analyses of Anxiety Differential 
scores: (a) As was predicted, there were no 
significant differences between pre- and post- 
test for any of the psychopathic delinquent 
groups. (b) The only significant difference 
between pre- and posttest for the neurotic 
delinquents occurred under the arousal con- 
dition, Consistent with clinical theory, the 
neurotics significantly increased their scores 
from pre- to posttest. (c) Random assignment 
to experimental conditions was shown in that 
there were no significant differences between 
the pretest scores within the three neurotic 
or the three psychopathic delinquent groups. 
(d) As hypothesized, there was a significant 
overall difference between the pretest scores 


-of the psychopathic delinquents and the pre- 


test scores of the neurotic delinquents, with 
the neurotics having the higher scores. 
Analysis of random shapes scores. Com- 
plexity-preference test scores were analyzed 
in exactly the same manner as were the Anx- 
iety Differential data. The main effect of 
treatments alone and of testing alone did not 
reach significance. The main effect of S classi- 
fication was again highly significant. The in- 
teraction of treatments with classification 
(A X B) was not significant. The within-Ss 
interactions B x C and C X A(Classification 
X Testing and Testing X Treatments) were 
both significant. However, the AX B X C 
interaction failed to reach the .05 level of 
significance (F — 2.18, F — 3.15 needed for 
.05 level). It was close enough to significance, 
however, that a closer look at its components 
of sums of squares seemed justified. The sums 
of squares, mean squares, and F ratios for 
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the B X C interaction were computed for each 
level of A (treatments). The F ratio for the 
B X C interaction under the perceptual isola- 
tion treatment reached the .05 level of sig- 
nificance (F — 4.09). This was also true of 
the B X C interaction under the arousal con- 
dition (F — 4.15). However, the B X C inter- 
action under the control condition was not 
significant (F — .204). 

The significant F ratio for the B x C inter- 
action under the perceptual isolation condition 
means that the increase in preference for com- 
plexity from pre- to posttest was significantly 
greater for psychopathic than for neurotic 
delinquents. The significant B X C inter- 
action under the arousal condition signifies 
that the decrease in preference for complexity 
from pre- to posttest was significantly greater 
for neurotics than for psychopaths. In order 
to ascertain whether the psychopaths sig- 
nificantly decreased from pre- to posttest 
under the arousal condition, a ¢ test for cor- 
related means was computed on the difference 
between their pre- and posttest means. This 
difference fell just short of the .05 level. The 
3x2 analysis of variance (Treatments X 
Classification) on pretest scores alone showed 
that there was an overall significant difference 
between the pretest scores for psychopaths 
compared with neurotics (F = 4.52), with 
psychopathic delinquents having higher scores 
than neurotic delinquents. The one-way anal- 
yses of variance on pretest scores showed 
that there were no significant differences 
among the pretest scores within the three 
psychopathic or the three neurotic delinquent 
groups. 

The ¢ tests for correlated means were com- 
puted on the pre- and posttest differences and 
showed that (a) the difference between pre- 
and posttest scores under perceptual isolation 
was significant (¢= 6.65) for psychopathic 
and for neurotic (t= 2.14) delinquents; 
(b) the difference between pre- and post- 
test scores under arousal was significant for 
neurotics (£ — 5.87) but short of significance 
for psychopaths (£= 1.80); and (c) the 
differences between pre- and posttest scores 
under the control condition were not signifi- 
cant for psychopaths (t = 48) or for neurot- 
ics (£15): 

The following conclusions may be drawn 
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data: 


1. Both psychopathic and neurotic delin- 
quents increased their scores significantly 
from pre- to posttest under perceptual isola- 
tion. As would be predicted by Quay (1965) 
the increase for psychopaths was significantly 
greater than the increase for neurotics. 

2. As expected, neurotics decreased their 
scores significantly from pre- to posttest under 
the arousal condition. The decrease for psy- 
chopaths was not significant, which is con- 
sistent with Quay’s (1965) theory. 

3. Random assignment to experimental 
conditions was shown in that there were no 
significant differences between the pretest 
scores within either the psychopathic or neu- 
rotic delinquent groups. 

4. The differences from pre- to posttest 
under the control conditions were not sig- 
nificant within either the psychopathic or 
neurotic delinquents, indicating the adequacy 
of the control task. 

5. As predicted, there was a significant 
overall difference between the pretest scores 
of psychopaths compared with neurotics, 
with psychopaths having the higher pretest 
scores. 


Analysis of novelty preference scores. For 
novelty preference, the only F ratio in the 
three-way analysis of variance which reached 
significance was for the main effect of S clas- 
sification. The A X B X C interaction failed 
to reach significance. Nonetheless, the B x C 
interaction mean squares and F ratios were 
computed under each level of A. None of 
these F ratios approached significance. The 
t tests for correlated means were computed 
on the pre- and posttest differences for both 
neurotic and psychopathic delinquents. The 
only significant difference (t = 2.32) occurred 
for psychopaths under the perceptual isola- 
tion condition. 

Two one-way analyses of variance were 
computed on the pretest scores separately 
for the neurotics and psychopathic groups. 
A two-way analysis of variance on pretest 
scores was also computed. Neither of the F 
ratios for the one-way analyses reach sig- 
nificance, indicating that there were no overall 
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significant differences among the pretest 
scores within either the psychopathic or neu- 
rotic groups. The main effect of S classifi- 
cation in the two-way analysis was highly sig- 
nificant, showing that the pretest scores for 
psychopaths were significantly different from 
those for neurotics. The level means for main 
effects indicated that psychopaths had higher 
pretest scores than neurotics. 

The conclusions which may be drawn from 
the above analyses of novelty preference 
scores were as follows: (a) Novelty prefer- 
ence scores did not change significantly from 
pre- to posttest, with the single exception of 
psychopaths under perceptual isolation, who 
increased their preference for novelty signifi- 
cantly. (5) There were no significant differ- 
ences among pretest scores within either the 
three neurotic or the three psychopathic delin- 
quent groups, indicating that random assign- 
ment was achieved. (c) As was predicted, 
there was a significant overall difference be- 
tween the pretest scores for psychopaths when 
compared with neurotics, the psychopaths 
having the higher scores. 

Analysis of subject variables. A 3X2 
analysis of variance (Treatments X S Classifi- 
cation) computed on age, IQ, length of stay 
in the institution, and number of school 
grades completed showed that these variables 
were randomly distributed in both the psycho- 
pathic and neurotic delinquent groups over 
the three experimental conditions. Means and 
standard deviations for these scores may be 
found in Skrzypek (1967, Table 1). 

Correlations were computed between the 
pretest scores of the Anxiety Differential and 
the pretest scores of random shapes, novelty 
preference, and the behavior ratings. They 
appear in Table 2. All these correlations were 
significant well beyond the .01 level. The cor- 
relation between pretest scores for novelty 
preference and random shapes pretest scores 
was .805 for the psychopathic delinquents 
(N = 33) and .792 for the neurotic delin- 
quents (V = 33). The substantial correlations 
of —.840 between pretest anxiety scores an 
the psychopathy behavior rating scale serve 
to validate further Quay’s (1964) behavior 
rating scales. : 

Since there was a very high correlation be- 
tween both the neuroticism and psychopathy 
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subscales of Quay’s behavior checklist and the 
anxiety scores, the question can be raised 
whether anxiety is the crucial variable, and 
the classification as to psychopath and neu- 
rotic does not contribute significantly to the 
variance. With this in mind, an analysis of 
covariance was performed on the random 
shapes data. Using pretest anxiety scores as 
the covariate, an F ratio of 3.28 was obtained, 
which is significant at the .05 level. This 
indicates that classification still contributed 
significantly to the variance with anxiety 
scores held constant. 


Discussion 


The overall tenor of the results was con- 
firmatory of Quay’s (1965) theory of psy- 
chopathy. The immediately following material 
will discuss the data in the order with which 
the major hypotheses were presented in 
Skrzypek (1967). 


1. There was a high negative correlation 
between both preference for novelty and for 
complexity and the pretest scores of the Anx- 
iety Differential. To the extent that these 
three measures possess validity, the present 
data indicate that the more anxious a person 
is, the less likely he is to choose novel versus 
familiar or complex versus simple stimuli. 
This is consistent with the studies of Berlyne 
and Lewis (1963), O'Leary (1965), and 
McReynolds and Bryan (1956). 

2. Psychopathic delinquents had signifi- 
cantly lower pretest scores on the Anxiety 
Differential and significantly higher pretest 
scores on the complexity- and novelty-prefer- 
ence measures than neurotic delinquents. The 
fact that psychopaths attained lower Anxiety 
Differential pretest scores than neurotics 
serves to substantiate clinical observations 
and research such as that by Lykken (1957) 
and Hare (1965). Quay’s (1965) hypothesis 
that the psychopath is a pathological stimulus 
seeker is also supported by the present results, 
which showed that the psychopathic delin- 
quents had significantly higher pretest scores 
on both preference for complex stimuli and 
novelty than the neurotic delinquents. 

3. Under perceptual isolation, the increase 
from pre- to posttest of complexity preference 
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TABLE 2 


CORRELATIONS BETWEEN PRETEST SCORES OF THE 
ANXIETY DIFFERENTIAL AND RANDOM SHAPES, 
NOVELTY PREFERENCE, AND BEHAVIOR 
Ratincs 


Random| Novelty Behavior rating 
Ss shape prefer- 
ence | BR-P |BR-N 
Psychopaths 
Anxiety Differential] —.844 | —.656 | —.840 
Neurotics 
Anxiety Differential] —.807 | —.868 .699 
Total Ss 
Anxiety Differential] —.550 | —.838 | —.858 | .833 


Note.—BR-P =psychopathy subscale; BR-N =neuroticism 
LU N =33 in each group. 
?«01. 


was significantly greater for psychopaths than 
for neurotics. This suggests that psychopaths 
were affected to a greater degree by the 
perceptual isolation procedure than were the 
neurotics. Perceptual isolation can be con- 
ceptualized as having produced information 
deprivation to the central nervous system, and 
this in all likelihood functioned as a motiva- 
tional variable which caused the Ss to seek 
increasing amounts of stimulus variability. 
This motivational effect was stronger for psy- 
chopaths than for neurotics. 

4. The decrease from pre- to posttest under 
the arousal condition on complexity prefer- 
ence was significantly greater for neurotics 
than for psychopathic delinquents. These re- 
sults indicate that psychopaths were not af- 
ffected by the arousal manipulation. Their 
average pre- and posttest anxiety scores were 
identical, and there were no significant differ- 
ences between their pre- and posttest scores 
on preference for complexity and novelty. 
That neurotics were affected by the arousal 
treatment was indicated by the significant 
increase in their anxiety posttest score and 
the significant decrease in their complexity 
preference behavior. These results also sup- 
port Quay's contention that the psychopath 
possesses a greater than average need for 
increases in the intensity level of stimulation. 

5. As would be predicted from Quay's 
(1965) theory, the psychopaths in perceptual 
isolation received the highest of all posttest 
scores on complexity and novelty preference, 
while neurotics in the arousal condition 
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achieved the lowest of all scores on these 
measures. 


Although there was a significant difference 
between psychopaths and neurotics in their 
pre- and posttest novelty-preference scores, 
with only one exception these scores did not 
shift significantly from pre- to posttesting 
within either group as a result of the experi- 
mental manipulations. The failure of the 
novelty-preference scores to shift significantly 
may be due to a tendency on the part of the 
Ss to be consistent in their choices of either 
the novel or familiar stimuli. Unlike the 
random shapes, the novel stimuli are obvi- 
ously different from the familiar stimuli. Once 
an S commits himself to choosing either type 
of stimuli, he is likely to continue to make 
the same type of choices in an effort to make 
himself appear consistent. That this is a 
plausible hypothesis was shown by the fact 
that out of the total number of Ss (66), 21 
had identical scores on the pre- and posttest 
of novelty preference and 35 shifted by only 
1 point. 

The fact that only a relatively brief period 
of perceptual isolation was sufficient to pro- 
duce significant changes in complexity- and 
novelty-preference behavior is encouraging for 
future research in this area. Previous studies 
using sensory deprivation techniques usually 
involved a large number of S hours (up to 3 
full days) and extensive and expensive labora- 
tory equipment, a fact which is certainly pro- 
hibitive to most researchers, The condition 
used in this study involved only 40 min. of 
individual S time and only a limited amount 
of relatively inexpensive apparatus. This tech- 
nique is well within the budget and time 
limitations of most researchers. 

In conclusion, the data reported in the 
present investigation lend strong support to 
Quay's (1965) theory of psychopathy as 
pathological stimulus seeking. The results sug- 
gest a description of the psychopath as an 
individual who lacks tolerance for even short 
periods of inactivity and sameness, and who 
is highly motivated to seek added intensity 
and variability of stimulation after experi- 
encing such sensory restriction. Furthermore, 
the psychopath seems to find himself in a 
state of stimulus deprivation even in the 
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everyday environment, as shown by the high 
pretest scores on novelty and complexity 
seeking of psychopathic delinquents compared 
with neurotic delinquents. 

The strength of these results might be in- 
creased by the use of a normal group. How- 
ever, normative data on the Anxiety Differ- 
ential for normal Ss is available. Alexander? 
tested 759 incoming college freshmen and 
found the mean of these cases to be 56.00, 
SD = 10.91. As would be predicted by clin- 
ical theory, this mean falls about midway 
between the combined pretest scores of psy- 
chopaths (39.09, SD = 8.63) and of neurotics 
(98.03, SD = 18.15). 

Quay (1965) suggested that the psycho- 
path is a pathological stimulus seeker because 
of either a lessened basal reactivity or an 
increased rate of adaptation to stimulation, 
No direct evidence bearing on this point is 
offered by the present study. However, the 
results pertaining to the Anxiety Differential 
indicate that the psychopath does not evi- 
dence much anxiety even after experiencing 
an intense arousal treatment. If one can as- 
sume that there is a positive relationship be- 
tween physiological arousal and anxiety level 
immediately following an intensive stressful 
experience, then it would seem that the psy- 
chopath is indeed an autonomically unreactive 
individual. 


8S. Alexander, personal communications, July 1966. 
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CONFLICT AND DOMINANCE WITHIN FAMILIES 
OF DISTURBED CHILDREN* 


JOSEPH BECKER? AND EILEEN IWAKAMI 


University of Washington University of Illinois 

Twenty-six sets of clinic parents with sons referred for psychological treatment 
were compared with 42 sets of nonclinic controls in a structured interaction sit- 
uation from which conflict and dominance indices were derived. Crucial concerns 
were the internal consistency of the indices, whether they measured unitary 
constructs, and whether amount of verbal activity appreciably effected the fre- 
quency-type conflict measures. Split-half reliabilities of most indices were 
quite adequate; intercorrelations between dominance indices were considerably 
lower than for conflict indices. But the latter were strongly related to amount 
of verbal activity. Interaction indices were subjected to 2X2 (clinic, non- 
clinic; child present, absent) analyses of covariance. Nonclinic parents ex- 
hibited somewhat more conflict than clinic families, but adjustment of scores 


for verbal activity considerably attenuated these differences. 


Though family interaction has long been 
implicated as a crucial etiological factor in 
personality disturbance, systematic family 
investigations are a recent development. Con- 
sequently, methodological approaches are in 
a formative stage. Fontana’s (1966) excellent 
critique of family etiological studies strongly 
endorses direct observational approaches. A 
structured interactional variant of the direct 
approach based on work by Strodtbeck 
(1951) has been used in several recent studies 
of parental dominance and conflict (e.g., 
Becker & Siefkes, 1969; Farina, 1960; 
Farina & Dunham, 1963; Hetherington, 
1965). These investigations did not assess 
whether the indices used to estimate domi- 
nance and conflict measure unitary constructs, 
nor the vulnerability of the frequency con- 
flict indices to amount of verbal activity, nor 
the internal reliability of the indices. 


1 This study was supported by United States Pub- 
lic Health Service Grant MH-06781, from the Na- 
tional Institute of Mental Health and Grant 41-32- 
66-306 from the University of Illinois Research 
Board. Appreciation is expressed to Gary F. Bean- 
blossom and Gerald Edwards of the University of 
Washington Testing Bureau and to Ethel Shaeffer 
of the Biostatistics Division of the University of 
Washington Medical School for assistance in com- 
puter analysis of these data. Portions of this paper 
were presented at the 1967 Western Psychological 
Association Meeting. 

? Requests for reprints should be sent to Joseph 
Becker, Department of Psychiatry, School of Medi- 
cine, University of Washington, Seattle, Washington 
98105. 


The present study sought to clarify these 
measurement issues within the context of 
determining whether discriminable patterns of 
conflict and dominance characterize the par- 
ents of children referred for psychological 
treatment from parents of nonclinic controls. 

A “startling” dearth of studies on relations 
between parental interaction and child behav- 
ior has been noted by Hoffman and Lippett 
(1960, pp. 960-961). Clinicians tend to as- 
sume that relations between the parents of 
behaviorally disturbed children are more con- 
flictual and less equalitarian than those be- 
tween the parents of better adjusted children. 
Several clinical (e.g., Lidz, Cornelison, Terry, 
& Fleck, 1958; Wynne, Ryckoff, Day, & 
Hirsch, 1958) and experimental studies 
(Farina, 1960; Lerner, 1965) of interaction 
between the parents of psychotically disturbed 
children support these assumptions. But find- 
ings on the parents of less severely disturbed 
children are less consistent (e.g, Bandura, 
1960; Baruch & Wilcox, 1944; Burchinal, 
Hawkes, & Gardner, 1957; Krug, 1965; 
McCord, McCord, & Howard, 1961). To what 
extent disparities between results of these 
studies are due to such variations as children's 
ages, social class sampling, and/or methodo- 
logical approach is indeterminate at present. 

Initially, it was planned to subdivide clinic 
children into groups with predominantly 
conduct (acting out, aggressive) or persona" 
ity (shy, withdrawn, anxious) disturbances. 
But since most clinic children displayed mixe 
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FAMILIES OF DISTURBED CHILDREN 


types of behavioral deviance as gauged by 
clinical evaluation and objective rating scales, 
families of clinically referred children were 
pooled in the data analyses. 


METHOD 


Subjects 


Clinic subjects (Ss) were males referred to the 
Psychological Clinic of the University of Illinois and 
their parents. Twenty-six consecutive admissions who 
met the S criteria were used. Child criteria in- 
cluded age 5-12, Caucasian, and no mental retar- 
dation, organic brain dysfunction, psychosis, or seri- 
ous physical impairment. Both biological parents had 
to be available, and able to meet intellectual, mental 
status, and organic requirements similar to those 
specified for their children. 

Nonclinic controls were solicited by mail and paid 
$10.00. Prospective volunteers were selected ran- 
domly from the rolls of schools attended by the 
dinic sample. Slightly over 2076 of the families 
written to participated. Nonclinic child-parent groups 
met the same criteria as the clinic patient-parent 
groups except that families with histories of serious 
behavioral disturbance were eliminated (five). 

The surnames of both groups appeared to reflect 
proportionately the predominantly British and Ger- 
man backgrounds of the regional population. 


Procedure 


The three family members (mother, father, son) 
were seen simultaneously, but individually, during 
the first phase of the experimental session. 

Each parent was seen briefly for socioeconomic 
data (Hollingshead, 1957), critical social history, and 
administration of the Wechsler Adult Intelligence 
Scale (WAIS), Vocabulary subtest. Parents then 
privately completed the following: (a) Behavior 
Rating Schedule (BRS) (Becker & Krug, 1964), 
ratings were obtained on the male child S, his 
sibs, self, and spouse; (b) the Femininity subscale 
of the California Psychological Inventory (CPI); (c) 
the Franck Drawing Completion Test (FDCT); (d) 
and a Child Problem Questionnaire (CPQ) (Jack- 
Son, 1956), consisting of 12 hypothetical problem 
situations, These CPQ items require an indication of 
how the parent would cope with his child's aggres- 
sion, Finally, a 5-min, spontaneous speech sample 
(SSS) about family life was tape-recorded. 

During the individual phase of the child’s testing, 
he first indicated in counterbalanced order how one 
of his parents would react to the CPQ items. Then 
the It Test of sex-typing, a five card TAT mea- 
sure of covert sex identification, and the Vocabulary 
and Block Design subtests of the Wechsler Intel- 
ligence Scale for Children (WISC) were given. Next, 
he indicated how his other parent would react to the 
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CPQ problems and concluded by recording a 5- 
min. speech sample. 

In the subsequent joint phase, parents and child 
were brought together. Both parents were present 
throughout the joint session; the child for only 
one-half. Family members were again presented with 
the CPQ. Problems were read one at a time by the 
experimenter (E). Each situation was discussed until 
a solution was reached. A counterbalanced design 
determined which half of the session the child 
would be present in, and whether odd or even 
numbered problems would be presented first. J oint 
sessions were tape-recorded. 

To the extent feasible, the Es did not know 
whether families were clinic or nonclinic ones. Data 
were coded so that scorers did not know their source. 


Interactional Indices 


The indices of conflict and dominance derived from 
the joint discussion of the CPQ were taken from 
Farina (1960).4 Farina (1960) reported the inter- 
rater reliabilities to be generally satisfactory for the 
less reliable indices (ranging from .74 to .96; over- 
all average rho reliability = .89). Interrater reliabili- 
ties on five cases for the same indices in the present 
study were slightly less (range .70 to .97; overall 
rho =.84). In addition, odd-even reliability coeffi- 
cients for the conflict-dominance indices were ob- 
tained. These Spearman-Brown corrected reliability 
coefficients are given after each index below. All 
r's were significant (p < .01, N = 68) except for 
Vielding-Maximum. A test-retest study of related 
family interaction variables by Ferreira, Winters, and 
Poindexter (1966) using a 6-mo. interval yielded 
fairly satisfactory correlations ranging from .57 to 
.76. 

Indices of conflict used were (a) Frequency of 
Simultaneous Speech (.81) (Freq. Simult, Sp.) ; 
(b) Duration of Simultaneous Speech (.84) (Durat. 
Simult. Sp); (c) Interruptions by Father (.63) 
(Interrupt.-Fa.) ; (2) Interruptions by Mother (.57) 
(Interrupt.-Mo.); (e) Interruptions-Total (Inter- 
rupt-Tot); (f) Disagreements and Aggressions— 
Father (.65) (Disagr. & Aggr.Fa.); (g) Disagree- 
ments and Aggressions—Mother (.56) (Disagr. & 
Aggr-Mo); (h) Disagreements-Total  (Disagr.- 
Tot.); (i) Failure to Agree (60) (Fail. Agree); 
(j) Verbal Activity (.87) Verb Act.. 

Indices of dominance were (a) Speaks first (.67) ; 
(b) Speaks Last (.79); (c) Speaks First and Last- 
Total (Spk. 1st & L-Tot.); (d) Passive Acceptance 
of Solution (.65) (Pass. Acc. Sol); (e) Total Time 
Spoken (.87) (Tot. T. Spkn.); (/) Yield—Maximum 
(15) (Yld-Max); (8) Vield—Minimum (40) 
(Yld.-Min.). 


3 The results for BRS sib, self, and spouse ratings, 
CPI FDCT, SSS, and It are not discussed 
because these measures are not directly relevant to 
the focus of this paper. 

4A more detailed discussion of the rationale and 
scoring of indices is available in Farina (1960). 
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RESULTS 
Supplementary Variables 


Clinic and nonclinic groups were compared 
on the concomitant variables of age, esti- 
mated IQ, education, and socioeconomic class. 
Estimated IQs and interdependent educational 
and social class levels of the nonclinic parents 
were higher than those of the clinic parents. 
The mean social class score for both groups 
fell within Hollingshead's Level IIT, but at 
opposite ends of the score range. 


Intercorrelations between Concomitant and 
Dependent Variables 


Since the pattern of intercorrelations for 
the clinic and nonclinic parents did not differ 
essentially, the groups were pooled with the 
results shown in Table 1. The parental sup- 
plementary variables on which the groups 
differed, IQ, education, and socioeconomic 
level, were highly intercorrelated. These three 
supplementary variables were positively re- 
lated to the conflict variables: Freq. Simult. 
Sp., Durat. of Sp., and Verb. Act. for both 
fathers and mothers, and to Interrupt. for 
mothers. The supplementary variables were 
unrelated to the dominance indices except 
that brighter mothers showed less passive 
acceptance. 


Intercorrelations between Dependent 
Variables 


For both fathers and mothers, the conflict 
variables were positively related to each other 
with the exception of Fail. Agree. For fathers, 
conflict variables were unrelated to dominance 
variables except for Verb. Act. which cor- 
related positively with three of the four domi- 
nance variables. For mothers also, conflict and 
dominance variables were unrelated except 
that Pass, Accept. Sol. was negatively re- 
lated to conflict measures. Unlike the conflict 
variables, the dominance variables were un- 
related to each other except for Pass. Accept. 
Sol. which was negatively correlated with 
Speaks First and Last. The extent of the 
intercorrelations between Verbal Activity and 


5 An extended report is available from the senior 
author containing tables of these and other data 
from this study, 
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the other conflict measures should be noted 
especially. 

The intercorrelations of Table 1 yielded 
stronger internal consistency support for a 
dimension of conflict than of dominance. Be- 
cause of the high intercorrelation between the 
supplementary variables and the similarity of 
their relations to the dependent variables, 
scores in subsequent analyses were adjusted 
for only the supplementary variable of socio- 
economic class. 


Analyses of Conflict Indices 


The scores for each conflict index were 
initially analyzed by a 2 X 2 (clinic, non- 
clinic; child present, child absent) repeated 
measures analysis of covariance adjusting for 
socioeconomic class and using an unweighted- 
means solution for the unequal group sizes ' 
(Winer, 1962, pp. 374-378). Several trends 
were apparent. Nonclinic parents consistently 
scored higher on the conflict variables and sig- 
nificantly so on Freq. Simult. Sp. (5 < .01), 
Interrupt-Fa. (p< .05), Interrupt.-Total 
(p < .05), and Verb. Act. (p < .005). The 
present-absent effect was significant only for 
Disagr. & Aggr-Fa. (p < .025), Disagr. & 
Aggr.-Tot. (5 < .025), and Verb. Act (p< 
.001), with conflict always higher under ab- 
sent conditions. The only significant inter- 
action was on Verb. Act. (p < .05). This 
interaction resulted from a sizable increase 1n 
verbal activity during the child's absence by 
nonclinic parents, whereas the verbal output 
of clinic parents remained relatively constant. 
Nonclinic parents tended to be more con- 
flictual in the child's absence on every Con- 
flict variable but one (Fail. Agree), whereas 
no pattern was evident for the clinic parents. 

Interpretation of the results on these 
frequency measures is complicated by the role 
of verbal activity. Multiple covariate analyses 
of variance adjusting the log transformed con- 
flict scores for both socioeconomic class an 
verbal activity resulted in a significant clinic- 
nonclinic difference on Freq. Simult. Sp. 
($ < .05) only. Absence of the child was 
related to higher conflict between the pud 
on Disagr. & Aggr.-Fa. (F = 9.11, df = 1/6 
p< .005), Disagr. & Aggr.-Mo. (FS 
dj = 1/65, p «.05), and Disagr. & Age 
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Tot. (F = 8.52, df 1/65, p < .005). None 
of the interactions approached significance. 


Analyses of Dominance Indices 


Scores on the dominance indices were also 
analyzed by a 2 X 2 repeated measures analy- 
ses of covariance (Winer, 1962, p. 312) with 
scores adjusted for group differences in socio- 
economic class. None of the F tests for main 
effects or interaction on any of the dominance 
variables were significant. Fathers tended to 
be dominant on all dominance variables for 
both groups with nonclinic fathers consist- 
ently, though not significantly, more dominant 
than clinic fathers. 


Behavior Rating Schedule 


Parents’ ratings on each of the subscales 
of the BRS were subjected to an analysis of 
covariance adjusting the scores of each group 
for the parents’ estimated IQ and socioeco- 
nomic class. Clinic mothers (CM) and fathers 
(CF) rated their children’s behavior as sig- 
nificantly more deviant than the control 
mothers (NCM) and fathers (NCF). Paren- 
tal ratings of children in both groups were 
highly correlated (7s for all subscales sig- 
nificant at p < .01 for all subscales in both 
groups). 

Clinic mothers rated themselves as signifi- 
cantly more deviant than nonclinic mothers, 
whereas clinic and nonclinic fathers’ selí- 
ratings did not differ. Clinic mothers rated 
themselves as more deviant than their hus- 
bands, whereas the nonclinic parents’ self- 
ratings did not differ. 


Discussion 


Despite their face validity, the intercor- 
relations among the dominance indices afford 
negligible support for viewing them as mea- 
sures of a unitary construct. While partialing 
out the effects of verbal activity on inter- 
correlations between the frequency-type con- 
flict -indices attenuates their relatedness, 
moderate support does remain for their tapping 
a common construct. Since most of the domi- 
nance and conflict indices have respectable 
split-half and interrater reliability, further 
investigation of their validity and correlates 
is indicated. Previous studies using these 
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indices without correction for the effects of 
verbal activity must be interpreted cautiously, 
If Farina’s (1960) finding of greater verbal 
activity when parents cannot agree is repli- 
cated, it would be useful to evaluate conflict 
findings with and without verbal activity par- 
tialed out. The present study had too few 
failures to agree to meaningfully assess this 
problem. 

Evaluation of the substantive findings of 
this study is hampered by the possibilities of 
sampling bias inherent in the 20% volunteer 
rate for controls. 

The finding that nonclinic parents inter- 
acted somewhat more conflictually supports 
the contention that qualitative aspects of 
intrafamilial conflict may be more related 
to psychopathology than quantitative ones 
(Spiegel, 1957; Wynne et al., 1958). These 
investigators argue that realistic adapta- 
tion is fostered by exposure to the explicit 
identification of distinctive viewpoints, their 
articulation, mutual respect for differences 
and constructive conflict resolution. Empirical 


support for these theoretical speculations ` 


are indirectly provided by several studies 
(Ferreira, Winter, & Poindexter, 1966; Fisher, 
Boyd, Walker, & Sheer, 1959; Singer & 
Wynne, 1963). As with anxiety, it appeats 
unlikely that amount of conflict per se beats 
any simplistic functional relation to severity 
of psychopathology. Why greater conflict 
occurred between the nonclinic parents In the 
child's absence is unclear. Possibly nonclinic 
parents exercise greater judgment and con- 
siderateness of their children by reducing td 
verbal activity which permits the chi 
greater expression. 

In conclusion, the BRS r 
validation as a classification instr i 
previous MMPI findings of more selí-repor' FÉ 
disturbance in clinic mothers than father 
controls (Goodstein & Rowley, 1961) e 
supported. The relatively high clinic S 
parental correlations on child Wes 
not support etiologic notions of neu s 
based on discrepant parental perception 
the child. 


eceived further 
ument, all! 
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EYELID CONDITIONING AND PERSONALITY: 
POSITIVE RESULTS FROM NONPARTISANS 


ELLEN V. PIERS! anp ELIZABETH P. KIRCHNER ? 


Pennsylvania State University 


The Ss selected on the basis of either high- or low-response probability on the 
last 60 of 300 conditioning trials obtained significantly different scores on 
measures of manifest anxiety, extraversion, and neuroticism, and significant 
correlations occurred among these personality measures. The results are dis- 
cussed in terms of the theoretical positions of Spence and Eysenck. 


Investigators concerned with relationships 
between eyelid conditioning and personality 
. variables have typically offered support for 
the hypotheses of either Spence (e.g., Taylor, 
1951) or Eysenck (e.g. Franks, 1957), or 
they have reported negative findings (e.g., 
Franks, 1963; King, Kimble, Gorman, & 
King, 1961) which the appropriate theorist 
has claimed resulted from the absence of 
certain necessary conditions. 

K. W. Spence alone (1964) and in col- 
laboration (J. T. Spence & K. W. Spence, 
1966; K. W. Spence & J. T. Spence, 1964, 
1966), working within the Hullian frame- 
work, has argued for a positive association 
between conditionability and anxiety, as mea- 
sured by the Manifest Anxiety (MA) scale 
(Taylor, 1951). This relation is derived from 
the assumption that the degree to which an 
individual admits to characteristically exhibit- 
ing manifest symptoms of anxiety, as de- 
scribed by items on the scale, will be related 
to the magnitude of his emotional responsive- 
ness and therefore to the level of drive (D) 
in a conditioning situation. K. W. Spence 
considered that results from numerous studies 
at Iowa and elsewhere generally supported 
his position. He explained the failure of other 
investigators to reproduce his positive results 
as a function of such variables as the follow- 
ing: too small a sample, nonnaive Ss, 
laboratory conditions which failed to arouse 
emotionality, and inclusion of voluntary re- 
sponders. He and J. T. Spence (1964) re- 

1 Requests for reprints should be made to Ellen V. 
Piers, Department of Psychology, Pennsylvania State 
University, University Park, Pennsylvania 16802. 

?'The authors wish to thank David Higgins for 


his kindness in making available the conditioning 
data. 


ported high correlations between the MA scale 
and Eysenck's Neuroticism (N) scale and 
positive correlations between N and condi- 
tioning but no significant correlations be- 
tween Eysenck's Extraversion (E) scale and 
conditioning. 

Eysenck (1965) has hypothesized that con- 
ditioning is correlated positively with intro- 
version (or negatively with extraversion) on 
the basis of Pavlovian theory that cortical 
inhibition depresses conditioning, and from his 
assumption that cortical inhibition is charac- 
teristic of extraverts, while cortical excitation 
is-characteristic of introverts. Using his 
Maudsley Personality Inventory (1962) which 
gives E and N scores, he and his colleagues 
reported significant negative correlations be- 
tween conditioning and E which he felt were 
a function of the fact that experimental con- 
ditions conducive to inhibition have been met. 
These conditions included (a) partial as com- 
pared with complete reinforcement, (b) We 


as opposed to strong CS and UCS, and (0) | 


i ingle 
discrimination learning as opposed to SIDE 


stimulus conditioning. He reporte 
no correlation between conditioning and 
Eysenck has acknowledged d 
Kus may, on occasion, correlate posit 
with conditioning performance. He previo hip 
(1957) hypothesized that such a rela 
comes about because of the correlation 0 13 
MA scale with extraversion rather than 
cause of the observed correlation. of zn ut 
scale with neuroticism. This position ollo 
from his theory that neuroticism an f pe 
version are orthogonal dimensions EC 
sonality, with conditioning related not fo 
extraversion-introversion dimension bu tried 
neuroticism. More recently (1965) he has 
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that MA scale | 


d extra- | 


EYELID CONDITIONING AND PERSONALITY 


to incorporate Spence’s findings by distin- 
guishing between autonomic arousal (brought 
about by emotion-producing conditions) as 
opposed to cortical activation (excitation) 
which is related to the E-I dimension. Ac- 
cording to this position, he predicted that 
autonomic arousal results in correlations be- 
tween N and conditioning while inhibition- 
producing conditions lead to negative correla- 
tions between conditioning and E and little 
or no relation between conditioning and N. 

The purpose of the present study was to 
throw further light on these two positions 
from a neutral theoretical viewpoint, taking 
into account the specific experimental condi- 
tions proposed by both theorists. In contrast 
to most studies which have investigated 
personality-eyelid conditioning variables by 
selecting Ss on the basis of their personality 
scores and reporting the subsequent perform- 
ance of these Ss on 50 or fewer conditioning 
trials, the present study utilized Ss selected 
on the basis of their eyelid conditioning over 
300 trials and reports subsequent performance 
on personality measures. 


METHOD 
Subjects 


The original Ss were 270 students of both sexes 
who had participated in an eyelid-conditioning study 
while enrolled in the introductory psychology course 
at Pennsylvania State University. No requirements 
were made as to naïveté concerning psychology ex- 
periments. The Ss for the present study were 45 
of the highest conditioners and 34 of the lowest 
conditioners from the original pool. Fifty-five of the 
Ss were male and 24 were female. This is approxi- 
mately the ratio of males to females in the University 
undergraduate population. 


The Conditioning Study 


The conditioning apparatus and procedure are de- 
scribed elsewhere (Prokasy, Carlton, & Higgins, 
1967). For purposes of the present study it is suf- 
ficient to report that (a) the CS and UCS can be 
considered relatively weak, (b) CS-UCS intervals of 
360, 500, and 860 msec. were used and found not 
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to be significantly related to mean response proba- 
bility; hence data from the three CS-UCS intervals 
were combined, (c) there was a 50% partial rein- 
forcement schedule subdivided into three patterns 
of reinforcement which are irrelevant to the present 
study except insofar as it was necessary to assure 
equal representation of each pattern, (d) there was 
no attempt to arouse emotion, either by the ap- 
paratus, by surroundings, or by the taped instruc- 
tions which instructed Ss simply to sit quietly and 
let their reactions take care of themselves, and (e) 
for each S the 300 conditioning trials took the better 
part of 1 hr. 


Procedure 

For the present study the 270 Ss were ranked on 
the basis of their mean response probability on the 
last 60 of the 300 conditioning trials, and attempts 
were made to find current addresses for Ss in roughly 
the upper and lower quartiles of this distribution. 
This method resulted in the formation of two groups, 
each with 63 potential Ss, who were sent the MA 
scale and the Maudsley Personality Inventory 
(MPI) with a cover letter explaining the project 
and offering them $1.25 for completion of the scales. 
Forty-five high conditioners (HC) and 34 low condi- 
tioners (LC) returned the completed scales, (In- 
ability to assume that the material actually reached 
all potential Ss precludes any attempt to interpret 
the differential rate of return.) For the HC group 
the mean response probability of the last 60 trials 
was .89 +.09; for the LC group the probability was 
18 + .14, 


RESULTS 


Before analyzing the scores of the HC and 
LC Ss, means and standard deviations on the 
personality variables for the total group and 
for each sex were computed and found to be 
similar to other United States college student 
populations (Knapp, 1962; Spence, Farber, 
& Taylor, 1954). These data are shown in 
Table 1. Since mean sex differences were 
found to be nonsignificant on all measures, 
data for both sexes combined were used in 
subsequent analyses. 

Table 2 presents the MA scale and MPI 
means of the HC and LC Ss. It can be seen 
that HCs were characterized by significantly 
higher scores on all three personality vari- 


TABLE 1 
MEANS AND STANDARD DEVIATIONS OF Ss ON PERSONALITY MEASURES 
s Females Total 
Measure tes NS (N ee T9) 
Manifest Anxiety scale (MA scale) 13.18 + 7.58 14.96 + aie 18 = 15 
Extraversion (MPI) 30.29 + 9.37 28.96 + 10.6: de Ris 
Neuroticism (MPI) 20.51 + 10.39 21.04 + 10.95 A 
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TABLE 2 


COMPARISONS OF PERSONALITY SCORES FOR PRE- 
SELECTED HicH AND Low CONDITIONERS 


Group 
Variable t 
[o 
(N = 48) (N = 34) 

"Manifest Anxiety scale 

MA scale 16.04 + 8.53 | 10.65 + 5.91 | 3.32** 
Extraversion (MPI) 28.02 + 10.84 | 32.35 + 7.45 | 2.10* 
Neuroticism (MPI) 24.18 + 10.54 | 16.03 + 8.55 | 3.79** 


* p « .05 (one- or two-tailed test). 
** p « .01 (one- or two-tailed test). 


ables, although differences between E scores 
were significant at a lower level (p < .01 for 
MA scale and N, p < .05 for E). 

Table 3 presents the intercorrelations be- 
tween conditioning and the personality vari- 
ables. Inspection shows that all correlations 
were significant beyond the .01 confidence 
level except that for Conditioning and Extra- 
version, which was significant only beyond 
the .05 level. It must be remembered that 
these correlations are based on extreme scores 
which were used in order to make the results 
comparable to many of the Spence studies. 
It is possible, therefore, that correlations over 


% RESPONSE PROBABILITY 


—O— HIGH ANXIETY 
—O— LOW ANXIETY 


49. 3. On aoe SO 
BLOCKS OF 30 TRIALS 
Fic. 1. Conditioning scores for high- and 
low-anxiety Ss. 


ELLEN V. PIERS AND ELIZABETH P. KIRCHNER 


the whole distribution of conditioning scoi 
might be slightly lower. J 

To determine the degree to which the 
relation between the MA scale and conditio 
ing might be attributed to the relation 
tween these variables and E, a partial r 
computed, removing the effects of E. 
original r of .30 between conditioning 
MA scale was thereby reduced to .24 w 
was still significant at the .05 level. On t 
other hand, when a similar manipulation 
moved the effects of N, the correlation b 
tween MA scale and conditioning dropped 
.06, which was not significantly diffe 
from zero. 

One of the unique features of the pi 
study was the very large number of conditio 
ing trials. To test statements (e.g, J 
Spence & K. W. Spence, 1966) that 
quency curves for high- and low-anxiety levi 
will gradually diverge over trials and also; 
observe the effect of a longer experim 
period on the two levels, conditioning sco 
were plotted for Ss in the upper and 
20% of the MA scale distribution (raw s 
21 or over and 7 or under, respectivel 
These curves are shown in Figure 1. It cam. 
be seen that divergence did in fact occur in 
course of the early trials and that these di 
ences then remained fairly constant through: 
out the whole conditioning period, while | 
mean response probabilities appeared to reac 
asymptote late in the period, after app! 
mately 240 trials. (The same general eff 
but less marked, was evident when condition- 
ing scores of Ss in the upper and lower 20 
of the E distribution were plotted.) 


Discussion 


The standard conditioning procedures 
which Ss of this study were exposed were 


TABLE 3 


CORRELATIONS AMONG CONDITIONING AND 
PERSONALITY MEASURES 


i Jg 
Measure Gondi- | MA scale E 
Manifest Anxiety Scale 
MA scale .305** n 
Extraversion (MPI) — .223* |= 3997 
Neuroticism (MPI) 41**| .794**. — 
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designed to be either emotion arousing or 
inhibition producing. The large original pool 
of Ss did permit comparisons of an adequate 
sample at the extremes of conditionability. 
Also, relatively weak CS and UCS were em- 
ployed under partial reinforcement and 
massed trials. Other variables specified by 
Spence or Eysenck were not considered in the 
selection and treatment of Ss. 

Under this empirical approach, results sup- 
ported the relationship between conditioning 
and both Extraversion and Anxiety (Neuroti- 
cism). In other words, without attempting to 
fulfill most of Spence’s conditions, a cor- 
relation was obtained higher than he, himself, 
has reported (1964) for conditioning and the 
MA scale. Contrary to his findings, a signifi- 
cant correlation between conditioning and E 
was also found. 

Eysenck (1965) has suggested that under 
ordinary conditions of eyelid conditioning, 
correlations with neither E nor N are likely 
to appear. Under inhibition producing condi- 
tions he reported no significant correlations 
between conditioning and N. Since positive 
correlations were found for both E and N, 
the present findings do not support either of 
these positions. Further, the substantial nega- 
tive relation between E and N lends weight 
to the accumulating evidence (Jensen, 1965) 
hat for college students, as well as psy- 
chiatric populations, these dimensions can 
hardly be considered orthogonal. Finally, the 
correlation between MA scale and condition- 
ing does not appear to be due, as Eysenck 
hypothesized earlier, to their relationship with 
Extraversion. 

Although Spence's main position is sup- 
ported to a much greater degree than is Ey- 
senck’s, it would seem that neither Eysenck's 
nor Spence's recent explanations can account 
for all the significant relationships found in 
his study. These positive data present a dif- 
ferent problem than do negative data in test- 
ing a theory. The failure to find relationships 
where they are predicted can be ignored or 
explained away on the grounds that necessary 
conditions have not been met. On the other 
hand, when significant relationships are ob- 
tained empirically and persist over some 
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period of time, in the absence of some of the 
proposed conditions, they must be acknowl- 
edged. Existing theories would thus appear to 
need modification to incorporate findings such 
as are presented here. 
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REMEMBRANCE OF FEELINGS PAST: 
A STUDY OF PHENOMENOLOGICAL GENETICS * 


ANNE-MARIE ALLERAND BLOCH °? 


Teachers College, Columbia University 


The hypothesis that subjective experience may be partially genetically de- 
terminated was tested with a sample of 24 pairs of monozygotic (MZ) twins, 24 
pairs of dizygotic (DZ) twins, and 24 pairs of singleton siblings (Sibs). The 
verbal reports of their experiences of eight emotional states were content 
analyzed, and the intrapair similarity of the descriptions was evaluated. Since 
the MZs were significantly more alike than either the DZs or Sibs, who did 
not differ from each other, the presence of a genetic factor was inferred. The 
general language patterns of twins were also investigated, with results in- 
dicating no greater similarity among MZs than among DZs. 


This study is concerned with exploring the 
hypothesis that the language of emotional ex- 
perience is partially determined by genetic 
factors and to this end has compared the re- 
ports of emotional experiences written by 
monozygotic (MZ) twins, dizygotic (DZ) 
twins, and singleton siblings (Sibs). The role 
of twinship per se as a possible factor con- 
tributing to experiential similarity was also 
investigated. 

A basic assumption is that verbal reports 
reflect physiological reactions which are pre- 
sumably characteristic of emotional states. 
While there is not necessarily a one-to-one 
relationship between physiological events and 
verbal descriptions, and while there are obvi- 
ously many factors likely to interfere with a 
report of physiological experiences, it never- 
theless seems reasonable to assume that, for 
example, two persons reporting that they feel 
flushed are experiencing similar physical sen- 
sations. The few studies dealing with verbally 
reported correlates of physiological reactions 
relevant to emotional states tend to support 
this basic assumption (Malmo & Shagass, 


1 This study was based on a doctoral dissertation 
submitted to the Department of Psychology, Teach- 
ers College, Columbia University in partial fulfill- 
ment of the requirements for the doctoral degree. 
The author wishes to express her appreciation to the 
members of her dissertation committee, Rosedith Sit- 
greaves and David Ricks, and especially to her 
chairman, Joel R. Davitz, for his encouragement and 
guidance throughout this research. 

2 Requests for reprints should be directed to Anne- 
Marie Allerand Bloch, 5369 Broadway, Oakland, 
California 94618. 
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1949; Mordkoff, 1964; Ruesch & Finesing 
1943; Theron, 1948). 

Although situational (Hunt, Cole, & 
1958) and cognitive (Schachter, 1964) 
have been shown to be determinants of e 
tional states, it is clear that emotional ri 
tions also involve certain physiological event 
Efforts have been made to identify spe ifi 
physiological responses associated with p 
ticular emotional states, but these have 
met with consistent success (Arnold, 19 
Ax, 1953; Graham, 1955; Schachter, 19 
Other studies, however, have unequivo 
demonstrated individual differences and 
sistencies in physiological reactions (D | 
1946; Lacey & Van Lehn, 1952; Schachtel 
1957). Research with both animal and hu a 
Ss supports the hypothesis of a genetic b 
for these individualistic patterns (D 
1962; Richmond & Lustman, 1955), 
there is evidence that certain constituti 
factors play a role in determining subjective 
responses to distress (Schaffer, 1966). f 

Studies of twins and their families 5 
also found evidence of a genetic basis 
certain physiological functions (Duffy, 19 
Jost & Sontag, 1944) as well as for som 
psychological traits. Of particular relevance? 
the finding that both sociability (Scarr, 19 
and traits associated with the dimension? 
introversion-extraversion (Eysenck, 19 
Shields, 1962) have strong genetic dete 
nants. It seems plausible that differe! 
along these dimensions would be reflect . 
subjective experiences. The demonstration. 
a neurophysiological basis for introversi 


en 


SUBJECTIVE EXPERIENCES IN TWINS 


extraversion by Shagass and Kerenyi (1958) 
is also of considerable interest. 

Although it is assumed that an individual’s 
genetic inheritance influences his physiologi- 
cal reactions during emotional states and thus 
his verbal descriptions of these states, it is 
not assumed that genes operate independently 
of environmental influences; they are always 
interacting with both the genetic and extra- 
organismic environments. Therefore, it is hy- 
pothesized that persons with similar genetic 
backgrounds may not only tend to be predis- 
posed to certain internal responses, but may 
also tend to elicit similar environmental reac- 
tions. For example, a smiling, active infant is 
more likely to get attention from adults than 
a quieter, less responsive one. Since his innate 
responsiveness (Freedman, 1965) and activity 
(Scarr, 1966) are constantly being reinforced, 
perhaps this child comes to value interper- 
sonal relations more and tends to construct 
his world more in terms of the degree of com- 
fort of these relationships than a less re- 
sponsive child whose subjective experiences 
might reflect his lesser interest in people, 
stemming from his relative lack of experience. 

Moreover, assuming that greater similarity 
of environment leads to more similar experi- 
ences and thereby increases the likelihood of 
similar reactions, twinship per se may be a 
significant determinant of emotional similar- 
ity. For this reason a comparison of the re- 
ports of the DZ twins and Sibs was planned 
since the former are no more alike genetically 
than the latter, sharing on the average about 
50% of their genetic material. 


METHOD 


Verbal descriptions of eight emotions, Anger, Affec- 
tion, Delight, Disgust, Excitement, Fear, Sadness, 
and Worry, were obtained from a sample of MZ 
twins, DZ twins, and Sibs, These descriptions were 
content analyzed, and on the basis of this analysis, 
the intrapair similarity of the descriptions was eval- 
uated. A detailed explanation of the procedure and 
the system of categorization may be found elsewhere 
(Allerand, 1967). 


Subjects 


The Ss, 13-18 yr. of age, came from the San 
Francisco Bay Area. About 2596 of them were identi- 
fied through one large public school system and a 
few pairs through Scout leaders. Most, however, 
were known to someone already participating in the 
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TABLE 1 
AGE (IN YEARS) AND VOCABULARY SCORES 


MZ DZ Sib 
Variable 
M SD M SD M SD 
Age 160 | 13 | 158 | 13 | 159 | 12 
Vocab* | 10.0 | 24 | 104 | 24 | 104 | 2.5 


Note.—N = 144 Ss. 
aN = 20 multiple-choice items. 


study. All the pairs were of the same sex, and when 
12 pairs of males and 12 pairs of females within 
each of the three groups had agreed to participate, 
the group was considered closed. The total sample 
included 24 pairs of MZ twins, 24 pairs of DZ 
twins, and 24 pairs of siblings, 18 mo, or less apart 
in age. The Ss, all native-born, white Americans, 
were on the average 16 yr. old and attended either 
public or parochial schools. With respect to both age 
and verbal intelligence the groups were remarkably 
similar (Table 1). The vocabulary scores fell very 
close to the norms obtained by Miner (1957), indi- 
cating that the present sample was intellectually 
representative of an average group of high school 
students, An estimate of their socioeconomic status 
was made on the basis of parental occupation, when 
it could be specified. According to a 7-point scale 
(Warner, Meeker, & Eells, 1949), MZ twins were 
more heterogeneous than either DZs or Sibs, who 
were restricted to the top four social classes. 


Zygosity Determination 


The zygosity of the twins was determined by ad- 
ministering a questionnaire devised by Nichols and 
Bilbro (1966), which enabled them to determine 
zygosity with about 93% accuracy when cross- 
validated against extensive blood typings. While all 
the twins were actually diagnosed MZ or DZ on 
the basis of this questionnaire, two genetically de- 
termined characteristics were noted at the time of 
interviewing, earlobe type (free or attached) and the 
presence or absence of middigit hair. Any differ- 
ences on these characteristics was to be weighed 
toward a diagnosis of dizygosity in the event that 
the questionnaire answers were inconclusive. This did 
not prove necessary, but it was noted that in no 
event did discordance on these characteristics occur 
on questionnaire-diagnosed MZ twins. The four pairs 
of twins diagnosed monochorionic (identical) ac- 
cording to their hospital records were also found to 
be identical by the questionnaire. 


Instrument and Procedure 


The questionnaire consisted of two parts. The first 
part contained the identifying information (put 
aside upon completion until the analysis of the data 
was finished), the zygosity-determination questions, 
and the material dealing with the subjective experi- 
ences. After the zygosity questionnaire had been 
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completed by the twins, Ss read directions which 
introduced them to the subject of emotional descrip- 
tions and defined the task for them, They then 
turned to the first emotion and described a situation 
which made them feel delighted, for example. Follow- 
ing this description, they were asked to describe how 
they felt during the experience of Delight. Prior to 
their actually beginning to write, any questions they 
had were dealt with, and they were encouraged to 
try to recapture and to describe their feelings as 
vividly and as richly as they could. The distinction 
between the situational descriptions and the descrip- 
tions of their feelings was stressed at this time. The 
eight emotions were presented in random order, the 
questionnaires were randomly numbered, and all the 
protocols were content analyzed without knowledge 
of which group S was in. 

The second part of the questionnaire consisted of 
several brief subtests. The first of these was a 20- 
item vocabulary test (Thorndike & Gallup, 1944), in- 
cluded because one would expect verbal intelligence 
to be a factor influencing the language of emotional 
experience, and any differences appearing along this 
dimension among the groups would have to be taken 
into account when interpreting the data. 

An abbreviated (50-item) adjective checklist was 
the next subtest. The Ss were asked to describe them- 
selves and then at the end of the questionnaire to 
describe their co-twin or sibling, using only 15 
adjectives. Within each of the three groups, half the 
pairs described themselves first and half described 
their partner first. Two measures based on these 
checklists were obtained: (a) Assumed Similarity, 
defined as the number of adjectives checked in com- 
mon on the self-descriptions and the descriptions of 
the twin or sibling, and (b) Actual Similarity, the 
number of adjectives checked in common on the 
self-descriptions of each member of a pair. 

The first 10 questions from the Comprehension 
subtest of the Wechsler Intelligence Scale for Children 
(WISC) appeared next in the questionnaire. These 
were included with the idea of looking at the simi- 
larity of the language of twins in response to a 
standard stimulus in a “nonemotional” context. 

Finally, the twins completed a checklist concerning 
the activities they did together, such as belonging to 
the same clubs, going to parties, etc. This was a 
rough measure of the degree of similarity of en- 
vironment and was included since similarity of en- 
vironment might affect the similarity of language. It 
consisted of 12 true-false items, scored in the direc- 
tion of independence (false). The score was the total 
number of false responses made by a pair. 


Administration 


The questionnaires were administered in Ss’ homes. 
Nearly always, those who knew each other were 
tested together in small groups. In the few cases 
where this was impossible to arrange, the exhorta- 
tion that they not discuss the questionnaire seems to 
have been effective, since one S volunteered that he 
had tried to find out what the study was about, but 
that his friend, an earlier S, had refused to tell him. 
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However, pair members were always tested to- 
gether. The time taken to complete the questionnaire 
ranged 14-3 hr. 


Analysis of Emotional Descriptions 


Once the emotional experiences were obtained, 
they were analyzed according to a modified version 
of Farmer's (1967) system of classification, The sys- 
tem divided all possible responses into a number of 
categories, the most important of which were: 


1. Physiological responses. These references to 
awareness of immediate stimulations from the body 
included 20 major categories consisting of 41 sub- 
categories, many reflecting varying levels of intensity 
of feeling (high, medium, low) or distinguishing 
between sensations felt globally and those experienced 
in a specific part of the body. Some of the major 
categories were Activation Level (AL), Discomfort 
(D), Tension (T), Weight (W), and Cardiovascular 
(C). Examples of responses falling under these cate- 
gories follow, and the appropriate subcategory is also 
indicated: “I was in a frenzy" (AL, high); “I feel 
tired” (AL, low); “I ache all over” (D, general) ; 
“My jaw feels stiff and tight" (T, specific) ; “My 
head felt like a ton” (W, heavy); “My heart starts 
to pound” (C, intensification). 

2. Functioning. This involved references to emo- 
tional, motor, cognitive, sensory, and speech func- 
tioning while experiencing the emotion. "These five 
categories were subdivided virtually the same way: 
Intensification, Enhancement, Depression or Slowing 
Down, and Dysfunction. Examples include the fol- 
lowing: “I can't think straight" (Cognitive Dys- 
function) ; “I feel graceful” (Motor Enhancement) į 
*My senses seemed dulled" (Sensory Depression) j 
*[ was higher than high on liquor" (Emotional Tn- 
tensification) ; “I can't talk" (Speech Dysfunction), 

3. Expressive behavior. This category included 
references to laughing, smiling, crying, gesturing, pni 
noninterpersonal shouting (e.g, “I laugh a lot”; “I 
felt like crying”). d 

4. Interpersonal relations. References in this cate- 
gory referred to a relationship between $ and lu 
other person or animal, including such major sub- 
categories as Moving Toward, Moving Away, am 
Moving Against another. Moving Toward subs 
categories ranged from General Positive Feeling cal 
felt friendly") to Wish for Proximity or Physi 
Contact (“I wanted to hug him"), with categories 
in between these extremes covering various degrees 
of closeness, Similar gradations of feeling were ie 
flected in the categorization of responses falling * 3 
the category, Moving Away. These ranged 2) 
Lack of Identification (“I was revolted by ku) 
and Avoidance of Interaction ("I did not "e felt 
talk to anyone”) to Physical Withdrawal ( ed"). 
like hiding") and Isolation (“I felt abandon po 
The category Moving Against included such 
categories as General Negativism (“I was d not 
with them”), Passive Negativism (“I pretende 
to hear her"), Verbal Aggression (“I gave an 
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stort”), and Physical Attack (“I punched him 
» 


Attitudes. These references to a system of be- 
or feelings usually bound to an object and in- 
lving an evaluation of it included general atti- 
as well as attitudes toward the self. Positive 
gs, negative feelings, and hostile or self-destruc- 
ve feelings were among those distinguished. 
Change. Here references were to wishes either 
‘resist or hasten change. 

Control. References in this category were to 
us degrees of control, ranging from a sense of 
control to one of freedom (“I keep a tight grip 
self”; “I feel happy-go-lucky”), 

Time. These references were to a sense of the 
of time, either of its speeding by or of its 
ig down. 


e extent to which these categories were sub- 
d has been described at length (Allerand, 
7). In all, there were 127 different categories. For 
putational purposes no distinctions were made 
ing them; they all counted equally in the totaling. 
nce Farmer's (1967) categories were developed 
material gathered from children, some changes 
yere made to do justice to the more elaborated 
onses of the present adolescent sample. Addi- 
0 subcategories were added when a given cate- 
Bory could logically be subdivided further. In this 
dy, for example, the category Time was divided 
(a) Sense of Speeding of Time, and (b) Sense 
owing of Time. This older sample, as it turned 
did make fairly frequent references to each of 
subcategories, Very few of the categories were 
ed entirely from the present sample. One that 
however, was Color, which had not been an- 
ted, but was included to cover such rare 
onses as “Everything was yellow,” and “I feel 
le," remarks made by identical twin girls with 
rence to two different emotions, 


liability 
Fifty descriptions of emotional experiences were 
d in the reliability check. Two judges coded 


, making 460 judgments in all and agreeing 
of the time, 


ity Evaluation 


hen all the data had been coded, the protocols 
identified, and the intrapair similarity was 
ted. Each time both members of a pair re- 
d the same code, that is, used the same sub- 
ory in their descriptions of the same emotion, 
" was scored. For example, if 5 said that when 
elt delighted, he was "ready to explode," and if 
is twin reported feeling "ready to burst with de- 
” a "hit" was recorded, since both these re- 
onses fall under the same category (Pressure). If 
t two responses had referred to two different 
ns, Anger and Delight, for example, then a hit 
d not have been counted. 
be considered a hit, responses had to refer to 
similar experiences, Some of the physiological 
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responses, for example, were subdivided to dis- 
tinguish between global and local bodily feelings. 
"Thus in describing Fear, one twin might say he felt 
tense all over, while his co-twin felt his stomach 
tied up in knots during his experience of Fear. 
Though both report feeling tension when afraid, a 
hit would not be scored since their responses fall 
into two distinct categories (Tension, general and 
Tension, specific). 

Another example of a hit would be if S wrote 
that when he felt angry, he had an impulse to 
“shout insults at someone,” and if his twin described 
“speaking sarcastically” when he was angry. In this 
case, both responses would be scored as Verbal 
Aggression. However, if the second twin had said he 
felt like throwing something at someone, a hit would 
not have been recorded since this response consti- 
tutes a separate category, Desired Physical Attack, 


Emotional Similarity Index 


When all the data had been analyzed for "hits," 
that is, for identical subcategories within the same 
emotion, the average number of categories used 
across all the emotions was computed for each pair. 
The proportion of hits to mean categories was then 
calculated, For example, if Twin A and Twin B 
used 25 and 35 categories in their respective descrip- 
tions, and if they used the same categories in de- 
scribing the same emotion 10 times (10 hits), then 
their overall proportion of similarity would be 10/30 
or 33, This proportion is referred to as the Emo- 
tional Similarity Index (ESI). 


RESULTS 


Among the MZ twins the mean ESI (.201, 
a = 08) was found to be considerably higher 
than that of either DZ twins (.147, ø = .06) 
or siblings (.131, « = .06), Since the ESI was 
computed as a proportion of categories used 
in the descriptions of emotions, a square-root 
transformation of the scores was performed to 
insure the homogeneity of the variance among 
the three groups. These transformed scores 
were used to compute the analysis of variance 
summarized in Table 2, which indicates a sig- 


TABLE 2 


ANALYSIS OF VARIANCE OF THE EMOTIONAL 
SIMILARITY INDEX 
(TnANsrORMED SCORES) 


Source 
| 
Groups (MZ, DZ, Sib) 

Sex 
Interaction 


Error 
Total 


*p «0l. 
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nificant difference (F = 5.63, p < .01) among 
the groups. Neither the difference between 
pairs of male and female Ss nor the interac- 
tion Sex X Group identity was significant. 

In view of the significant differences found 
in the analysis of variance, two £ tests were 
computed, one to evaluate the difference be- 
tween the MZs and DZs, and the other to 
compare the DZ twins and Sibs. Since these ¢ 
tests were done after the analysis of variance, 
a significance level of .02 (two-tailed) was 
used instead of a level of .05. This more con- 
servative significance level was adopted in 
the spirit of the multiple-comparison proce- 
dure proposed by Dunn (1961). The differ- 
ence between the transformed means of the 
MZ and DZ twins (.44 and .37, respectively) 
was significant beyond the .02 level (t= 
2.43), while the difference between the means 
of the DZ twins and the Sibs (.37 and .35, 
respectively) was not statistically significant 
(t= .69, .2<p<.5). Therefore, the data 
supported the hypothesis that MZ twins are 
more similar in their descriptions of emotional 
experiences than either DZs or Sibs. However, 
the prediction of greater similarity among 
DZs as compared with Sibs was not sustained. 


Socioeconomic Status 


Since no relationship was found between the 
estimated social class rating and ESI (r= 
—.07), the observed difference in socioeco- 
nomic level between the MZs and the others 
need not be considered as having any signifi- 
cant bearing on the finding that heredity 
does play a part in determining the report of 
the subjective experience of emotions. 


Richness of Description 
Two measures of the richness of descrip- 
tions were made. The first compared the 
TABLE 3 


RICHNESS or DESCRIPTION: NUMBER OF DIFFERENT 
CATEGORIES USED PER PAIR 


MZ DZ Sib 
Group 
M SD M SD M SD 
Males | 30.3 8.9 | 33.1 9.0 | 37.0 | 9.3 
Females | 41.6 9.7 | 44.3 13.9 | 37.9 | 7.1 
t 2.87** 2.32" 
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total number of categories used over all emo- 
tions for each S with that of his co-twin or 
sibling. The correlations, .69 (MZ), .75 
(DZ), and .43 (Sib), indicated that there 
was a tendency for twins to use the same 
number of categories in their emotional de- 
scriptions. Both correlations for the twins 
were significantly different from zero (p< 
.02, two-tailed test). 

A more refined measure of richness of 
description was made by computing the total 
number of different categories used by each 
pair. Table 3 shows the means and standard 
deviations of these scores according to sex and 
group identity. Significant sex differences were 
found only among the twins. MZ differences 
were significant at the .01 level, while DZ 
differences reached the .05 level of signifi- 
cance. Group differences were not significant. 


Similarity of Activities 


Analysis of the Twin Activity Checklist 
(one score per pair equal to the total number 
of items answered by each in the direction of 
greater independence) revealed strong sup- 
port for previous findings that MZ twins form 
closer bonds of attachment and exhibit greater 
intrapair similarity with respect to habits and 
activities (Smith, 1965). The difference be- 
tween the mean of the MZs (8.54, ¢ = 4.03) 
and the mean of the DZs (12.29, o = 4.42) 
was significant at the .01 level (two-tailed 
test), indicating more independent activities 
among the DZ twins. The correlation between 
the Activity score and the ESI for each pait 
was computed, since the greater similarity of 
the emotional experience of MZ twins might 
be attributable to their spending more time 
together. The results (MZ, r = .01; DZ, 7= 
05; MZ+DZ, r=—.11) ruled out this 
possibility. 

Actual and Assumed Similarity of Personality 


It is clear from Table 4 that MZ twint 
strongly assume their twin’s personality an 
their own to be similar (t = 4.17, ? < 002 : 
DZ twins and Sibs did not differ in this "i 
spect. However, there were no differenc A 
among the groups with respect to Actual Ba 
larity, nor was there a significant differen 
between the MZ twins’ mean Assumed DE 
larity score and their Actual Similarity so" 
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The correlations between the ESI and the 
Actual Similarity scores were computed to 
determine if those pairs who describe them- 
selves most similarly also tended to experience 
emotions most similarly, but no relationship 
was found (MZ, r = .07; DZ, r = —.07; Sib, 
E05) 


WISC Comprehension Subtest 


Finally, in an attempt to see if the language 
habits of MZ twins are more alike regardless 
of what they are talking about, the number of 
identical words appearing in the twins' an- 
swers to the first 10 WISC Comprehension 
questions were counted. Twin A's answer to 
each question was compared with Twin B's 
answer to the same question, and the number 
of identical words (excluding a, an, and the) 
appearing in both answers was tabulated, and 
so on for all 10 questions for each pair of 
twins. The mean number of identical words 
per pair was 24.2 (MZ) and 25.5 (DZ) with 
Standard deviations of 11.5 and 13.8, respec- 
tively. Thus it can be concluded that in gen- 
eral, MZ twins did not tend to use the same 
words more often than DZs, and therefore 

|. the discovered similarity of their language of 
experience appeared to be independent of 
their general language habits. 


Discussion 


The results of this study lend support to 
the idea that heredity in part determines re- 
ported phenomenological experience. While 
the overall effect was relatively small, it is 
important to note that the ESI increased con- 
siderably with increased genetic similarity: 
the MZs’ mean ESI (20.1%) represented an 
increase of 3696 when compared with the 
DZs’ mean (14.7%) and an increase of 53% 
When compared with the mean of the Sibs 
(13.196). This was an average increase of 
about 45% in going from a group with an 
average of 50% of their genes in common 
(DZs and Sibs) to one with all of their genes 
the same (MZs). Ideally, of course, this study 
Would have included an additional control 
- Broup of unrelated pairs matched for age and 

Sex. 

This finding is consistent with the mounting 
evidence that affective experiences and be- 
havior are in part genetically determined 
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TABLE 4 
ASSUMED SIMILARITY AND ACTUAL SIMILARITY SCORES 


MZ DZ Sib 


Note.—N = 15 of 50 items. 
* p < 002, £ = 4.17, in comparison with DZ, 


(Freedman, 1965; Gottesman, 1963). In view 
of the fact that social introversion (Gottes- 
man, 1963), sociability (Scarr, 1965), and 
extraversion-introversion (Eysenck, 1956; 
Shields, 1962) have all been found to have a 
genetic basis, it seems reasonable to suppose 
that persons who differ with respect to this 
general trait of involvement with others ex- 
perience emotions differently. The highly so- 
ciable person, for example, might be more 
likely to report changes in his relationships 
with others during different emotional experi- 
ences, while the introverted, less sociable 
person might be more likely to focus on 
changes within himself. Introversion, further- 
more, has been found to be associated with 
strong internal inhibition (McConaghy, 
1962), and persons with a great capacity to 
delay gratification have been found to have 
a longer, future-time perspective than those 
whose internal inhibitions are weaker (Kahn, 
1967). Consequently, one might expect the 
subjective experience of MZ twins to reflect a 
more similar sense of time. There is also evi- 
dence that extraverts tend to outbursts (Mc- 
Conaghy, 1962). Again, these characteristics 
might well be reflected in descriptions of emo- 
tional experiences. 

A frequent criticism of the classical twin 
study is that the finding of greater intrapair 
similarity among MZ twins may be attributa- 
ble to greater environmental similarity 
rather than to heredity. The present study has 
found evidence which does not support this 
view. It remains to be shown, however, that 
those aspects of the MZs' environment which 
are more similar have a significant influence 
on their experience of emotions. Attributing 
MZ similarities primarily to environmental 
effects is too simplistic a solution. 

The problem of environmental similarity is 
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indeed a difficult one with which to deal, and 
critics of the twin method often suggest that 
parents of MZs create similarities between 
them by the very fact that they consider them 
to be identical, while the parents of DZ twins 
in considering them to be different help to 
create differences. 

A critical test of environmental bias is pos- 
sible by examining the behavior and attitudes 
of parents who are wrong about the zygosity 
of their twins. Scarr (1968) has done just 
this, and by comparing them with those of 
parents of correctly identified twins, she was 
able to separate their reactions to similarities 
and differences based on genetic relatedness 
from those arising from their belief that their 
twins should or should not be similar. 

Though Scarr’s sample was admittedly 
small (N = 12), strong trends were found in 
the data suggesting that genetic relatedness 
of twins determined the similarity of treat- 
ment by parents, rather than the parents’ 
belief as to the zygosity of their children. 
That is, MZ twins incorrectly considered to 
be DZ were treated more like correctly identi- 
fied MZ pairs by their mothers, while DZ 
twins misclassified MZ were treated more 
like correctly identified DZ pairs. 

Some investigators have found evidence 
that special differences exist in the environ- 
ment of identical twins (Eysenck & Prell, 
1951; Misbach & Stromberg, 1941). The 
presence of any of these factors can lead only 
to the underestimation of the effect of hered- 
ity. 

Similarity of reported emotional experience 
is closely linked to similarity of expressive- 
ness, and while it might be thought that emo- 
tional expressiveness is primarily learned 
through parent-child interaction and that for 
this reason MZ twins are more similar with 
respect to their subjective experiences, Bron- 
son (1966) has shown that this is not the 
case. She found that the level of emotional 
expressiveness in a child is not a function of 
parental attitude towards him, but depends 
on the kind of person the parent is. There- 
fore the ESIs may be considered to be inde- 
pendent of this trait. 

Two phenomena may be involved in the 
finding that female twins gave significantly 
richer descriptions than male twins. The well- 
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known greater verbal facility of girls is pre. 
cluded as an explanation, as is inhibition 
among the boys due to the female sex of the 
investigator, since among Sibs, there were no 
sex differences with respect to richness. In 
our present society there is considerable pres- 
sure on boys to repress or suppress their feel- 
ings. In many cases to be a man is to be 
emotionally controlled, and this tends to work 
toward decreased emotional sensitivity both 
toward oneself and others. Consequently, 
boys may not experience their feelings as 
freely as girls. 

The data suggested that richness of emo- 
tional experience is also increased by having 
a same-age peer in whom one can confide, and 
that this experience is usually more common 
among girls than boys because of social pres- 
sures. Thus each female twin has an unusual 
opportunity to share her experiences, and in 
so doing, she may enlarge her range of re- 
sponse not only through the addition of some 
of her twin's ways of reacting to her own 
repertoire, but also through the making ex- 
plicit of some of her heretofore unlabeled or 
unrecognized feelings. 
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Rate of decline of performance as a function of progressively weakened con- 
text was found to be no greater for schizophrenics than for normals. The 
variable of strength of association, however, was found to differentiate the 
verbal behavior of schizophrenics from that of normals; schizophrenics per- 
formed at a level comparable to normals when rewarded for choosing the 
strong associations of stimulus words but were significantly inferior to normals 
when rewarded for choosing weak associations. 


Chapman, Chapman, and Miller (1964) 
presented evidence that the verbal behavior 
of schizophrenics is characterized by an ex- 
cessive reliance on the strongest normal as- 
sociation to a word and a consequent neglect 
of a word’s weaker associations. They also 
suggest that the extent to which this tend- 
ency differentiates schizophrenics from nor- 
mals is determined by the strength of the 
contextual cues which serve to guide the 
associative response. When contextual cues 
which indicate the appropriate associative 
response are very strong, schizophrenic per- 
formance should closely approximate that of 
normals; when such contextual cues are re- 
duced, schizophrenic performance should 
deteriorate relative to that of normals. 

The two experiments reported here were 
attempts to examine the generality of the 
theory by using methods to define context and 
strength of association which are somewhat 
different from those of Chapman et al. 


EXPERIMENT I 


The first experiment examined the hy- 
pothesis that schizophrenics are unable to 
use weak contextual cues to the same degree 
as normal Ss. 


Method 


Materials. The method used to define context was 
cloze procedure (Taylor, 1953). Words were sys- 
tematically deleted from a passage, and the re- 
maining context provided cues to the deleted words. 


1 Requests for reprints should be sent to C. William 
Deckner, Department of Psychiatry, Vanderbilt Uni- 
versity, Nashville, Tennessee 37203, 


Either every tenth, eighth, or fifth word was deleted 
from three passages of first-grade difficulty and 
three passages of third-grade difficulty (Webster's 
Practice Readers, Grover & Bayles, 1942). Thus, a 
test unit was comprised of six different passages of 
three frequencies of deletion and two levels of 
difficulty. No passage appeared more than once in 
a test unit, and the order of the passages was 
counterbalanced. E 
Subjects. A group of 36 male schizophrenics 
divided into two groups of 18 on the basis of their 
scores on the Phillips Scale of Premorbid Adjustment 
(Phillips, 1953) and a control group of 18 male gen- 
eral medical patients were Ss. All Ss were Veterans 
Administration Hospital patients. The good premorbid 
and poor premorbid groups, respectively, had a mean 
of 7.00 and 9.93 yr. since first admission, Both 
groups were variable with regard to subclassifica- 
tion of schizophrenia. The mean IQ scores obtained 
on the Peabody Picture Vocabulary Test (Dunn, 
1954) by the normals, good premorbid schizop- 
phrenics, and poor premorbid schizophrenics were 
96.78, 101.33, and 99.05, respectively. Their mean 
ages were 37.83, 40.89, and 40.39, respectively. Pa- 
tients who were over 50 yr. of age, who had less than 
a sixth-grade education, or who were unable to 
read at the fifth-grade level were not used. No 
schizophrenic Ss employed had at any time het 
given any other psychiatric diagnosis. No nori 
Ss employed had been given psychiatric diagno’ 
of any kind. í 
Procedure. To begin, Ss were given a test 0 
reading ability, shown a sample passage, and given A 
warm-up passage to complete. They were M : 
"There are words missing from these short stories. 
Please make your best guess as to what the wor! 
should be and then write them back in the b 
spaces. Write only one word in each blank pe 
Following the warm-up passage, E answered bs 3 
cedural questions and instructed S to complete 
test passages. " 
The words supplied by Ss were given appropriate- 
ness scores of 2, 1, or O points by two by 
students who were trained on the judging tas bility 
established to have a high interjudge rely 
(r=.85). Neither judge was informed as !? 
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purpose of the study, and identifying data on the 
response sheets were in code. The sum of the 
judges’ ratings was used as the measure of the 
dependent variable for each deletion. 

Prediction. Schizophrenic performance was pre- 
dicted to be more impaired than that of controls 
by increases in contextual deletion. Schizophrenic 
performance, therefore, was expected to approximate 
normal performance most closely with every tenth 
word deleted (minimal deletion), and be most 
inferior to normal performance with every fifth 
word deleted (maximal deletion). 


Results 


Table 1 is the summary table of the analy- 
sis of variance of the total cloze scores. 

Most pertinent to the authors’ prediction 
is that while the main effect of frequency of 
deletion reached significance (p < .05), this 
variable did not differentially influence the 
performance of the three groups of Ss. The 
Groups X Deletions interaction was not sig- 
nificant (p> .15) which showed that the 
rate of decline as a function of frequency of 
deletion was probably no greater for schizo- 
phrenics than for normals. 

Of less importance, the overall level of 
performance of the schizophrenics was in- 
ferior. As is shown, the main effects of the 
groups variable (p< .05) and the level of 
difficulty (p < .01) were significant. The only 
interaction which closely approached signifi- 
cance was Level of Difficulty x Frequency of 
Deletion (p = .06). 

Perhaps a purer measure of the ability to 
use weak contextual cues is the number of 
2-point (maximum credit) words supplied. 
(One-point words were excluded in this analy- 
sis.) Using this dependent variable, the main 
effect of frequency of deletion was found to 
be highly significant (p < .01), but again 
the performance of the three groups was not 
differentially affected (Groups X Deletions 
interaction p > .15). Again the schizophrenics 
were generally inferior to the normals (p < 
.05), and the main effect of level of difficulty 
was significant (p< .01). No interactions 
reached statistical significance. 


EXPERIMENT II 


The second experiment was an attempt to 
demonstrate the schizophrenic’s hypothesized 
exaggerated reliance on the strongest associa- 
tion evoked by a word and relative neglect of 
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TABLE 1 


ANALYSIS OF VARIANCE OF CLOZE Task PERFORMANCE 
AS A FUNCTION OF Lever Or DirricULTY 
AND AMOUNT or DELETION 


Level of difficulty (B) 


B X Ss within grou 
Frequency of deletion (C) 


sae 
173 
C X Ss within groups 
BXC 2,96 
AXBXC 
B XC X Ss within groups 
*p <05, 
*> <01. 


a word’s weaker associations. A concept- 
formation task was constructed for this 


purpose. 
Method 


Materials. Materials consisted of the Minnesota 
Word Association Norms (Jenkins & Palermo, 1963) 
and two decks of 3 X 5 in. cards on which stimulus 
and response words were printed. 

Each deck consisted of 55 pairs of cards. On the 
first card of each pair was a stimulus word shown 
by the Minnesota norms to elicit a response with 
high commonality (e.g, NEEDLE). Immediately fol- 
lowing and completing the pair was a card with 
two response words, one of which accounts for the 
high commonality (e.g, "thread"). The other re- 
sponse word was one which the norms show is given 
with some frequency as an association to the stimulus 
word (eg, “nail”) but much less frequently than 
the alternative previously described, Strength of 
association, therefore, was defined in terms of 
cultural frequency of association (cf. Mednick & 
Devito, 1958). The two response words were the 
same part of speech, either both concrete or both 
abstract, and randomly positioned on the card, 

Subjects. The Ss were the same as those used in 
Experiment I. 

Procedure, Each S was seated across a table from 
E. With illustrations, the following instructions 
were given: 


All you have to do is look at a card with a 
single word on it. Then look at a second card 
which has two words on it and choose one of 
these two words by saying it aloud. One of the 
words is the nickel word. Each time you choose 
the nickel word, I will place a nickel on the 
table beside you. All the money you win will be 
yours to keep. 


There was no time limit and S was permitted to 
progress through the cards at his own rate, Every 
effort was made to have Ss feel as little pressure 


as possible. 
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TABLE 2 


ANALYSIS OF VARIANCE OF TRIALS TO CRITERION ON 
Concert FORMATION Task witH Two 
LEVELS OF ÁssOCIATIVE STRENGTH 


Source of variation df MS F 
Between Ss 53 
Normals, good PM, and poor 
PM (A 2| 1,322.01 8.69* 
Order of shift (B) 1 293.37 1.93 
AX 2 71.29 
Ss within Groups A and B 48 152.15 
Within Ss 4 
Strength of association (C) 1| 13,511.70 | 106.57* 
AX € 2 817.68 6.45* 
BXC 1 0,33 
AXBXC 2 395.36 3.12 
C X Ss with Groups A and B 48 126.79 


* p «.0t. 


For half of both the schizophrenic and normal 
groups the "nickel word" in the first deck of cards 
was the strong association to the stimulus word. 
For the other half groups the reward was given 
for choosing the weak association to the stimulus 
word. 

After each S had either reached the criterion of 
9 out of 11 correct choices or had progressed 
through the entire deck of 55 sets, E put aside 
the first deck and said: "That's fine. Now see if 
you can guess the nickel word with these cards." 
With the introduction of the second deck, a shift was 
required; Ss previously reinforced for choosing the 
strong associations were now reinforced for choos- 
ing the weak associations, and vice versa. 

As no stimulus word or response word was re- 
peated throughout the two decks, the task was not 
one of remembering which response was rewarded on 
a previous trial. In order to reach criterion, an S was 
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Fic. 1. Mean trials to criterion as a function of 
strength of association in good and poor premorbid 
schizophrenic and normal Ss. (G, triangles: good 
premorbid schizophrenics; P, squares: poor pre- 
morbid schizophrenics; N, circles: normals.) 
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assumed to have attained the concept of associative 
strength or at least some formulation related to it, 

Predictions. No difference was predicted between 
the schizophrenic and normal groups when rein- 
forcement was contingent upon choosing strong as- 
sociations, while a significant difference was predicted 
between the groups when reinforcement was con- 
tingent upon choosing weak associations. 


Results 


Table 2 is the summary of the analysis of 
variance on the number of trials (up to 55) 
required to reach criterion. The main effect of 
the strength of association variable was sig- 
nificant, indicating that all Ss performed more 
poorly when weak rather than strong associa- 
tions were reinforced (p < .01). Furthermore, 
as predicted, the interaction Groups X Strength 
of Association (p < .01) was significant. 

To clarify the source of the interaction, 
the differences among groups were analyzed 
separately at each level of strength of as- 
sociation by simple analysis of variance. When 
reinforced for responding to strong associa- 
tions, Ss were not different. However, group 
differences did occur (p < .01) when weak 
associations were reinforced (see Figure 1). A 
Neuman-Keuls test indicated that the schizo- 
phrenic groups, while not different from each 
other, performed more poorly than the normal 
group (p < .01). 

Of less importance, the groups main effect 
was significant (p < .01), indicating the not- 
mals to be superior to both schizophrenic 
groups. However, the generality of this find- 
ing is limited by the interaction finding al- 
ready described. The order of shift main effect 
and the other interactions did not rea 
significance. 


DISCUSSION 


Chapman, Chapman, and Miller 
suggest that the “meaning responses” elicit 
by words are only one form of associative 
response. In Experiment II it was found that 
schizophrenics manifest the predicted be- 
haviors not only in response to the stroni 
and weaker meanings of words, but also; : 
response to the stronger and weaker associa 
tions elicited by words of a variety of types 
The schizophrenics showed inferior pa 
ance relative to normals when rewarded 2 
choosing the weak associations but perio 
at a level comparable to normals when ™ 


(1964) 
elicited 
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warded for choosing the strong associations. 
This appears to provide a broader base of 
support for Chapman’s hypothesis. 

Contrary to prediction, in Experiment I it 
was found that the performances of schizo- 
phrenics and normals decline uniformly with 
increases in frequency of deletion. This sug- 
gests that of the two variables specified by 
Chapman et al., schizophrenic verbal behavior 
may be primarily determined by the strength 
of association variable. The variable of context 
may not differentiate schizophrenics from 
normals at all, or it may operate only in inter- 
action with strength of association. Since the 
two variables were measured independently in 
the two experiments such an interaction could 
not occur and the effect of context would not 
be demonstrated. 

It is also possible that the present defini- 
tion of context is too general, and that ‘“con- 
textual constraint” is provided, in Chapman’s 
sense, only by certain classes of words; for 
example, adjectives and nouns might con- 
strain meaning in a different way from prepo- 
sitions or conjunctions. Since Chapman’s defi- 
nition of constraint does not specify such fac- 
tors, they remain a matter for further study. 

The inconsistency of the findings with the 
cloze task with those of Chapman also sug- 
gests the relevance of attentional dysfunc- 
tion theory in schizophrenia (Silverman, 1964; 
Venables, 1964). Chapman’s multiple-choice 
tasks emphasize the ability to use critical 
cues which are spatially separated from the 
point of associative choice. For example, 
“farmer” and “cows” are the critical con- 
textual cues in the following item. 


The farmer has too many cows and must 
buy a new pen. 
The word pen in this sentence means: 


1. Pick-up truck 
2. Writing implement 
3. Fenced enclosure 


In contrast, the critical cues in a cloze task 
are very likely to be in close proximity to the 
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point of associative choice; that is, the words 
adjacent to the blanks are those most likely 
to be decisive and also those immediately pre- 
sent in the stimulus field (see Cromwell & 
Dokecki, 1968). An impairment of at- 
tention in schizophrenia would make this 
difference between the two experimental tasks 
critical for differences in performance; the 
same results could not necessarily be expected 
from the two procedures. However, the dis- 
crepancy which occurs points to the need 
for greater specification of the types of con- 
textual constraints which differentiate the 
verbal behavior of schizophrenics from that 
of normals. 
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COGNITIVE LEVEL AS A FUNCTION OF 
DEFENSIVE ISOLATION * 


LILA LOWENHERZ snp MELVIN FEFFER ? 
Yeshiva University 


In previous studies the cognitive structuring of interpersonal content has been 
assessed by means of a role-taking task (RTT) which requires the individual 
to tell a standard Thematic Apperception Test (TAT) story to an ambiguous 
picture and then to retell the story from the perspective of each of his TAT 
figures. The present study evaluated the possibility that a failure to coordinate 
perspectives on the RTT occurs when the TAT figures represent defensively 
isolated aspects of the self-structure. The Ss were given the RTT under two 
different conditions. Under one condition, Ss were assigned defensively isolated 
attributes, asked to tell a story built around TAT figures endowed with these 
attributes, and then required to take the perspectives of these figures in their 
role taking. The other condition differed in that the assigned attributes were 
not defensively isolated. The Ss showed more difficulty in coordinating perspec- 
tives under the defensive isolation condition than under the nonisolated condi- 


tion. 


Piaget (1950) has characterized cognitive 
development as a reorganization of behavior 
in which focusing sequentially upon single 
aspects of the experiential field becomes 
subordinated to focusing upon a number of 
aspects of the field in relation to one another. 
Exemplifying this conception is Piaget’s 
(1950) classic study of the child's judgment 
of quantity in which an invariant amount of 
liquid is poured successively into containers 
of different shapes. Piaget attributed the 
younger child's lack of conservation to his 
focusing upon one dimension of the situation 
at a time, for example, attending first to 
changes in the height of the liquid while 
neglecting compensatory changes in the cir- 
cumference of the container and then shifting 
attention to circumference while neglecting 
height. By the same token, he viewed the 
older child's attainment of conservation as a 
function of his ability to coordinate changes 
along both dimensions, that is, to consider 
changes in height in relation to changes in 


1This article is derived from the first author's 
doctoral dissertation, completed at Yeshiva University 
in 1966 under the second author's supervision. The 
study was supported in part by Public Health 
Service Research Grant MH-07506 from the Na- 
tional Institute of Mental Health. 

? Requests for reprints should be sent to Melvin 
Feffer, Department of Psychology, Yeshiva Uni- 
versity, 55 Fifth Avenue, New York, New York 
10003. 


circumference. His position would suggest 
that a defining characteristic of “sequential” 
decentering in contrast to “relational” de- 
centering is isolation or lack of contact be- 
tween relevant aspects of experience. 

In general, the decentering concept has been 
restricted to an interpretation of impersonal 
cognitive functioning. An extension of the 
decentering concept to the interpersonal area, 
however, has been suggested (Feffer, 1967) 
and evaluated empirically by means of à 
projective role-taking task (RTT). In the 
RTT, S is first required to tell a standard 
Thematic Apperception Test (TAT) story to 
an ambiguous drawing. He is then asked to 
retell the story from the viewpoint of each 
of the figures in his story. Decentering level 
has been measured in terms of the degree to 
which S is able to focus upon his initial story 
from the particular perspective of each of his 
figures while at the same time maintaining 
continuity between his different versions of 
the story. This measure of the ability io 
coordinate different perspectives has been 
found to be associated with decentering a 
tivity on impersonal conservation and part- 
whole tasks (Candell, 1965; Feffer & Gout 
vitch, 1960), chronological age (can 
1965; Feffer & Gourevitch, 1960; Worm 
1963), and effectiveness of social interactio 
(Feffer & Suchotliff, 1966). 


352 


COGNITIVE LEVEL: FUNCTION oF DEFENSIVE ISOLATION 


In the present study, the projective nature 
of the RTT itself becomes the focus of in- 
quiry in that the possibility is raised that a 
dynamic factor is involved in S’s ability to 
coordinate the different perspectives of his 
projected figures. This possibility is outlined 
in the following rationale: In constructing 
stories involving ambiguous TAT figures, in- 
dividuals may typically express a variety of 
self-attributes, some of which have been 
defensively isolated as a means of avoiding 
the arousal of anxiety. In being required, in 
role taking, to assume the perspectives of the 
various figures in his stories, the individual 
is by that token required to act as the subject 
of these defensively isolated attributes. In a 
sense, then, he is forced to behave as though 
these attributes were syntonic aspects of the 
self. Under these circumstances, a reactivated 
defense against a possible confrontation with 
the anxiety-provoking attribute could take the 
form of isolation or lack of coordination be- 
tween the various perspectives involved. The 
present study was directed toward an hy- 
pothesis which follows from this rationale, 
namely, that a failure to coordinate perspec- 
tives will occur when the TAT figures rep- 
resent defensively isolated aspects of the self- 
structure. 


METHOD 


The procedure in the first part of the study was 
directed toward determining for each individual, 
those attributes which were defensively isolated 
and those attributes which were not defensively 
isolated. In the second part of the study, Ss were 
given the RTT under two different conditions. 
Under one condition, Ss were assigned their de- 
fensively isolated attributes, asked to tell a story 
built around TAT figures endowed with these char- 
acteristics, and then required to take the per- 
spectives of these figures in their role taking. The 
other condition differed in that the assigned at- 
tributes were not defensively isolated. A com- 
parison of decentering level under these two con- 
ditions constituted the evaluation of the study’s 
hypothesis, It was predicted that a greater degree 
of sequential decentering in role taking would be 
evidenced under the condition of defensive isolation 
than under the nonisolation condition. 


Measure of Defensive Isolation 


The projective assumption was basic to the mea- 
sure of defensive isolation; that is, any personality 
characteristic which S ascribed to a TAT figure was 
assumed to be an attribute of the self. If, further, 
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S reacted to this attribute as being unacceptable and 
unlike the self, this was taken as an indication that 
the characteristic was defensively isolated from the 
rest of the self-structure. 

The method for assessing defensive isolation, as 
so defined, followed a modified version of a pro- 
cedure described in Pile, Mischel, and Bernstein 
(1959). A group of female second-year college stu- 
dents were shown seven stimulus cards projected 
onto a screen by an opaque projector. Five of the 
pictures were TAT cards (Numbers 4, 10, 12F, 
13MF, and 18GF), and two were based upon the 
Symonds Picture Story Test (Numbers B3 and B8). 
The Ss were told that their stories were being col- 
lected as part of a larger study attempting to find 
a simple, reliable method of assessing creativity 
from the TAT. They were also told that selected 
Ss would be asked to return for futher testing. 
Standard TAT instructions were used and 6 min. 
were allowed for each story. 

The first author analyzed each TAT story for at- 
titudinal content and, for each TAT figure, con- 
structed one statement which best represented S's 
characterization of that figure. The criteria? used 
in constructing statements have been described in 
detail in Lowenherz (1966) and were directed 
toward insuring that statements were similar in 
respect to clinically relevant variables, These criteria 
are summarized as follows: 


1, The attributes of each TAT figure were drawn 
from that part of the story concerned with events 
taking place in the present. 


Example of TAT content. I think he was faced 
with a great disappointment. . . . He seems to 
feel that he is a failure and is not worthy of his 
wife's love . . . 

Derived Statement. He seems to feel that he is 
a failure and is not worthy of his wife’s love. 
[The quality of disappointment takes place in 
the past and is not used in the construction of 
the statement.] 


2. Statements about the TAT figure were drawn 
from the story directly, rather than from attributes 
which were ascribed to the figure by other characters 
in the story. 


Example of TAT content. The husband was 
fired from his job. He's angry and barks at his 
wife, claiming that she was the one who caused 
his failure. In the picture, one sees the wife, who 
loves her husband dearly and doesn't like to see 
him feeling so defeated and worthless . . . 

Derived statement. When angry, he blames 
others for his misfortunes. [This is chosen in 
preference to he feels defeated and worthless 
because the latter are feelings reflected through 
another character.] 


3' There was agreement in 35 out of 40 instances 
(87%) with another person who independently con- 
structed statements from the stories in terms of these 
criteria. 
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3. Attributes implicit in the story were used only 
when S did not explicitly and directly ascribe quali- 
ties to the TAT figures. 


Example of TAT content. Laurie and Peter have 
only been married for a few months. Laurie has 
just returned from shopping and has found 
Peter lying at the bottom of the stairs... . He 
tells her that his back is broken and to call a 
doctor. . . . The doctor comes and Peter is taken 
to the hospital but his spinal cord was servered 
and he becomes an invalid and Laurie has to 
support them. 

Derived statement. He is helpless and needs to 
be cared for. [No explicit quality has been ascribed 
to the male figure, hence an inference is made.] 


4. Where appropriate, the statements were made 
reciprocal Thus, “He is helpless and needs to be 
cared for" is complemented by the statement "She 
is helpful and supporting," for the female figure in 
the story aforementioned. 

5. Statements were all changed to designate fe- 
males. Thus, the statements illustrating Criteria 2 
and 3 were changed to "When angry she blames 
others for her misfortunes” and “She is helpless 
and needs to be cared for." 


Each S produced stories from which it was pos- 
sible to construct statements characterizing between 
6 and 12 TAT figures. For those Ss who produced 
less than 12 statements, “fillers” were randomly as- 
signed from the total pool of statements so that 
there were 12 statements per S. The statements were 
typed on ditto stencils and run off; thus, although 
individually tailored, each packet of statements had 
the appearance of a group inventory. 

About 5 wk. after the first session, Ss were again 
seen in a group and were told that they were going 
to be given a different type of task. Each S was 
given her individually tailored packet of statements 
and was asked to rank these from "most like me" 
to “least like me.” In addition, each was asked to 
dichotomize an identical set of statements into the 
categories of “personally acceptable" and “not per- 
sonally acceptable. There was no indication that 
any of the Ss saw any connection between the state- 
ments and the TAT which had been administered 
earlier. 

An attribute was defined as defensively isolated 
if S considered it as "not personally acceptable" 
and had placed it in either of the two most extreme 
rankings in the "less like me" direction. A non- 
isolated attribute was defined as one which S con- 
sidered "personally acceptable" and which she had 
placed above the midpoint of her rankings in the 
“more like me" direction. Where possible, a pair of 
isolated and a pair of nonisolated attributes were 
selected for each S. Each attribute of the pair was 
drawn from a separate TAT card. Attributes which 
served as fillers for a given S were not defined as 
either isolated or nonisolated for that S. 


Role-Taking Conditions 


Twenty-nine Ss, whose TAT productions and 
rankings met the above criteria, were called back 
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6 mo. later. On the opaque projector, they were 
shown a new stimulus card depicting an old woman 
and a young girl facing each other in an ambiguous 
relationship. All Ss were asked to make up two 
stories for this card. As the basis for each story, 
S was given one of her pairs of attribute statements, 
She was asked to assign each statement of the pair 
to whichever figure she wished and to construct a 
standard TAT story built around the figures so 
endowed. After both stories were completed, she 
was then required to take the roles of her characters 
following the standard RTT procedure. 

The S was given two booklets which had been 
individually prepared for her. The two attribute 
statements were pasted onto the top of the first 
page of each booklet. Although all Ss were treated 
in the same manner during the testing, the booklets 
were numbered so that half of the Ss made up 
stories for the pair of defensively isolated attributes 
first, and the other half for the pair of nonisolated 
attributes first. For each pair of attribute state- 
ments the instructions were as follows: 


In this test for creative imagination, you are 
again to make up a dramatic story about a 
picture which has two characters in it. However, 
this time there is an additional instruction. At- 
tached to the first page of each booklet are two 
qualities or characteristics. In making up your 
story, you must assign one quality to each 
character, 

Tell what led up to the event shown in the 
picture, describe what is happening at the moment, 
and tell how it will end. Include what each char- 
acter is feeling and thinking. You will have about 
five minutes for each story. 


After they had finished writing the initial stories, 
Ss were given the role-taking part of the task; 
that is, they were asked to go back and retell each 
story from the perspective of each of the two char- 
acters in the story. For the first story, the role of 
the young woman was assigned first; and for the 
second story, the role of the old woman was as- 
signed first. Detailed instructions for group a 
ministration of the RTT are given in Feffer and 
Suchotliff (1966). 

RIT decentering categories. As has been noted, 
role-taking performance is evaluated in terms pu 
degree to which S is able to refocus upon his ae 
story from the perspectives of his characters while al 
the same time maintaining continuity between these 
various versions of the initial story. The scoring 
categories * of the RTT are ordered along 4 di- 


4A detailed description of the RTT procedure a 
scoring criteria is available in mimeographed fo 
from the authors or for a fee from the Ane 
Documentation Institute. Order Document No. 9 itis 
from ADI Auxiliary Publications Project, Pg a 
duplication Service, Library of Congress, Wasa 
ton, D. C. 20540. Remit in advance $2.50 for oe 
film or $6.25 for photocopies and make checks Pry 
able to Chief, Photoduplication Service, Li 
of Congress. 


COGNITIVE LEVEL: FUNCTION or DEFENSIVE ISOLATION 


TABLE 1 


SIGNIFICANCE TEST FOR DECENTERING UNDER THE 
DIFFERENT CONDITIONS 


Mean RTT scores 


N z p 
Defensive | Nonisola- 
isolation | tion condi- 
condition tion 
11.5 144 29 3.0 -001 


a Mean scores are presented for descriptive purposes and were 
not used as a basis for statistical inference. They were derived 
by assigning weights to the role-taking task (RTT) categories. 

b Probability estimate is one-tailed (Wilcoxon Matched 
Pairs Signed Ranks test). 


mension of increasing subtlety of coodination be- 
tween S’s versions of the initial story. Categories at 
one extreme reflect obvious inconsistency or dis- 
continuity between the characters’ viewpoints as, 
for example, S's describing the woman as “angry” 
in the initial story and as “happy” when taking 
the woman’s role (simple refocusing). At the other 
extreme, the highest category (change of perspec- 
tive) reflects an overall synthesis of two different 
perspectives. For example, S in the role of one 
woman may describe herself from an internal 
orientation and the other from an external view- 
point: “She feels cheated because she thought that 
the landlady had agreed not to raise her rent, but 
now she went ahead and raised it.” In taking the 
role of the other woman, the inner and outer 
orientations are reversed, but at the same time co- 
ordinated: “The landlady could see that she was 
angry, but she had no choice—expenses had gone 
up.” 

Role-taking performance was scored in terms of 
the highest category attained in each story situation. 
Interjudge reliability was determined by the degree 
of correlation between the scoring of the first and 
second authors. The Spearman rank difference cor- 
relation was +.69 (N =20, p= < .01), Further in- 
formation relevant to interjudge and test reliability 
may be found in Feffer (1959), Feffer and Gourevitch 
(1960), and Feffer and Jahelka (1968) . 


RESULTS 


As may be seen in Table 1, RTT scores are 
significantly lower under the defensive isola- 
tion condition than under the nonisolation 
condition. In terms of the measures used, 
then, Ss evidenced a greater degree of se- 
quential decentering when required to take 
the viewpoints of defensively isolated at- 
tributes than when required to take the view- 
points of nonisolated attributes. 

Since the decentering measure as well as 
the judgments of defensive isolation are based 
upon stories written by S, the possibility was 
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considered that the findings were influenced 
by a productivity variable. For example, 
being required to deal in writing with self- 
attributes which are unacceptable and/or 
unacknowledged may result in the desire to 
escape from the situation by writing less, 
this in turn resulting in a lower level of RTT 
functioning. In order to examine this pos- 
sibility, verbal productivity under the various 
conditions was assessed. No significant differ- 
ences were found in the number of words pro- 
duced under the different conditions. 


DISCUSSION 


The present findings can be interpreted as 
supporting the hypothesis that a failure to 
coordinate perspectives occurs when these 
perspectives represent defensively isolated 
aspects of the self-organization, This inter- 
pretation is based, in part, upon the accept- 
ance of the projective assumption that any 
personality characteristic which the individual 
ascribes to unstructured material is an at- 
tribute of the self. Clearly, however, inter- 
pretations other than those based upon the 
projective assumption can be advanced as, 
for example, the possibility that Ss might 
have had difficulty in role taking solely be- 
cause they were required to assume view- 
points which they disliked. In addition, it 
should be noted that even if one were to ac- 
cept the projective assumption, it does not 
follow that the present findings are interpret- 
able only in terms of the anxiety-defense 
paradigm set forth in the rationale. One could 
argue with equal justification that forcibly 
confronting the individual with an anxiety- 
provoking attribute in role taking produces 
not a defensive reaction of isolation, but a 
disorganized, undifferentiated response which 
is directly manifested in a lack of coordination 
between perspectives. 

These problems obviously require further 
study, Should such research, however, con- 
solidate the position that defensive isolation 
is basic to a difficulty in “conserving” role 
relationships on the RTT, then this would 
focus attention on a question of considerable 
import; namely, what implication does this 
type of isolation have for the development 
of conservation in general, including the con- 
servation of impersonal reality? 


| 
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Within Piaget's framework, the conditions 
for the development of conservation have been 
articulated in terms of an equilibration model. 
Essentially, this model suggests that the in- 
dividual seeks contact with the environment 
in order to consolidate modes of organizing 
experience. Given the inherent properties of 
an average environment, however, the in- 
dividual encounters internal contradictions 
between the ways in which he has previously 
organized experience. This leads to a cognitive 
reorganization whereby the internal conflict is 
resolved; the cycle then repeats itself. 

Diametrically opposed to the internal 
dialectic of Piaget's equilibration process are 
the assumptions of an external reinforcement, 
drive-reduction model which, as White (1959) 
has systematically noted, underlie the anxiety- 
defense paradigm. The contrast between the 
equilibration and drive-reduction models has 
been further underscored by the implication 
(Schachtel, 1954; White, 1959; Wolff, 1960) 
that they refer to essentially antagonistic 
modes of behavior. These writers have ob- 
served that it is precisely when strong drives 
are not in operation that the organism is 
directed toward the exploration of new facets 
of the environment. In the presence of strong 
motivation, including anxiety, exploratory be- 
havior gives way to a narrow form of learning 
specifically directed toward reduction of the 
high drive state. Such behavior, in supplant- 
ing exploration, may thus isolate the in- 
dividual from further contact with new, po- 
tentially contradictory, dimensions of experi- 
ence necessary to further development. 

These conceptions would imply that ex- 
ploratory behavior is necessary for the de- 
velopment of conservation in both impersonal 
and interpersonal realms. It seems reasonable 
to assume, however, that the world of inter- 
personal relationships differs from the world of 
objects in having a greater potential for 
eliciting strong motive and anxiety states. 
Therefore, there may be a greater number 
of occasions in the interpersonal sphere for 
exploratory behavior to give way to a drive- 
reducing, isolating form of learning. From 
this viewpoint, defensive isolation can be 
seen as behavior, which in supplanting ex- 
ploratory activity, prevents the development 
of conservation in particular role relationships. 
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The extension of the decentering concept to 
the interpersonal area provides a framework 
for evaluating this suggested relationship be- 
tween isolation due to dynamic, defensive 
factors, and the development of conservation, 
For example, one could compare training 
procedures based on "cognitive conflict" to 
those based on external reinforcement with 
respect to their relative effectiveness in induc- 
ing conservation on both the RTT and im- 
personal tasks (cf. Gruen, 1965; Smedslund, 
1961a, 1961b; Wohlwill & Lowe, 1962). In 
addition, levels of motivation and anxiety 
could be varied in order to evaluate whether 
the effectiveness of cognitive conflict train- 
ing is more readily disrupted on the RTT 
than on impersonal tasks. Such studies in 
providing information regarding the sug- 
gested relationship between dynamic factors 
and the equilibration model would, by the 
same token, be relevant to Wolíf's (1960) 
conceptualization of the relationship between 
the classical Freudian view of development 
and Piaget’s “conflict-free” approach to the 
problem. 
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RESPONSES OF PHENYLKETONURIC CHILDREN ON 
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A task which required continuous attending was given to 11 phenylketonuric 
children and 11 nonphenylketonuric control Ss (matched for age, IQ, sex, and 
race). The phenylketonuric Ss made significantly fewer correct responses and 
significantly more late responses than their matched controls. The performance 
of the phenylketonuric Ss also changed significantly over the sequence of 
stimulus presentations, with fewer correct responses toward the end. 


The evidence that has been collected to 
date indicates that phenylketonurics treated 
early (prior to 4 mo. of age) do not attain an 
intelligence level commensurate with that of 
their unaffected siblings (Berman, Waisman, 
& Graham, 1966). It has not yet been deter- 
mined whether further modifications in dietary 
treatment would eliminate the discrepancy. A 
further question which is raised by the re- 
sults of the Berman et al. study is whether 
deficits are present in specific areas of func- 
tioning and, if there are deficits, whether they 
are shared by both treated and untreated 
phenylketonurics, 

The results of a study on delayed response 
in phenylketonurics (Siegel, Anderson, Fisch, 
& Brown, 1967)* suggested that phenylketo- 
nurics were more affected by increased delay 
on a color-form task than their matched con- 
trols. More errors were made by both treated 
and untreated phenylketonurics. The indica- 
tion that there may be a specific deficit in at- 
tention span led to the present study with the 
purpose of using a different method to investi- 
gate attention in order to help clarify the 
extent of the deficit, 

The method used in the present study was 
based on that of Rosvold, Mirsky, Sarason, 
Bransome, and Beck (1956) and Mirsky and 
Van Buren (1965). In order to detect lapses 


1 This investigation was supported by Research 
Grant HD 01396 from the National Institute of 
Child Health and Human Development. 

? Requests for reprints should be sent to V, Elving 
Anderson who is now at the Dight Institute for Hu- 
man Genetics, Zoology Building, Minneapolis, Min- 
nesota 55455. 

3 F. S. Siegel, V. E. Anderson, R. O. Fisch, & B. 
W. Brown, Unpublished manuscript, 1967. 
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in attention Rosvold et al. developed 
which required continuous attending. 
present study the task was modified sine 
original task required the ability to re 
discriminate letters of the alphabet, W 
some of the 5s were not able to do. 


METHOD 
Subjects 


The experimental sample of 11 phenylketonum 
was obtained through the Phenylketonuric Clin 
the University of Minnesota Hospitals (Tabl 
All experimental Ss were considered to have cl 
phenylketonuria since they met the following cfi 
(a) They all had elevated fasting serum p 
nine levels (21-43 mg. 26) while on a regulai 
(b) They had normal or low serum-tyrosine 
on tests during the previous year. (c) When hi 
an elevated serum-phenylalanine level, they exe 
in the urine an increased amount of phenylal 
and its metabolites. 

In addition, 7 of the 11 Ss had sibs with & 
nosis of phenylketonuria at the time of study 
Ss 1E, 12E, and 14E (Table 1) were sibs. 
7E, 8E, 11E, and 13E were from different fal 
but had phenylketonuric sibs not included jl 
study. None of the other Ss had phenylket 
relatives. 

The ages at which a low-phenylalanine d 
initiated or discontinued are also shown in TI 
The effectiveness of dietary control at the 
the study was rated according to the criteri 
by Koch, Acosta, Fishler, Schaefer, and Wi 
(1967). 

A control S matched for age, sex, IQ, am 
with each experimental S was obtained throug 
Minneapolis Public Schools (Table 1). Am 
tional requirement was that control Ss have 
vious motor difficulty such as cerebral 
other physical impairment. Ss 1C, 2C, 6C, 
10C had IQs over 80. The medical diagn 
S 12C was Down’s syndrome. The medical f 
for the remaining control Ss did not provided 
mation that would help to establish the eU 
their retardation. 


IE 1C F 
?E 2C M 
4E 4C M 
6E 6C M 
TE 7C M 
8E 8C M 
10E 10C F 
HE 11C F 
NE 12C M 
IE 13C M 
ME 14C F 


Note. —E = experimental Sa; C = control Ss, 
* WISC; all others by Stanford-Binet. 


Apparatus 
Eight picture vocabulary items selected from the 
1960 Stanford-Binet revision were made into 2X2 
in, slides. The slides were projected by a Carousel 
Projector onto a screen 9X6 in. and were auto- 
matically presented at 1,9-sec, intervals. The slide 
| WM exposed 1,2 sec. with a dark interval of .7 sec. 
| screen was mounted on the surface of a wooden 
tnelosure 19 X 14 X 13 in, The slides were projected 
Saa the rear, thus hiding the projector from S's 


Through the use of photoelectric cells and relays it 
Was possible to record five types of responses; the 
Correct response and four types of errors, The 
and the five counters were in a separate unit 
in view of E. 

The S was seated 4 ft, away from the screen and 
Ë was seated behind S and out of his line of vision. 
S held a plastic unit 18: 3X 1 in. on which 
the response button was mounted. 


Procedure 


All Ss were familiar with the picture items from a 
Previous test, The first slide presented showed a 
and the following instructions were given: 


Tam going to show you some pictures, One of the 
ures is a tree, Every time you see a tree press 

the button, [E handed the switch to S and ob- 

tved S to determine that the correct 

Wis used] Let's try a few. Remember, press the 

button every time you see a tree. 


Following the instructions 10 slides were presented 

eluding 3 Possible correct stimuli, When $ re- 

Pitted correctly to all stimuli, testing was begun. 
latal of 120 slides, including 30 correct 


Permission to make these slides was granted by 
the Houghton-Miffiin Company. 


PKU CHILDREN ON A Continuous PERFORMANCE Test — 


TABLE 1 
CHARACTERISTICS OF THE EXPERIMENTAL AND CONT&OL Sunjects 
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A task which required continuous attending was given to 11 phenylketonuric 
children and 11 nonphenylketonuric control Ss (matched for age, IQ, sex, and 
race). The phenylketonuric Ss made significantly fewer correct responses and 
significantly more late responses than their matched controls. The performance 
of the phenylketonuric Ss also changed significantly over the sequence of 
stimulus presentations, with fewer correct responses toward the end. 


The evidence that has been collected to 
date indicates that phenylketonurics treated 
early (prior to 4 mo. of age) do not attain an 
intelligence level commensurate with that of 
their unaffected siblings (Berman, Waisman, 
& Graham, 1966). It has not yet been deter- 
mined whether further modifications in dietary 
treatment would eliminate the discrepancy. A 
further question which is raised by the re- 
sults of the Berman et al. study is whether 
deficits are present in specific areas of func- 
tioning and, if there are deficits, whether they 
are shared by both treated and untreated 
phenylketonurics. 

The results of a study on delayed response 
in phenylketonurics (Siegel, Anderson, Fisch, 
& Brown, 1967)? suggested that phenylketo- 
nurics were more affected by increased delay 
on a color-form task than their matched con- 
trols. More errors were made by both treated 
and untreated phenylketonurics. The indica- 
tion that there may be a specific deficit in at- 
tention span led.to the present study with the 
purpose of using a different method to investi- 
gate attention in order to help clarify the 
extent of the deficit, 

The method used in the present study was 
based on that of Rosvold, Mirsky, Sarason, 
Bransome, and Beck (1956) and Mirsky and 
Van Buren (1965). In order to detect lapses 


1This investigation was supported by Research 
Grant HD 01396 from the National Institute of 
Child Health and Human Development. 

? Requests for reprints should be sent to V. Elving 
Anderson who is now at the Dight Institute for Hu- 
man Genetics, Zoology Building, Minneapolis, Min- 
nesota 55455. 

3 F. S. Siegel, V. E. Anderson, R. O. Fisch, & B. 
W. Brown. Unpublished manuscript, 1967. 


in attention Rosvold et al. developed a task 
which required continuous attending. For the 
present study the task was modified since the 
original task required the ability to readily 
discriminate letters of the alphabet, which 
some of the Ss were not able to do. 


METHOD 
Subjects 


The experimental sample of 11 phenylketonuric Ss 
was obtained through the Phenylketonuric Clinic at 
the University of Minnesota Hospitals (Table 1). 
All experimental Ss were considered to have classical 
phenylketonuria since they met the following criteria: 
(a) They all had elevated fasting serum phenylala- 
nine levels (21-43 mg. %) while on a regular diet. 
(b) They had normal or low serum-tyrosine values 
on tests during the previous year. (c) When having 
an elevated serum-phenylalanine level, they excrete 
in the urine an increased amount of phenylalanine 
and its metabolites. s 

In addition, 7 of the 11 Ss had sibs with a diag- 
nosis of phenylketonuria at the time of study. 1d 
Ss 1E, 12E, and 14E (Table 1) were sibs. Ss 4B 
7E, 8E, 11E, and 13E were from different families 
but had phenylketonuric sibs not included in the 
study. None of the other Ss had phenylketonune 
relatives, i 

The ages at which a low-phenylalanine 
initiated or discontinued are also shown in ^ $ 
The effectiveness of dietary control at the time 
the study was rated according to the criteria m 
by Koch, Acosta, Fishler, Schaefiler, and Wo 
(1967). 


e diet was 
Table 1. 


x, IQ, and race 
A control S matched for age, sex, hrough the 


with each experimental S was obtained tl ddi- 
Minneapolis Public Schools (Table 1). An 20v. 
tional requirement was that control Ss 
vious motor difficulty such as cerebral pi sni 
other physical impairment. Ss 1C, 2C, 6C, 5 js for 
10C had IQs over 80. The medical diagnose rds 
S 12C was Down's syndrome. The medical "nfor 
for the remaining control Ss did not provide gy 9 
mation that would help to establish the etiolo 
their retardation. 
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TABLE 1 
CHARACTERISTICS OF THE EXPERIMENTAL AND CONTROL SUBJECTS 
waste A Age at study 19 Special diet in experimental Ss 
pall 

E c E c Age on Age off Eum 
1E 1C F 6-6 6-2 95 95 12 
2E 2C M 7-0 6-11 92 94 A ai Good 
4E 4C M 8-4 8-1 76 76 6-10 7-11 
6E 6C M 8-7 8-7 104 107 0-9 Good 
iE 7C M 10-7 10-0 75 74 0-6 9-0 j 
8E 8C M 10-8 10-9 85 85 0-2 Good 
10E 10C F 11-2 11-6 92 96 2-0 Excellent 
ME 11C F 11-10 12-4 61 69 1-4 5-0 
12E 12C M 13-5 13-2 43 42 Never on diet 
13E 13C M 13-9 13-7 78* ae 11-6 12-9 
ME 14C F 15-9 16-2 50 48 Never on diet 
M 10- 8 10-7 714 784 


Note.—E = experimental Ss; C = control Ss. 
+ WISC; all others by Stanford-Binet, 


Apparatus 


Eight picture vocabulary items selected from the 
1960 ‘Stanford-Binet revision were made into 2 X2 
in, slides.4 The slides were projected by a Carousel 
Projector onto a screen 9X6 in. and were auto- 
matically presented at 1.9-sec. intervals. The slide 
Was exposed 1.2 sec. with a dark interval of .7 sec. 
The screen was mounted on the surface of a wooden 
pete 19 X 14 X 13 in. The slides were projected 
on the rear, thus hiding the projector from S’s 
pu the use of photoelectric cells and relays it 
E pe to record five types of responses: the 
ti ect response and four types of errors. The 
pur and the five counters were in a separate unit 
n view of E, 

Es Was seated 4 ft, away from the screen and 

The S SMS behind S and out of his line of vision. 

E^ eld a plastic unit 18 X 3 X $ in. on which 
sponse button was mounted. 


Procedure 


o B Ss were familiar with the picture items from a 
tree, lous test. The first slide presented showed a 
» and the following instructions were given: 


Sm going to show you some pictures. One of the 

the Bits, a tree. Every time you see a tree press 

erven get: [E handed the switch to Sand ob- 

Bis is Ws determine that the correct procedure 

butt a Let’s try a few. Remember, press the 
on every time you see a tree. 


F Y 
Bh 5 Ms instructions 10 slides were presented 
sponded: Possible correct stimuli. When S re- 
tota] Correctly to all stimuli, testing was begun. 
Of 120 slides, including 30 correct stimuli, 


"E 
t Hen to make these slides was granted by 
ughton-Mifflin Company. 


were presented. In this series S’s responses were 
recorded as correct, incorrect, or late. A late re- 
sponse was one which occurred between slide presen- 
tations, that is, in the blank interval. Upon com- 
pletion of the tree (T) task S was given a rest 
period and allowed to leave his chair. 

When testing was resumed, the following instruc- 
tions were presented. "Now we are going to do 
something a little bit different. Press the button 
when you see a tree but only if the tree comes right 
after a ball. If you see a ball and then a tree, press 
the button. Let’s try a few.” Eleven pretest slides 
were presented which included three correct combi- 
nations plus possible errors such as a tree not 
preceded by a ballor a ball not followed by a tree. 
Incorrect responses were discussed and the testing 
was not initiated until S performed correctly. Before 
starting testing S was told, “We're going to do some 
just like that. Press the button when you see a tree, 
if it comes right after a bal ." On the ball-tree 
(B-T) task no S required more than three presenta- 
tions of the pretest series of slides. 

For the B-T task 201 slides were presented in- 
cluding 30 possible correct stimuli and 30 stimuli 
where a tree was not preceded by a ball. For the 
B-T task five counters recorded responses as fol- 
lows: (a) correct, (b) tree not after ball, (c) ball, 
(d) other incorrect. responses, and (e) late responses. 
Responses were also recorded by E as they occurred 
in order to obtain information about sequence. 
Hand recording was not as reliable as the automatic 
counters. Exact information was obtained for the 
sequence of responses to the correct stimulus, while 
there were some missing data for the sequence of 
incorrect responding. 

Testing time for 
required 64 min. 
structions, pretest, 
20 min. 


the T task was 3.8 min. B-T 
Total testing time including in- 
and rest time was approximately 
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TABLE 2 


NUMBER AND Tyre or RESPONSES TO THE "TREE" 
SERIES BY EXPERIMENTAL AND CONTROL 


SUBJECTS 
Responded to Chose wrong | Responded 
Matched tree stimulus late 
pair 
E c E c E c 
1E 1C| 28 27 1 1 1 1 
2E 2C| 27 29 1 2 2 2 
4E 4C| 29 25 0 1 2 3 
6E 6C| 22 30 1 0 6 0 
7E 7C| 28 29 0 2 4 0 
8E 8C| 30 30 2 1 0 0 
10E 10C| 29 29 0 0 0 0 
11E 11C| 30 30 2 1 1 0 
12E 12C| 19 28 2 1 2 1 
13E 13C| 30 29 0 0 0 2 
14E 14C| 28 30 0 2 0 1 
Total | 300 316 9 11 18 10 
M 27.3 28.7 8 18 16| 9 


Note.—E = experimental Ss, C = control Ss, 


RESULTS 


Nearly all Ss performed well on the T task 
(Table 2). Very few correct responses were 
omitted and few errors were made. 

The data from one pair of Ss (Pair 12), 
the most retarded, had to be omitted from 
the results on the B-T task. It was clear from 
their performance that they had not main- 
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tained the instruction set and had become 
confused during the course of the task. The 
data for the remaining 10 Ss are shown in 
Table 3. 

Since matched pairs of Ss were used, it was 
possible to compare the number of correct 
responses (excluding late responses) by com- 
puting the value for the differences between 
pairs. The ¢ value of the difference was 5.01, 
significant at the .001 level. Thus, it appeared 
that the phenylketonuric group (which in- 
cluded Ss who were retarded, nonretarded, 
on diet, and off diet) made significantly fewer 
correct responses than their matched controls 
(Table 3). 

Table 3 shows that the frequency of errors, 
other than late responses, was approximately 
the same in both groups. The occurrence of 
late responses, however, was significantly 
more frequent in the phenylketonuric group 
(t = 3.38, p < .01). 

A late response could follow either a correct 
or incorrect stimulus. From the record on 
hand-scored tally sheets, an attempt was made 
to determine where the late responses had 
occurred. The experimental group made 47 
late responses, 27 after a correct stimulus, 
7 after an incorrect stimulus, but the remain- 
ing 13 late responses could not be placed in 
sequence. When the late responses known to 


TABLE 3 


NUMBER AND TYPE OF RESPONSES 


TO THE “TREE AFTER BALL” SERIES 


BY EXPERIMENTAL AND CONTROL SUBJECTS 


ae a ae Chose wrong stimulus 
Matched Matter ball Rene 
PX Tree Ball | Other | Tree Ball | Other 
E c E E E c c c LES 

IE 1C| 16 25 1 0 0 8 0 0 3 3 
2E 2C| 25 25 3 0 0 5 2 0 7 : 
4E 4C| 17 16 3 3 4 0 0 0 H y 
6E 6C} 22 29 0 0 0 1 0 0 5 1 
7E 7C | 17 26 5 2 2 1 0 0 9 Ü 
SEU BCSMNU2S 30 1 0 0 0 0 0 9 0 
10E 10C 21 28 2 0 1 2 0 0 2 0 
ME 11C 23 28 1 1 0 1 0 0 3 0 
BE 13C| 23 30 1 0 1 7 0 1 5 1 
14E 14C 19 25 0 1 0 4 0 1 2 

Total 208 262 17 7 8 29 2 2 47 i 
M 208 | 262 17 4 8 29 2 2 EE 

Note.—E = experimental Ss; C = control Ss. 


PKU CHILDREN on A CONTINUOUS PERFORMANCE TEST 


Missed Tree After Ball 


Errors 
(per cent of total possible) 


lOl- 15l- p E 
150 20! 50 


Q Uer vier 
50 100 


Chose Wrong Stimulus 


Si- l0- I5- 9 Ue 
100 
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Responded Late 


5I- IOl- 15k 


150 20! 50 100 150 20! 


Sequence of Slide Presentation 
(B-T series) 


Fic. 1, Frequency of errors on the B-T series by type of error and by 
sequence of slide presentation. 


follow the correct stimlus were added to the 
correct responses the difference between the 
two groups was eliminated. 

Data showing the sequence of responses in 
blocks of 50 slide presentations are shown in 
Figure 1. The number of possible correct re- 
sponses in each of the four blocks was 7, 7, 8, 
and 8, respectively. 

Figure 1a shows that the experimental 
group made fewer correct responses as test- 
ing proceeded. The slope of the line for the 
experimental group was significantly different 
from zero (£ = 3.52, p < .01). In the control 
group the ¢ value was 1.61 (< .10). All 
the experimental Ss had a positive slope 
whereas the control Ss tended to differ from 
each other. 

Having found that the slope of correct re- 
sponding over the stimulus presentations was 
different for the two groups, the question 
remained whether the two groups differed 
from the outset. Each experimental S's per- 
formance for the first 50 stimulus presenta- 
tions was compared with that of his matched 
control. The value of the difference between 
matched pairs was 3.86 which was significant 
at the .01 level. 

Figure 1b presents the number of errors of 
commission made by each group and Figure 
1c presents the late responses. Fourteen errors 
(1 error of commission and 13 late responses) 
made by the experimental group are missing 
from Figure 1. The errors were recorded by 


the automatic counters, but were not located 
in sequence by hand scoring. There were no 
missing data for the control group. There 
appeared to be a sharp upswing in the number 
of errors, especially late responses, made by 
the experimental group toward the end of the 
test session. Other differences between the two 
groups might have emerged if the length of 
the test session had been extended. 

In the analysis of a test designed to mea- 
sure attention in children it would seem ap- 
propriate to expect a relationship between 
performance and mental age (MA). In the 
study (see Footnote 2) on delayed response 
a relationship between MA and performance 
was found. In the experimental group in the 
present study the correlation between MA 
and correct responses on B-T was r= 41 
(ns). Analysis of the parallel data for the 
control group showed r = .72 (p < .05). The 
correlation between correct response on the 
B-T task and IQ was r=.15 (ms) for the 
experimental group and r = .22 (ms) for the 
controls, The relationship between CA and 
correct response was 7 = .13 for the experi- 
mental group and r= .35 for the controls. 


Discussion 


In a task requiring continuous attending 
over a period of approximately 63 min. the 
performance of the phenylketonuric group in 
choice of correct stimulus was significantly 
below that of their matched controls. The 
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performance of the experimental group 
changed significantly over the sequence of 
stimulus presentations showing a highly sig- 
nificant positive slope of missed responses, 
whereas in the control group the slope 
was in the same direction but did not reach 
significance. 

A comparison between the two groups on 
the initial level of performance also indicated 
that the experimental group made fewer cor- 
rect responses from the beginning and from 
the slopes it appeared evident that the differ- 
ence between the two groups increased as 
testing continued, Thus, it would appear that 
the phenylketonuric Ss not only showed an 
impaired attention span as compared to the 
control group, but they were also increasingly 
affected by the task which required con- 
stant attending. In an earlier study (see Foot- 
note 2) in which discrete trials in delayed 
response were presented to the same Ss, only 
the performance of the experimental group 
declined over the course of the trials. 

When late responses following a correct 
stimulus were added to the total correct re- 
sponses, the performance of the two groups 
did not differ. The finding that most of the 
late responses occurred immediately after the 
correct stimuli suggested that experimental Ss, 
while maintaining a level of attention suf- 
ficient to recognize the correct sequence of 
stimuli, were not able to respond within the 
allotted time. 


V. E. ANDERSON, F. S. ScEL, R. O. FrscH, AND R. D. Wirt 


The errors and late responses which haye 
been identified suggest that a sharp upswing 
in late responses in the experimental group 
occurred after 150 slide presentations.. Thus 
one extension of the present study would be 
to increase the number of slide presentations 
in the B-T task. 

The phenylketonuric Ss were a heterogene- 
ous group consisting of some Ss with average 
intelligence, some retardates, some Ss on diet, 
and some off diet. Although the total sample 
was too small to evaluate the effect of intel- 
ligence level and diet on performance, it was 
clear from the results that the experimental $5 
individually and as a group performed more 
poorly than their matched controls. 
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SENSORY-DECISION THEORY ANALYSIS OF THE 
PLACEBO EFFECT ON THE CRITERION FOR 
PAIN AND THERMAL SENSITIVITY (d’)* 


W. CRAWFORD CLARK 2 
New York State Psychiatric Institute 


Administration of a placebo described as a potent analgesic to 22 normal Ss 
sharply decreased the proportion of pain responses to radiant heat stimulation, 
suggesting that the threshold for pain had been raised. However, analysis of 
the data by sensory-decision theory demonstrated that thermal sensitivity (d') 
remained unchanged. The effect of the placebo was solely to alter Ss' response 
bias or likelihood ratio criterion (5 < .005). 


Quantification of the pain experience still 
eludes us. Even in the study of experimentally 
produced pain where the investigator has con- 
siderable control over a large number of vari- 
ables such as stimulus parameters, the selec- 
tion and training of Ss, and many other as- 
pects of the experimental situation, there is 
very little agreement concerning either the 
threshold for pain or the effect of analgesics. 
Even one of the best procedures from the 
viewpoint of classical psychophysics, in which 
accurately determined amounts of radiant 
heat are applied to the skin according to the 
method of limits, yields widely varying re- 
sults. Mean thresholds obtained by different 
investigators vary between 182 and 348 
mcal/sec/cm* (Hardy, Wolff, & Goodell, 
1952), and individual thresholds range from 
170 to 472 mcal/sec/cm* (Clausen & King, 
1950). Particularly distressing is the fact that 
the experimental pain threshold in man fails 
to respond dependably to powerful analgesic 
agents (Beecher, 1959). If the threshold is 
a valid indicator of sensory-physiological 
function, experimental pain thresholds would 
be expected to be raised by analgesics known 
to be effective relievers of pathological pain. 
Yet, even when the experimental pain experi- 
ence has been made to resemble that of 
pathological pain by creating the slowly de- 
veloping, sustained, aching pain produced by 
a tourniquet (Beecher, 1966), by ice-water 

1 This research was supported in part through 
Grant FR-05560 from the National Institute of 
Mental Health. 

? Requests for reprints should be directed to W. 
Crawford Clark, New York State Psychiatric Insti- 
tute, New York, New York. 


immersion (Wolff, Kantor, Jarvik, & Laska, 
1966), tooth-pulp stimulation (Sherman, 
Fiasconaro, & Grundfest, 1963), or intra- 
muscular injections of hypertonic. saline 
(Jarvik & Wolff, 1962), the effects obtained 
with analgesics are inconsistent (Kutscher & 
Kutscher, 1957), and threshold variability 
remains a problem. 

Tf physical and physiological variables can- 
not account for inconsistent responses to pain, 
perhaps psychological variables can. Non- 
sensory factors such as direct instructions, 
covert suggestion, social context, and even 
personality have been shown to exert a deter- 
mining influence on the pain threshold (Hall, 
1953). Instructions which induce a change in 
the criterion for pain alter the threshold for 
experimental pain (Wolff & Goodell, 1942); 
even a description of the sensations to be 
experienced produces a lower threshold 
(Clausen, Gjesvik, & Urdal, 1953). Sugges- 
tion, it is well known, will often reduce the 
anguish associated with pathological pain, 
particularly when augmented by a placebo 
(Hass, Fink, & Hártfelder, 1963). The impor- 
tance of the experimental context has been 
demonstrated by McKenna (1958), who 
found that a large number of his Ss would 
not have regarded the noxious stimulus as 
painful had they encountered it outside the 
laboratory. Pain tolerance can also be influ- 
enced by social factors such as the saliency 
of group membership (Lambert, Libman, & 
Poser, 1960). With respect to personality 
variables, psychiatric patients whose pre- 
dominant symptom was anxiety had a lower 
threshold for experimental pain stimulation 
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TABLE 1 
RATING SCALE FOR CATEGORIZING STRENGTH OF THERMAL EXPERIENCE 
ALONG THE OBSERVATION CONTINUUM 
Cı C: Cs Ca Cs Cs |C: Cs Ce Cio Cu Cn 
Ro Ri Ra Rs Rs Rs Re Ri Rs | Ro | Rio Ru Ry 
nothing | maybe | faint warm | moderate} low  |hot very | very | faint | pain | very extremely 
some- | warmth heat hot faint | pain painful | painful 
thing pain (with- 
draw 
gun) 


Note.—C and R designating criterion and response category were not in the scale viewed by S. 


than did normal controls (Malmo & Shagass, 
1949) or depressed patients (Hemphill, Hall, 
& Crookes, 1952). In addition, it has been 
reported that good tolerators of both patho- 
logical and experimental pain tended to be 
extraverted (Petrie, Collins, & Solomon, 1960) 
and to score high on a “psychopathic” index 
(Schalling & Kareby-Levander, 1963). Thus, 
threshold variability may be due not to 
changes in sensory sensitivity but to altera- 
tions in S's response bias—his general tend- 
ency to inhibit or to emit the response “pain.” 
This possibility has been raised explicitly by 
Barber (1963), who concludes that although 
in some instances “hypnotically suggested 
analgesia" produces some amelioration of the 
pain experience, often the result is not a 
reduction in pain, but rather an unwilling- 
ness to admit to the E that pain had been 
experienced. 

It should now be clear that pain thresholds 
obtained by traditional psychophysical pro- 
cedures depend upon both sensory and non- 
sensory variables. Thus, when pain thresholds 
are reported to be functions of such variables 
as sex, fatigue, and anxiety level, in which 
physiological and psychological differences are 
intertwined, it is impossible to decide whether 
a difference in threshold represents altered 
sensory sensitivity or altered response bias. 
Fortunately, a new approach, based on the 
application of the theory of signal detection 
to human observers, has recently been de- 
veloped (Green & Swets, 1966). The aspect 
of sensory-decision theory which is of par- 
ticular importance here is the division of the 
traditional threshold into two components of 
observer performance. One component pro- 
vides a relatively pure measure of sensory 
sensitivity (d' which ideally remains un- 


altered when nonsensory variables such as the 
subjective costs and values of yes and no 
are manipulated. The other independently 
determines the location of the observer's sub- 
jective criterion by relating it to the likelihood 
ratio (Lx). Using this procedure, Clark 
(1966) demonstrated that inhibiting instruc- 
tions which induced a cautious attitude caused 
Ss to raise their criteria for visual flicker, 
while facilitating instructions had the opposite 
effect. In spite of this shift in response bias, 
sensory sensitivity (d') remained unaltered, 
even though analysis of the same data by 
the method of constant stimuli indicated that 
the inhibiting instructions had raised the 
flicker threshold. In the present investigation, 
sensory-decision theory was used to study the 
effect of a placebo described as a potent 
analgesic on thermal sensitivity (d') and the 
subjective criterion for pain. 


METHOD 


The 22 paid volunteers, mostly college students, 
each served in a control and a placebo session 0? 
different days. Sessions were counterbalanced to per- 
mit a statistical analysis of possible order effects. 
The Hardy-Wolff-Goodell Dolorimeter with its n 
extended below 100 mcal/sec/cm? to zero jnten: : 
was used to present radiant heat stimuli of E 
duration to two 3.5-cm. diameter patches of In x 
ink applied to either side of the midpoint c. 
wrist and elbow on the volar surface of the pi 
preferred forearm. The approximate threshold of e m 
S was determined by obtaining rating repo 
stimuli of 200, 100, 260, 0, 300, and when we 
330 mcal/sec/cm? in intensity, presented in the 0i ular 
listed. On the basis of these responses, & parap D. 
set of stimulus intensities tailored to each Be 
used throughout the experiment. The average M 
of the stimuli over all Ss were So=0, om. 
Se= 208.6, Ss= 271.0, and S, = 308.8 meal/see/ ai 
The stimuli, 25 at each intensity, were Pte 


: very 
alternately to the distal and proximal spots € 


= 94.4, 
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TABLE 2 
Dnvc REACTION CHECKLIST 
Percentage of Ss 
giving a positive 
Item response 
Start |During| End 
1. Press left fingernail into the 
thumb; is there any loss of 
feeling? 27.3 | 50.0 | 27.3 
2. Any feeling of an upset stom- 
ach or nausea? 4.5 | 13.6 | 00 
3. Slight headache or heaviness 
in the head? 36.4 | 40.9 | 13.6 
4, Any dizziness, or faintness? 13.6 | 9.1 | 13.6 
5. Is your mouth drier than 
usual? 44.5 | 27.3 | 13.6 
6. Do you feel high, light-headed, 
or euphoric? 18.2 | 31.8 | 9.1 
7. Do you feel tense or nervous? | 364 | 31.8 | 9.1 
8. Do you notice any tingling 
sensations in the arm or 
elsewhere in the skin? 40.9 | 364 | 36.4 
9. Press your teeth against your 
tongue. Doesitfeel numb? | 22.7 | 40.9 | 18.2 
10. Are you unable to concentrate?| 4.5 | 18.2 | 13.6 
11. Do you feel depersonalized, 
that is, that you're not 
really here, or you are out- 
side yourself looking in? 22.7 | 13.6 | 9.1 
12. Is there any feeling of bodily 
change or lightness? 273 | 182 | 9.1 
13. Do you feel confused? 45| 91| 91 


20 sec, the intensities being randomized with the 
restriction that each appear once every 15 trials. 
The following instructions were given: 


We wish to determine your ability to feel 
warmth, heat, and faint pain. A variety of heat 
intensities, including zero, will be applied to your 
arm. Some stimuli will be so weak that you will 
feel nothing at all, others will be hot, while others 
will produce a very faint pain sensation. Remem- 
ber, we don't want to see how much pain you can 
Stand; rather, we want to know how good you 
are at detecting the presence of a just noticeable 
amount of pain. You may use as many descriptive 
categories as you wish. As an aid in maintaining 
consistency, here is a list of possible responses [see 
Table 1]. If the stimulus is too painful, you may 
remove the projector from the skin. Alternate the 
spots, starting with the one closer to the wrist. 


During the placebo session, these additional instruc- 
tions were given. 


This drug is an analgesic or "pain-killer" which 
has been in use a long time and is, of course, com- 
Pletely safe, To date it has been used for internal, 
Pathological pain. We are interested in determining 
its efficacy to reduce sensations of heat and pain 
Produced by external thermal stimulation. I must 
first ask you a few questions to make certain that 
it is safe to administer the drug. 
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The questions on the medical checklist emphasized 
the importance of excellent health as a prerequisite to 
taking the drug, queried the strength of S’s reaction 
to analgesics and anesthetics taken previously, and 
suggested that the drug was a narcotic of some sort 
by assuring him that a single dose could not be habit 
forming in any way. The S was asked if he were 
willing to sign a “release” at the end of the experi- 
ment, which included the statement that he was in 
complete control of himself and that he would not 
drive for the rest of the day. The S then swallowed 
a bitter, white powder consisting of 392-mg. lactose 
and 8-mg. quinine, followed by a glass of water.® 
This amount of quinine is well below the minimum 
analgesic dose of 300 mg. (Goodman & Gilman, 
1965). The idea that an analgesic had actually been 
administered was further strengthened by obtaining 
S's responses to a drug reaction checklist (see 
Table 2) near the beginning, middle, and end of the 
approximately 90-min. test session. The effectiveness 
of the induced mental set was demonstrated by the 
fact that all Ss but one reported some of the follow- 
ing effects: numbness of the tongue or fingers, slight 
headache, faint nausea, dizziness, depersonalization, 
confusion, and euphoria. 


RESULTS 


The group data for the cumulated proba- 
bility of occurrence of any response (R) or a 
higher one (R+) conditional upon a par- 
ticular stimulus intensity being present ap- 
pear in Table 3. (Three Ss who failed to give 
any "nothing" responses had to be excluded 
from the study.) Isosensitivity curves may 
be generated from confidence-rating data by 
compressing the ratings to form a dichotomy 
at each of the confidence levels used to rate 
each of the stimuli (Hake & Rodman, 1966; 
Swets, Tanner, & Birdsall, 1961). In a simi- 
lar manner—but substituting category ratings 
for confidence ratings—the responses in each 
of the rating categories, R; (7 — O to 12), to 
each of the five thermal intensities, Sọ to S4, 
were averaged over Ss and converted to condi- 
tional probabilities, that is, the probability 
of a particular response conditional upon a 
stimulus of a particular thermal intensity 
being present. Next, these conditional proba- 
bilities were cumulated successively from R;» 
to Ro, about the criteria from Ci» to C; to 
express the probability that a particular re- 
sponse or a higher one from the rating scale, 


8The author thanks B. Montefusco, Senior 
Pharmacist at the New York State Psychiatric Insti- 
tute, for providing the capsules and suggesting their 
composition and appearance. 
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TABLE 3 
STIMULUS-RESPONSE MATRIX: CUMULATED PROBABILITY OF A RESPONSE 
OR A HIGHER ONE TỌ A PARTICULAR STIMULUS 
Stimulus Response category 
rU d 7 F E 
mcai " 
on Re | Re, | Rea | WE OB [Bite | Bt VER|VERE| Re+ PO VEH | BE 
A. Control condition 
$,(308.8) 1.000 | .998 | .993 | .982 | .945 | .905 | .771 | .716 | .629 | .445 | .155 | 04 
$3(271.0) 1.000 | .994 | .973 | .938 | .843 | .738 | .509 | .444 | .329 | .162 | .024 | 002 
S2(208.6) | 1.000 .993 | .940 | .766 | .535 | .358 | .211 | .087 | .056 | .011 | .002 
Sı (94.4) | 1.000 .760 | .385 | .114 | .051 | .025 | .011 .004 
So (0.0) | 1.000 .336 | .064 | .018 | .007 | .004 | .002 
B. Placebo condition 
S,(308.8) 1.000 | .998 | .974 | .932 | .898 | .832 | .658 | .554 | .414 | .247 | .102 | 007 
$3(271.0) | 1.000 997 | .975 | 910 | .837 | .739 | .608 | .396 | .307 | .204 | .098 | .022 
$,(208.6) | 1.000 .984 | .889 | .720 | .485 | .320 | .185 | .075 | .056 | .020 | .007 
Sı (94.4) | 1.000 .609 | .338 | .095 | .042 | .024 | .009 007 
So (0.0) | 1,000 251 | .044 | .007 | .005 004 


Note.— Each row represents 550 responses, 25 to each stimulus by 22 Ss. 


Ri,, had occurred. It may be profitable to con- 
sider an example from Table 3, Section A. 
The cumulated conditional probability of the 
responses, Rj,, that is, from “maybe some- 
thing,” “faintly warm,” etc. to “extremely 
painful,” locates the position of the rating 
criterion, C,, used by the Ss to separate the 
sensory experience reflected in the response, 
“nothing” (Ro), from subjectively more in- 
tense sensory experiences. Thus for the zero- 
intensity stimulus, So, the cumulated proba- 
bility is expressed as pg)(Ri) = .336. Simi- 
larly, the same criterion, Cı, applied to the 
stimulus of immediately higher intensity, S,, 
yields the point ps,(Ri,) = .760. These two 
values, which are analogous, respectively, to 
the false affirmative and hit rates obtained 
in the binary decision procedure, determine 
the upper right-hand point in quadrant S,—S, 
of Figure 1; for this point, d' — 1.13, and 
Cı = .85. C, is analogous to the likelihood 
ratio criterion, Lx, and is determined in the 
same way—by dividing the probability that 
a particular observation value (x) arose from 
an S; distribution (i = 1 to 4) by the proba- 
bility that the same observation resulted from 
the adjacent lower distribution, S; ;. In this 
manner, isosensitivity, or receiver-operating 
characteristic, curves (Figure 1) were con- 


structed from the group data in Table 3, with 
response categories containing fewer than 10 
responses being omitted, to determine whether 
the results follow the form predicted by sen- 
sory decision theory. The results are plotted on 
normal-normal coordinates, that is, on prob- 
ability scales transformed so that the normal 
deviates are linearly spaced. The ordinate 
represents the cumulated probability that $ 


perceived the more intense stimulus of a pait 


to be above a particular criterion value 


(analogous to the probability of a hit), while | 


the abscissa represents the cumulated proba- 
bility that he perceived the lower intensity 
stimulus to be above that same cite 
(analogous to the probability of a false a 
firmative). For each of the four xe 
intensity pairs (Si—So, S2—S1, rd 
S,—S;)the points representing data bier 
under both experimental conditions appen 
fall on single lines with slopes close to ^ 
The underlying noise and signal-plus a 
distributions are therefore probably Gauss?» 
and of equal variance. 
In oia to make statistical tests to d 
mine whether the placebo affected Pes 
sensitivity or response bias, separate E. 
response matrices for control and Dion 
conditions were constructed for each $- 
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PROBABILITY OF A HIT 


05 :25 :50 75 “95 05 25 :50 75 “95 


PROBABILITY OF A FALSE AFFIRMATIVE 


Fic. 1, Receiver-operating-characteristic curves under placebo (A) and con- 
trol (0) conditions, plotted on normal-deviate axes. (The ordinate and 
abscissa obtained from the rating data are analogous to the hit and fale-[7 jj)  |/ 
affirmative rates obtained from the binary decision technique.) 


these, values of d' and C, were computed for being judged, criteria at different locations 
each S using the procedure just described along the sensory continuum will have iden- 
for the group data. Since slightly different tical likelihood ratios (Luce, 1963). In this 
thermal intensities were used to stimulate each case, it must be assumed that S's decision 
S, it was necessary to obtain a common mea- includes some additional factor, such as the 
sure of discriminability before pooling the magnitude of his sensory experience. Testing 
data. This was done by taking the differences of the hypothesized shift in response bias 
in intensity between each of the adjacent 


stimulus values (e.g. S1—80o, Se—Si, etc.) for TABLE 4 
E S and expressing them as mcal/sec/cm* MEAN THERMAL DISCRIMINABILITY 

© per d’, The means of these values (somewhat 
akin to a just noticeable difference with re- Mean discriminability (mcal/sec/cm? per d’) 
‘sponse bias removed) for each of the four Condition 
Stimulus pairs under the placebo and control (S1-S0)/d! | (S-S)/d' | (S1-S1)/d' | (SeS9/d' 
conditions appear in Table 4. The individual Control | 8325 | 6533 | 40.60 | 31.55 
stimulus-response matrices also yielded the ^ Placebo 99,09 | 60.79 | 4425 | 34.70 


criterion measure (Cx) used by each S to Meanover 


judge his sensory experiences. The C, is tions 
analogous to Lx, but reflects the fact that 

: " "m .—So = 0, = 94.4, S: = 208.6, Ss = 271.0, S 
when more than two stimulus intensities are = es mem 0 H ey 


91.17 63.06 4242 33.13 
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TABLE 5 


MEAN THERMAL CRITERIA (Cx) EXPRESSED 
As LIKELIHOOD RATIOS 


Criterion 
Condition 
Ci CxCa | Ci-C: | CC. 
detection| warm | hot pain 
Control 149 | 2.01 1.01 1.04 
Placebo 190 | 234 | 1.75 | 1.76 
Difference between 
conditions 041 | 0.33 | 0.74 | 0.72 


between experimental conditions requires the 
measurement of the change in a particular 
criterion value to a particular stimulus pair, 
for example, Cio to S4—S;. Thus only those 
criteria used by S on both sessions to the 
same stimulus pair were available for statisti- 
cal analysis. However, no response data were 
lost, since values of C, are cumulative mea- 
sures. The Ss tended to associate particular 
responses, or criteria, with each of the stimu- 
lus intensities. For example, C; was frequently 
used when S, and S, were presented, and 
never resorted to when S, was presented. The 
"warm" criteria (C2, Cs, and C4) were used 
with S3—$S;, the “hot” criteria (C;, Ce, and 
Cz) with Ss—So, and the “pain” criteria 
(Cs to Cia) were used to evaluate S,—S;. 
The increase in mean criterion values induced 
by the placebo condition appears in Table 5. 

According to Hartley's test there was no 
significant departure from homogeneity of 
variance, and an analysis of variance for re- 
peated measures with Ss nested within order 
was used to treat separately the measures of 
discriminability, mcal/sec/cm? per d', and of 
criterion magnitude, C, (Winer, 1962), 
Thermal discriminability was not altered by 
the administration of the placebo, nor were 
there any significant order or interaction ef- 
fects. However, discriminability did vary 
with stimulus intensity (F = 23.3, df = 3/60, 
b < .001), Ss becoming progressively more 
sensitive as intensity was increased. According 
to the Newman-Keuls procedure for the dif- 
ference between treatment means, each of the 
mean values of mal/sec/cm? per d' in the 
lower row of Table 4 differs significantly from 
each of the other values (5 — .01) with the 
exception of 42.42 versus 33.13 mcal/sec/cm? 
per d' Analysis of variance on C, demon- 
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strated that administration of the placebo did 
significantly increase the magnitudes of the 
subjective criteria (F = 8.53, df = 1/20, 
$ < .01). Each increment in the magnitude 
of Ss’ mean criterion is significant (p < ,005) 
according to 7 tests for correlated means | 
(Table 5). Thus, not only were the criteria 
for “pain” (Cs to C12) raised, but the cri- 
teria for “hot,” “warm,” and even "detec-. 
tion" were also increased. There were no 
significant order or interaction effects for C,. 
A graphic portrayal of the results from 
Tables 4 and 5 appears in Figure 2, The 
abscissa is linear with respect to d' and may 
be regarded as the observation axis repre- 
senting the magnitude of the observer's phe- 
nomenal experience (x), transformed to pro- 
duce Gaussian density functions. The ordinate 
represents probability density. The left-hand 
distribution represents the probability that # 
will occur in the presence of no stimulus, and 
in the remaining distributions the probability 
that x will occur in the presence of one of 
the four thermal stimuli. Since the distribu- 
tions overlap, each value of x has a set of 
probabilities, and. hence of likelihood ratios, 
associated with it. The five density functions 
are assumed to be of equal variance, since 
there is no strong evidence that the slopes of 
Figure 1 depart significantly from unity. The 
fact that sensitivity to thermal stimulation 
(d') was not altered by the placebo is re 
flected in the single set of density functions 
averaged over experimental conditions. The 
placebo-induced increase in the value of each 
of the criteria (Table 5) is represented by 1 
the arrow. If a linear relationship between. a 
sensory experience and stimulus intensity o 4 
the range of 271.0 to 308.8 mcal/sec/cm i 
assumed, then under the control condition the 
criterion for pain was set at the mean sera 
tion magnitude produced by a stint E? 
approximately 289 mcal/sec/cm*, while unt T 
the placebo condition the subjective cte d 
for pain was raised to 304 mcal/sec/ c d 
In order to compare the measures of adi- 
C. with the thresholds obtained with (9 
tional psychophysical procedures, the br 
response data were also processed accor wi 
to the method of constant stimuli. Ted 
chophysical ogives were obtained by e i 
the probability of a “faint-pain” (or gre? ^ — 
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OBSERVATION (X) 
Fic. 2. Interlocking hypothetical probability density functions of So, S:, Ss, Ss, and Si. 


response against stimulus intensity, that is, 
Ds; (Rs), Pss (Rs,), and ps, (Ra) (see Ta- 
ble 3). The normally transformed data were 
readily fitted with a straight line. In contrast 
to the constancy of d' over experimental condi- 
tions, the constant stimuli procedure yielded 
interpolated 5096 thresholds of 281 and 302 
mcal/sec/cm? for the control and the placebo 
conditions, respectively. 


DISCUSSION 


The administration of a placebo described 
as a potent analgesic decreased the propor- 
tion of pain responses to intense thermal 
stimulation. (Although every S but one re- 
ported some effect from the capsule, there 
was no significant relation between the num- 
ber of positive responses to the drug reaction 
Checklist and the size of an individual's cri- 
terion shift according to Kendall's rank cor- 
relation coefficient, 7.) Sensory-decision theory 
analysis indicates that the reduced number of 
pain responses reflects an increase in the 
amount of noxious stimulation Ss were willing 
to endure before calling it pain, rather than 
a decrease in their thermal sensitivities. The 
constancy of the index of sensory sensitivity 
(d’) strongly suggests that under the placebo 
condition the intensity of the phenomenal 
experience, or perhaps on a physiological level 
the number of neural impulses, produced by 
the stimulus did not decrease. The sole effect 
placebo was to cause Ss to increase 
itude of their criteria for pain (Ca 


to Ci»), presumably because the placebo- 
induced set increased the embarrassment or 
social cost of a “pain” response. In contrast 
to the constancy of d', analysis of the same 
data by the method of constant stimuli led to 
the conclusion that the placebo raised the 
threshold to painful stimulation. Since the 
threshold is generally regarded as the recip- 
rocal of sensory sensitivity, the customary 
interpretation would then be that the placebo 
had in fact exerted an analgesic effect and 
actually attenuated the experience of pain. 
According to decision theory analysis, how- 
ever, the change in the constant stimuli 
threshold was due to the effect of suggestion 
on the attitude or response bias of Ss. Other 
methods used to measure pain in which the 
subjective criterion of S is allowed to vary in 
an indeterminable manner, such as limits, 
adjustment, "titration," and response latency, 
are equally unsatisfactory. In sum, sensory- 
decision techniques offer a new approach to 
the study of pain perception, even though 
certain adaptations, such as the substitution 
of rating categories for a posteriori probabil- 
ity of stimulus presence, are necessary and 
may force the relaxation of some of the as- 
sumptions of the theory. The isosensitivity 
curves obtained demonstrate that Ss were able 
to maintain at least an order relation between 
the 12 categories used, even when five dif- 
ferent intensities were presented in a single 
session. 

A number of researchers (Gelfand, 1964; 
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Wolff, Krasnegor, & Farr, 1965) who found 
that instructions altered pain tolerance but 
not. pain thresholds have tentatively con- 
cluded that pain tolerance is determined pri- 
marily by psychological variables, while the 
pain threshold reflects the influence of purely 
sensory variables. The results of the present 
experiment, where the pain threshold was 
readily influenced by instruction-induced set, 
do not support this viewpoint. In fact, one 
may question whether a pain threshold, in 
the sense that threshold implies a lower limit 
on sensitivity, exists at all. The linearity and 
slope of the receiver-operating-characteristic 
curves obtained (Figure 1) support the sen- 
sory-decision concept that the criterion may 
be located anywhere along the continuous 
sensory-decision axis. Note, for example, that 
a few pain responses were obtained when the 
blank was presented. 

Even when E does not intervene with a 
placebo, and in fact keeps his instructions as 
neutral as possible, different Ss may perceive 
subtle interpersonal and social demand char- 
acteristics of the experimental situation in 
different ways (Orne, 1962; Rosenthal, 1963) 
and alter their pain criteria accordingly. Thus 
when the method of limits threshold for ex- 
perimental pain is found to be higher in 
Northern Europeans than in Mediterranean 
people or Negroes, and higher in old than in 
young Ss (Chapman & Jones, 1944), it is 
not clear whether these differences have a 
physiological or a psychological basis. Simi- 
larly, as Kolb (1962) has emphasized, differ- 
ences in the reaction to pathological pain by 
patients who differ in age, socioeconomic 
status, race, or religion may be due to differ- 
ences in attitude rather than differences in 
pain sensitivity. 

Lower pain thresholds have been reported 
frequently in Ss made anxious by the threat 
of shock, an impersonal E, unfamiliar ap- 
paratus, or inability to control the intensity 
of the pain stimulus (Haslam, 1966; Hill, 
Kornetsky, Flanary, & Wikler, 1952; Kornet- 
sky, 1954; Lepanto, Moroney, & Zenhausern, 
1965). Although it is possible that Ss were 
experiencing more pain, a more reasonable 
explanation is that they had altered their 
response bias. The S who is worried that he 
might be hurt may say pain more often at 
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the lower intensities, perhaps in the interest 
of inducing Æ to avoid high-intensity stimuli, 
thus spuriously lowering his pain threshold. 
Obviously, it is impossible to control for all 
of the exquisitely subtle variables which may 
influence attitudinal bias. Even double-blind 
procedures fail to control for attitudinal shifts 
produced by the side effects of analgesics, 
such as nausea or euphoria. The best solution 
to this and other problems in pain perception 
appears to be to measure separately the 
sensory sensitivity and the attitudinal bias 
of each S by means of a sensory-decision 
theory procedure such as the one described 
here, or some variant of it. 
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selective reinforcement." The present study was an attempt to replicate and 
extend the Ullmann et al. study. The results suggested that social reinforce- 
ment of common associations by chronic schizophrenics fails to produce any 
evidence of a generalized increase in the tendency to give common associa- 
tions. The results of the Ullmann et al. study may be seen as the result of 
a statistical artifact—regression to a common mean. Further, inappropriate use 
of the analysis of covariance (ANCOVA) to treat data of this type was 


discussed. 


Ullmann, Krasner, and Edinger (1964) have 
stated that “common associations by schizo- 
phrenics to word association stimuli may be 
brought under control by selective reinforce- 
ment [p. 17].” They claim to have replicated 
and extended an earlier study by Sommer, 
Witney, and Osmond (1962). The latter pre- 
sented a list of stimulus words to schizo- 
phrenics for a number of trials, reinforcing 
common associations with cigarettes and 
praise. They found that reinforcement led to 
an increase in the number of common as- 
sociations given to these stimuli. 

Ullmann and his co-workers administered 
the Kent-Rosanoff word list to two groups of 
20 chronic schizophrenics. In order to obtain 
a base-line rate of common associations, 
neither group was reinforced for the first 20 
responses. For the remaining stimuli, the 
experimental group was reinforced for each 
common association by a smile, “mmh-hmm,” 
and a head nod. The control group was not 
reinforced for any response, 

For the experimental group, which had been 
reinforced for common associations, Ullmann 
and his co-workers reported an increase in 
the number of common associations from the 
first 20 words on the Kent-Rosanoff list to 
the last 20 words. Such an increase did not 
occur for the nonreinforced controls, for whom 
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Indiana University 
Ullmann, Krasner, and Edinger have stated that “common associations by 
schizophrenics to word association stimuli may be brought under control by 
the number of common associations decreased 


from the first 20 words to the last 20. 

Although Ullmann et al. claim to. have 
replicated the results of the Sommer study, 
the two sets of data are really discrepant. 
Sommer and his co-workers tested their Ss for 
generalization on new stimulus words and 
were unable to find evidence that the learning 
effects had generalized. In contrast, in the 
Ullmann et al. study, each stimulus word was 
presented only once, and the effect of rein- 
forcement was demonstrated on new stimulus 
words. 

A closer look at the data reported by 
Ullmann et al. shows that the greatest mean 
differences between the two groups Occo 
on the first 20 trials. On this base-line mea- 
sure, the control group gave more orn 
association than the experimental group. T y 
suggests that the results may be D 
in terms of a statistical artifact—regression j 
a common mean. In other words, the d 
level of the control group may have been : 
high due to random sampling, while E 
operant level of the control group Was 


1 his were the 
low, due to random sampling. If t decedat 


da 
case, one would expect to fin given by 


the number of common associations 
the control group and an increase m 

number of common associations, REE, 
the experimental group even if reinfore e 
had no effect. Further, the use of the a? diat | 
of covariance rests on the assumption | 
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treatment effects are uncorrelated with the 
covariate. A detailed discussion of this point 
is presented by Evans and Anastasio (1968) 
who note that violation of this assumption 
is serious when differences among the groups 
on the covariate are relatively large, par- 
ticularly when only two treatment groups are 
involved. This was precisely the situation in 
the Ullmann et al. study. Since the base-line 
data were used in the analysis of covariance 
(ANCOVA) to adjust the final scores, the 
significant difference reported for these ad- 
justed scores may have been spurious. To 
test this possibility, it was decided to rep- 
licate the Ullmann, Krasner, and Edinger 
experiment. An added control group was used 
in order to control for the effects of the oc- 
currence of the reinforcement events inde- 
pendently of Ss’ responses. 


METHOD 


Subjects, Sixty males, diagnosed schizophrenic 
without brain pathology, were assigned to one of 
three groups. These groups averaged 45 yr. of age, 
10.5 yr. of education, and 15.8 yr. of hospitalization. 
The groups did not differ from each other signifi- 
cantly on any of these variables. 

Materials. As in the Ullmann study, 98 words 
from the Kent-Rosanoff word list were used, with 
the Russell-Jenkins (1954) norms being used as a 
Measure of common associations. Common associa- 
tions were those which had been given by at least 
100 Ss in the norm group. 

Procedure. All Ss were administered the first 20 
words of the Kent-Rosanoff word list without rein- 
forcement, so as to establish a base line of common 
associations, For the remaining items, Group DRF 
(differential reinforcement) was reinforced for all 
common associations by a smile, “mmh-hmm,” and 
a head nod. Group NRF (no reinforcement) was 
not reinforced for any responses, and Group CRF 
(continuous reinforcement) was reinforced for all 
Tesponses, regardless of whether they were common 
or not. Except for the addition of the CRF group, 
the procedure employed in the present study was 
essentially similar to that used by Ullmann and his 
Co-workers. Similarly, the measure of the effects of 
differential reinforcement was the number of common 
associations given to the last 20 words, adjusted 
for the number of common associations given to 
the first 20 items. 


RESULTS 


The analysis of variance performed on the 
number of common associations to the first 20 
items by the three groups showed no signifi- 
Cant differences in base-line rates. An analysis 
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TABLE 1 


Means AND STANDARD DEVIATIONS or THE NUMBER 
OF COMMON AssOCIATIONS GIVEN By THREE GROUPS 
OF SCHIZOPHRENICS TO THE First AND Last 20 
Irews or THE KrNr-RosaNorr Worp List 


First 20 items | Last 20 items 


Group pln 
M SD M SD 
Differential reinforcement | 10.7_ | 2.83 | 10.01 | 2.99 
Continuous reinforcement | 7.65 | 5.15 | 7.2 |4.7 
No reinforcement 9.25 | 4.36 | 9.55 | 4.35 


F =.96, 


of covariance was performed on the number 
of common associations given to the last 20 
stimulus words, using the base-line data as 
the covariance adjustment variable. The re- 
sulting F of .96 was not significant. Further, 
as may be seen in Table 1, the means do not 
seem to be in a direction which would sug- 
gest that conditioning took place. 


Discussion 


These data do not support the Ullmann 
et al. (1964) hypothesis that “common as- 
sociations by schizophrenics to word associa- 
tion stimuli may be brought under control by 
selective reinforcement [p. 17].” As suggested 
above, possible explanations for the difference 
between the results of the present study and 
those presented by Ullmann et al. lie in the 
inappropriate use of the analysis of covariance 
to treat the data, and in the possibility of a 
chance sampling error in the latter study. 
Given a spuriously high base line for the con- 
trol group, it might be expected that a second 
measure, the responses to the last 20 stimuli, 
would show a decrease in the number of 
common associations. Similarly, given a 
spuriously low base line for the experimental 
group, it might be expected that a second 
measurement would show an increase in com- 
mon associations. 

What is there to suggest that reinforce- 
ment of common associations to certain words 
should increase the frequency of common as- 
sociations to other words? This may be seen 
as an analogy to the case of bar pressing, in 
which reinforcement of a few instances brings 
about an increase in the rate of the response 
in general. This is true despite topographic 
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variations across individual response in- 
stances. Thus, these varying responses may 
be seen to be, or to have become, members 
of the same response class. But what is there 
to suggest that common associations consti- 
tute a single response class for long-term 
schizophrenics, or that they may become such 
a response class when reinforcement is given 
for specific responses? The present results 
and those of Sommer, Witney, and Osmond 
clearly do not. Other data seem equivocal. 
In general, the evidence indicates that at- 
tempts to teach schizophrenics to produce 
normal verbal behavior have resulted in 
limited amounts of generalization  (e.g., 
Isaacs, Thomas, & Goldiamond, 1965; Sher- 
man, 1965). 

To summarize the findings of the present 
study, an attempt to replicate the study by 
Ullmann, Krasner, and Edinger was unsuc- 
cessful. Social reinforcement of common as- 
sociations by chronic schizophrenics failed to 
produce any evidence of a generalized increase 
in the tendency to give common associations. 
It is suggested that the results obtained by 
Ullmann and his co-workers may have been 
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due to sampling vagaries rather than to ex- 
perimental manipulations. 4 
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The purpose of the present study was to investigate the effect of sensory 
deprivation upon hypnotic susceptibility. Ten Ss initially resistant to hypnosis 
were placed in a light- and sound-attenuated sensory deprivation cubicle for 
a maximum of 6 hr. or until sensory deprivation phenomena were elicited. 
A hypnotic induction was undertaken via a communication system, while S 
remained in the deprivation cubicle, at a point when E judged that the sensory 
deprivation was sufüciently intense. Approximately 1 wk. following the ex- 
perimental session, Ss submitted once again to a standard induction procedure. 
A control group of resistant Ss was included. Only the experimental group 
manifested a significant enhancement of hypnotic susceptibility, and this new 
level of susceptibility was maintained in the follow-up session. 


Despite numerous investigations (Hilgard, 
1965) into hypnotic susceptibility, under- 
standing of individual differences in suscepti- 
bility is still limited to speculation and clini- 
cal inference. One possible method for identi- 
fying the critical variables is through the al- 
teration of susceptibility. 

Wiseman and Reyher (1962) reported that 
hypnotically induced dreams in conjunction 
with “deepening” suggestions increased the 
percentage of complete amnesia from 33 to 70 
in a group of 30 Ss. However, this enhance- 
Ment in performance on one task does not 
necessarily imply that performance would 
have been proportionately higher on other 
tasks. As, Hilgard, and Weitzenhoffer (1963) 
directly attacked the problem of modifying 
Susceptibility by using a variety of deepening 
techniques frequently used by professionals in 
the field, They even spent time in psycho- 
therapeutic attitude discussions but found that 
their efforts were rewarded by only slight 
Sains as revealed by alternate forms of the 
Stanford Hypnotic Susceptibility Scale 
(SHSS). London (1967) concluded, “In gen- 
eral, Scale results are so consistent from ses- 
sion to session as to suggest that susceptibil- 
ity is a very stable personality characteristic 
extremely resistant to change [p. 72].” 

Several investigators have explicitly pointed 
to parallels between hypnotic induction and 
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sensory deprivation or perceptual isolation. 
Gill and Brenman (1959) suggested that all 
standard techniques of hypnotic induction 
involve extensive restrictions of meaningful 
sensory input in the visual, auditory, and 
kinesthetic modalities. These investigators 
concluded that conditions inherent in the 
standard hypnotic induction parallel the con- 
ditions imposed on Ss in the experiments with 
perceptual isolation initially reported by Bex- 
ton, Heron, and Scott (1954), Because of 
these apparent similarities between perceptual 
isolation and hypnotic induction, Gill and 
Brenman (1959) suggested that it would be 
valuable to study the effects of perceptual iso- 
lation on the hypnotizability of Ss. Also, 
Pefia (1962) noted that the methods of achiev- 
ing isolation or deprivation in an experimental 
setting are strikingly similar to the techniques 
used during the standard induction of hyp- 
nosis and that responses of isolated Ss resem- 
ble the behavior observed in hypnotic Ss, with 
evidence of heightened suggestibility being 
common to both situations, 

Levitt, Brady, Ottinger, and Hinesley 
(1962) investigated the effect of experiment- 
ally induced sensory deprivation or percep- 
tual isolation upon hypnotic susceptibility, but 
found no enhancement of susceptibility as a 
result of 4 hr. of sensory deprivation. Their 
Ss consisted of three female nursing students, 
refractory to hypnosis under standard condi- 
tions of induction, who, following the 4 hr, of 
deprivation, showed no enhancement of sus- 
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ceptibility. The reactions of two of the Ss 
even seemed to be antagonistic to the relaxa- 
tion and concentration required during hyp- 
notic induction. 

After 14 and 3 hr. of perceptual isolation, 
however, Peña (1962) produced modest but 
significant enhancement in the susceptibility 
of two groups of prison inmates. He con- 
cluded that the effect was associated not only 
with perceptual isolation but also with the 
practice, implied suggestions, and cues in- 
corporated in the procedure. 

From his psychophysiological model of in- 
trapsychic processes, Reyher (1964) reasons 
that hypnosis, sleep, sensory deprivation phe- 
nomena, and certain types of psychopathology 
are manifestations of the ascendence of lower 
levels of neural integration in the organization 
of brain functions and the regulation of be- 
havior. Adaptive, integrated behavior is seen 
as a function of the dominance of the highest 
level of neuronal integration in the organiza- 
tion of brain functions. Any condition which 
eliminates or homogenizes sensory input pre- 
vents adaptive behavior, and the level of 
neuronal integration mediating adaptive be- 
havior is replaced by a phylogenetically older 
and lower level of neuronal integration which 
becomes dominant in the overall organization 
of brain functions, producing an alteration of 
awareness. Intense anxiety, as a response to 
a hypnotic induction, or a defense against 
such anxiety engages adaptive behavior and 
prevents this reorganization from taking 
place. Reyher (1964, p. 117) also noted that 
sensory deprivation brings about a similar re- 
organization of brain mechanisms. 

This psychophysiological model suggests 
that S should not be removed from sensory 
deprivation prior to a hypnotic induction, as 
removal reactivates adaptive behavior and its 
supporting level of neuronal integration. 
Rather, sensory deprivation should continue 
until S manifests disturbed ego functioning, 
which indicates, in this frame of reference, a 
putative change in the organization of brain 
mechanisms. Only resistant Ss were used in 
the present experiment, and the design in- 
cluded a follow-up hypnotic induction to as- 
sess the stability of any change in hypnotiza- 
bility. 
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METHOD 
Subjects 


There were 10 volunteer college-age females in the 
experimental group who scored below 4 points on 
the Stanford Hypnotic Susceptibility Scale (SHSS), 
Form A. These resistant Ss were asked to return for 
two subsequent sessions and were told in advance 
that the second session involved 4-6 hr. of sensory 
deprivation after which they would again be hypno- 
tized. 


Materials and Experimental Setting 


During the experimental session, each S was 
placed in a 12 X 12 ft. light- and sound-attenuated 
sensory deprivation cubicle. While reclining on a bed 
within the cubicle, S wore a headset consisting of a 
microphone and earphones which were part of an 
integrated communication system.? The remaining 
components of the system included a voice-activated 
tape recorder and an amplifier. The Ss were con- 
nected to a Grass No. 5 Polygraph (Yellow Springs 
Equipment Company) via electrodes secured to the 
first and second fingers of their left hands, which 
provided E with a continuous record of GSR ac- 
tivity and basal skin resistance. Pinned to the bed 
on which S reclined was a switch which activated a 
light in the adjacent equipment room, An auto- 
kinetic light served as a fixation object for S during 
the hypnotic induction which was live rather than 
recorded. 


Instructions 


Prior to S's entry into the deprivation cubicle, she 
was told that the session was designed to test the 
effect of sensory deprivation on her ability to be 
hypnotized, She was specifically instructed as follows: 


Please tell me whatever you can about your 
reactions to this session. Your experiences while in 
sensory deprivation and subsequently those you 
have while being hypnotized will be helpful in 
understanding the results of the study. Whatever 
you choose to talk about will, of course, be kept 
strictly confidential. Also, try not to sleep during 
the session, but if you should doze off, don't feel 
guilty about it. At no time will I respond to 
what you say or answer any questions you may 
ask, or communicate with you in any way, unless 
you depress the switch on the bed. If you depress 


?' The microphone and earphones were manufac- 
tured by Midland Electronics under the trade name 
of Dynamic Headsets, model 21-236. The tape te 
corder was a Uher four-speed Report-L Recorder 
equipped with a voice-activating F411 Akustomat. 
Monitor and amplifier were built by Main Elec- 
tronics under model No. SA1400, which included 
balance and volume controls to adjust recording 
levels. The integrated system was developed jointly 
by the Department of Psychology at Michigan State 
University and Main Electronics of Lansing, Michi- 
gan. 
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the switch, I shall read three stock market quota- 
tions to you. However, I shall always be in the 
adjacent room listening to what you say. I shall 
begin the hypnotic induction over the earphones at 
some point during the next 6 hours. After you are 
hypnotized, I shall enter the room to continue 
with some tasks similar to those we did the last 
time. Do you have any questions? 


Procedure 


While S was in the deprivation cubicle, E re- 
mained in the adjoining equipment room. monitoring 
S's spontaneous comments and reactions with the 
tape recorder and the polygraph. The E turned on 
the autokinetic light and began a hypnotic induction 
when S started to show sensory deprivation phe- 
nomena according to criteria developed from previous 
research (Sanders, 1967) with susceptible Ss. These 
criteria were: 


1. Craving for stimulation as indicated by (a) 
sharp increase in frequency of pressing button to 
hear stock market quotations, (b) statements denot- 
ing the wish for something to do or something to 
see, (c) self-stimulation, including singing, counting, 
tactile stimulation, etc, (d) comments on bodily 
sensations and alterations in body image, (e) asking 
E direct questions or demanding E respond to S, 
(f) statements of wanting to be hypnotized or to 
hear E's voice as a signal the experiment is over, or 
wanting to quit, and (g) boredom and comments 
about the slow passage of time. 

2. Emotional lability as indicated by (a) evidence 
of restlessness, fear, anxiety, or anger, (b) crying, 
weeping, or statements of depression, (c) marked 
fluctuation in mood (e.g, change from anxiety to 
feelings of well-being), (d) somatic disturbance, and 
(e) feelings of isolation and impending doom. 

3. Impaired secondary process and reality testing 
as indicated by (a) marked impairment of the logical 
content in thought and speech, (b) hypnogogic 
imagery characterized as waking dreams, (c) halluci- 
matory experiences, somatic delusions, bodily dis- 
orientation, (d) personalistic interpretations, (e) in- 
crease in fantasy involvement, and (f) blank periods 
indicating inability to concentrate. 

4. Intensification of the relationship between E and 
S as indicated by (a) statement of S's feelings about 
E, (b) statements of personal problems and conflicts 
to E, (c) personal questions from S to E, (d) spec- 
ulation about E, and (e) statements indicating need 
lor contact with E, 


The timing of the induction was the key point in 
the procedure and was largely a matter of clinical 
Judgment based upon the above criteria. Any doubt 
on the part of E as to the extent or intensity of the 
Phenomena reported resulted in a delay of the induc- 
tion. Five of the Ss remained in sensory deprivation 
for the allotted 6 hr. because sensory deprivation 
Phenomena were either absent or minimal. 

en S depressed a button to indicate eye closure, 
E usd the deepening procedure suggested in the 
SHSS, Form B, until satisfied that he could enter 
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TABLE 1 
ANALYSIS OF VARIANCE SUMMARY FOR THE 
STANFORD HYPNOTIC SUSCEPTIBILITY 
SCALE SCORES 

Source of variation | df Mn F 
Conditions (A) 1 66.15 23:71% 

Error 18 2.79 
Trials (B) 2 44.45 (025^ 
AXB 2 24.91 3,50* 

Error 36 741 

Total 59 

* p <.05. 

* p S008, 


the cubicle and continue the administration of the 
scale. Except for minor wording changes to account 
for S's reclining position, the scale proved applicable 
for administration in the deprivation cubicle itself. 


Follow-up Session 


Approximately 1 wk. following the experimental 
session, each S returned and again received the 
SHSS, Form A, under standard conditions, All Ss in 
the experimental group were then interviewed in 
an exploratory attempt to identify those factors 
responsible for alteration or stabilization of suscepti- 
bility. 


Control Group 


A control group was included to test for the effects 
of repeated hypnosis upon the susceptibility of 
resistant Ss and to determine the demand char- 
acteristics of the experimental procedure, minus 
sensory deprivation. Ten additional Ss were selected 
from the same volunteer population and were as- 
signed to the resistant control group. Methods of 
screening and criteria for inclusion in the sample 
were identical to those of the experimental group. 

The Ss were told that they were being asked to 
participate in an experiment measuring the effect 
of familiarity of surroundings on hypnotic suscepti- 
bility and that two sessions beyond the screening 
interview were involved. The demand characteristics 
of these instructions favored an increase in suscepti- 
bility. During the session corresponding to the 
sensory deprivation session of the experimental 
group, Ss returned to the Jab and sat in the depriva- 
tion cubicle, now fully lighted and equipped with 
magazines, radio, and comfortable chairs. The Ss 
were free to study, read, sleep, or talk to E. More 
than one S was in the cubicle at a time, except for 
two Ss who participated singly because of scheduling 
problems. For these two Ss, E remained in the lab, 
thus preventing any social deprivation. 

After 44 hr. (the mean deprivation time of the 
experimental group), hypnotic induction was begun, 
using the same procedures as were used for the 
experimental group. The Ss in the control group also 
returned for a final session to assess their final 
level of susceptibility using the SHSS, Form A. 
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Fic. 1. Mean SHSS scores for experimental and 
control groups. 


Dependent Variables 


Four indexes of sensory deprivation were re- 
corded for the experimental group: (a) time in 
deprivation—the time interval beginning with lights 
out and ending with the initiation of the hypnotic 
induction; (b) verbalization—the time S engaged 
in any type of verbalization was expressed as a 
percentage of the total time in deprivation; (c) 
stimulus hunger—stimulus-seeking behavior was ob- 
tained by counting the number of requests for 
stock market quotations; (d) arousal—the GSR 
served as a measure of S's arousal both during 
sensory deprivation and during hypnotic induction. 
GSR activation was computed by dividing the 
number of discriminable GSRs for the period of con- 
finement and period of induction by the time S 
remained in each condition. A discriminable GSR 
was operationally defined as an inflection of 1,000 
ohms followed by a decline of 1,000 ohms. In 
addition to these objective measures, E recorded 
those cues which he used as signals to initiate 
hypnotic induction. This was done prior to E's 
entry into the deprivation cubicle for the administra- 
tion of the SHSS, Form B, and therefore was in- 


dependent of S’s subsequent enhancement of suscepti- 
bility. 
Scoring 

To determine the reliability of E’s administration 
and scoring of the SHSS, 10 administrations of the 
scale were observed and scored by an independent 
rater. A rank-order correlation of .98 was obtained 
between E's scoring and that of the rater. Differ- 
ences in S's SHSS scores between initial and final 
sessions were defined as enhancement scores (E 
Scores), Previous research (Hilgard, 1965) has 
shown that no correction for Practice effects is 


necessary when taking repeated measures of hyp- 
notic susceptibility on the SHSS. 


RESULTS 


The SHSS scores of the experimental and 
control groups were subjected to an analysis 
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of variance (Edwards, 1960). Table 1 sum- 
marizes the results of this analysis. 

The F ratio between conditions of ad- 
ministration (23.71) was significant at the 
.005 level, indicating that the average per- 
formance over the three administrations of 
the SHSS scale was significantly different for 
the two groups. The F ratio for trials (6.25) 
was significant at the .005 level, indicating 
that the three administrations of the SHSS 
averaged over the two groups also differed 
significantly. The Trials X Conditions inter- 
action was significant at the .05 level, indicat- 
ing that the increases in susceptibility did not 
take the same form for the experimental and 
control groups. A plot of the mean increases 
in susceptibility for experimental and control 
groups is presented in Figure 1. 

The means of the three administrations 
of the SHSS to the experimental group were 
compared using Duncan's new multiple-range 
test. Significant differences in the mean SHSS 
scores of the experimental group were ob- 
tained between Sessions 1 and 2 and between 
Sessions 1 and 3. The difference between the 
second and third sessions for both the experi- 
mental and control groups was not significant. 
The mean enhancement score of 4.80 for the 
experimental group differed — significantly 
(£ test) from the mean enhancement score 
of .90 for the control group at the .005 level. 

The percentage of Ss passing each of the 
items of the SHSS for the experimental and 
control groups is presented in Figures 2 and 3, 
respectively. 

Although relatively uniform increases in 
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Fic. 2. Percentages of resistant experimental group 
passing individual items of the SHSS, Form A, for 
the initial and final sessions. 
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the percentage of Ss passing individual items 
were obtained, the shift in percentages after 
exposure to sensory deprivation was most pro- 
nounced for the items of finger lock, verbal 
inhibition, and posthypnotic suggestion. 

An exploratory attempt was made to deter- 
mine what sensory deprivation behaviors were 
predictive of enhancement of susceptibility. 
Spearman rank-order correlations of —.18, .40, 
56, —.08, and .19 were obtained between 
enhancement scores and time in deprivation, 
verbalization, stimulus hunger, arousal during 
deprivation, and arousal during induction, 
respectively. None of these correlations 
reached statistical significance. Following the 
collection of data from the experimental 
group, E reviewed the tape recordings of the 
deprivation sessions and ranked the experi- 
mental Ss with respect to the intensity of 
their reaction to sensory deprivation. This 
ranking was not independent of E's knowledge 
of S’s hypnotic enhancement. A nonsignificant 
correlation of .54 was obtained between E's 
ranking and S’s enhancement scores. 


DISCUSSION 


Three alternate interpretations of the re- 
sults should be considered. At the time sen- 
sory deprivation symptoms were elicited, S 
might already have been in an altered state 
similar to the putative one which is regnant 
during hypnosis. If this were the case, S 
might have been highly suggestible and the 
hypnotic induction procedure was actually 
extraneous. Or, the distress of sensory de- 
privation enhanced S's response to any kind 
of instructions. Finally, the demand char- 
acteristics of sensory deprivation were greater 
than those of the control condition. The first 
two alternative interpretations are inconsistent 
With research reported by Zuckerman, Al- 
bright, Marks, and Miller (1962). After 6 
hr. of deprivation they found that their Ss 
did not increase in suggestibility when tested 
ior their detection threshold of a light. In 
the same study they found that their Ss 
manifested an inhibition of productivity when 
tested for free association and word naming 
after 3 and 6 hr. of sensory deprivation. 
The Ss were under sensory deprivation at 
the time of testing, and many symptoms had 
been produced. The problem posed by a dif- 
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Fic. 3. Percentage of resistant control group pas- 
sing individual items of the SHSS, Form A, for the 
initial and final sessions. 


ferential in demand characteristics of the ex- 
perimental and control conditions was directly. 
tested by Zuckerman and Cohen (1964) 
when they increased progressively the demand 
characteristics for four groups using differ- 
ent degrees of suggestion and a placebo effect. 
The results were negative despite the use of 
a variety of dependent variables. 

Tt should be noted also that none of these 
alternate interpretations readily explain the 
retention of the enhanced level of suscepti- 
bility 1 wk. later. According to the theory 
under investigation, the brain mechanisms 
underlying the enhanced susceptibility of the 
S have become associated with the induction 
procedure. In rapid inductions, a discrete cue 
can be used to reactivate instantaneously the 
brain mechanisms in question (Reyher, 1964). 

In addition to its immediate relevance 
to the present investigation, the study by 
Zuckerman and Cohen (1964) is of general 
interest. Their results show that scores on the 
dependent variable are not a monotonic func- 
tion of the number or saliency of cues relating 
to the hypotheses and expectations of the 
investigator. Although they did not draw this 
conclusion, it would seem that once S sizes 
up the relationship between the experimental 
design and the desired outcome of the re- 
search, his effort to please E is a constant 
value related to stable personality character- 
istics. Further improvement on the dependent 
variable can be effected only by increasing 
motivation through ego-involving instructions 
and procedures. 

The results obtained here were in marked 
contrast to those reported by Levitt et al. 
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(1962), who found no increase in suscepti- 
bility as a result of exposing refractory Ss to 
sensory deprivation. However, a comparison 
of the procedures of the two studies showed 
marked differences. In contrast to Levitt’s 
procedure, Ss in the present study were en- 
couraged to report their subjective reactions 
to deprivation. Thus, despite the clinical 
nature of the decision to initiate induction, it 
was possible to time the onset of the hypnotic 
induction with the emergence of sensory 
deprivation phenomena. 

In the present study visual stimulation was 
reduced to a minimal level while auditory 
stimulation was homogenized at near normal 
levels. In Pefia’s study both auditory and 
visual stimulation were homogenized at near 
normal levels. The most significant difference 
between the studies, however, was that Pefia’s 
Ss were removed from isolation prior to 
hypnotic induction. Because of the typically 
subtle and often transitory effects of sensory 
perceptual isolation, sudden removal from 
isolation is likely to have reduced the effect 
of his procedures. According to the theory 
under investigation here, the removal of an S 
from isolation should be accompanied by a 
reinstatement of higher levels of neuronal 
integration antagonistic to the development 
of successful hypnotic induction. 

In contrast to Pefia’s control group, the 
control Ss in the present study produced no 
significant increases in susceptibility as a 
result of either repeated hypnotic inductions 
or the implied suggestions of the experimental 
design. In Pefia’s study, the group receiving 
14 hr. of isolation did not differ significantly 
in enhancement from the gains of the control 
group. Therefore, the only clear effect that 
can be attributed to perceptual isolation oc- 
curred with Pefia’s 3-hr. group, and no at- 
tempt was made to determine the permanence 
of the gains in susceptibility. 

The positive results of the present study 
are consistent with the theories of both Gill 
and Brenman (1959) and Reyher (1964). 
Gill and Brenman stated that hypnosis results 
from a regression of autonomous ego func- 
tioning initiated by sensory-motor deprivation 
occurring in a particular interpersonal rela- 
tionship. The conditions imposed on Ss in 
sensory deprivation reproduce one of the sig- 
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nificant characteristics of standard hypnotic 
induction—the restriction of sensory input. 
Since these conditions initiate regressive 
movement in behavior, exposure to sensory 
deprivation should enhance hypnotic suscepti- 
bility by facilitating the regression in ego 
functioning necessary for successful hypnosis. 

Reyher (1964) contends hypnosis is pos- 
sible only after lower levels of neuronal inte- 
gration become dominant in the organization 
of brain functioning and the regulation of 
behavior. The ascendance of a lower level of 
neuronal integration is brought about in a 
susceptible § because he has abrogated 
an analytic-synthetic and adaptive attitude 
toward sensory input. He has accepted the 
hypnotist’s assumption of this role. He listens 
passively to him. However, anxiety generated 
by the hypnotic induction because of intra- 
psychic conflict is sufficient to prevent this 
reorganization, as anxiety generates defense 
and other forms of adaptive behavior. What- 
ever the cause of resistance, people who are 
initially nonhypnotizable should become sus- 
ceptible if a procedure such as sensory depri- 
vation is used to bring about the necessary 
reorganization of brain mechanisms before 
standard hypnotic techniques are employed. 
Once an S experiences the state of hypnosis, 
the accompanying organization of brain mech- 
anisms becomes associated with or conditioned 
to the cues inherent in the induction pro- 
cedure, thus facilitating future inductions, 
maintaining a stable level of susceptibility, 
and permitting rapid hypnotic induction by 
posthypnotic signal. 

The results of Pefia’s research and the pro- 
cedures of the present investigation represent 
a promising beginning. More research is 
needed to clarify the relationship between 
specific sensory deprivation symptoms and a 
readiness for hypnosis. Though none of the 
four behavioral indicators of sensory depriva- 
tion was significantly related to enhancement 
scores, three of them covaried in the expected 
direction. 

Although the theory cannot be tested di- 
rectly for both technical and ethical reasons, 
it does have heuristic value. It consists of an 
integrated set of postulates about neuro- 
physiological and psychodynamic processes 
which specify necessary and sufficient condi- 
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tions for hypnosis. The available data, in- 
cluding those reported here, on hypnosis, 
regression, and the like, are consistent with 
the theory; moreover, major features of neu- 
rotic and schizophrenic reactions and a 
variety of other manifestations of psycho- 
pathology are consistent with it. 
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Forty-four snake and spider phobic Ss, selected from a large pool of under- 
graduates were exposed to either (a) a form of systematic desensitization 
treatment, (b) a technique, called T-scope therapy, which embodies most of 
the expectancy-manipulating features of desensitization, but does not contain 
the technical elements of the procedure (ie., relaxation, visualization, and the 
construction of an anxiety hierarchy), (c) T-scope therapy, presented as an 
“incomplete” and probably ineffective form of treatment, or (d) no treat- 
ment. There were no significant differences (on self-rating, runway, or inter- 
view measures) between the effects of the systematic desensitization procedure 
and T-scope therapy, although Ss receiving either of these treatments im- 
proved significantly more than those who received no treatment or T-scope 


therapy administered under the “low-expectancy” condition. 


In general, explanations concerning the 
effectiveness of systematic desensitization 
therapy have focused on noncognitive factors 
(e.g., Wolpe, 1958; Wolpe & Lazarus, 1967), 
although the importance of cognitive factors 
has sometimes been implied in the writings 
of behavior therapists themselves. Lang 
(1966) pointed out that “in general, the sub- 
ject has a better chance of changing his be- 
havior if he is given meaningful feedback and 
encouraged to ‘think about’ alternatives . . . 
[p. 376].” Efran and Marcia (1967) have 
proposed a cognitive-expectancy model as an 
alternative way of interpreting the effects of 
systematic desensitization. The observed de- 
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creases in avoidant behavior are attributed 
mainly to changes in patients! expectations 
concerning the phobic object rather than to 
the conditioning of “antagonistic responses.” 
From this point of view, patients come into 
treatment anticipating that contact with the 
phobic object will cause reactions of fear and 
disgust. They then go through a sensible, 
structured, “scientific” procedure that tends 
to convince them that they will be able to 
approach the phobic object with greater 
equanimity. As they become convinced, they 
may alter their behavior (i.e., approach the 
phobic object), consistent with their newly 
acquired expectancies. Hence treatment has 
been successful as a function of cognitive 
change rather than counterconditioning. 

The present study had two objectives: 
(z) to demonstrate that a technique (T- 
scope therapy). embodying the expectancy- 
manipulating features of systematic desensi- 
tization, but without the technical elements of 
the procedure (ie, relaxation, construction 
of anxiety hierarchy, and visualization), can 
produce significant improvement, and (b) to 
Compare the results of this technique with 
those obtained with systematic desensitization. 


METHOD 


Because of difficulties in obtaining therapists and 
high-fear Ss, this study was run in two parts: 57 
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scope therapy the first year and systematic desensi- 
tization the second, However, procedures for obtain- 
ing Ss and for measuring outcome were essentially 
the same, so that the study is reported as a single 
project. 


Subject Selection and Treatment Groups 


Subject selection. Approximately 2,800 introductory 
psychology students were administered a modified 
(15-item) form of the Fear Survey Schedule (FSS, 
Geer, 1965). Those who checked “terror” or “very 
much afraid” for either snakes or spiders and who 
were interested in participating were given a runway 
approach test (Lang, 1964) and an interview. The 
runway test, which was later readministered as a 
posttreatment measure, required S, accompanied by 
an E, to approach as closely as possible a preserved 
specimen of either a snake or a spider, which had 
been placed on a table at the end of a 15-ft. runway. 
Those Ss who could touch the specimen were ex- 
cused from the study. (Over one-third of the poten- 
tial Ss were excused on this basis—a fact which 
leaves the validity of FSS ratings of “terror” and 
“very much afraid” somewhat in question.) Forty- 
four Ss (12 males and 32 females) who were (a) not 
presently in treatment, (b) sufficiently fearful (ac- 
cording to the runway measure), and (c) willing to 
participate in the project were assigned to a treat- 
ment or control group. The five groups planned 
were: T-scope therapy (high expectancy and low 
expectancy), systematic desensitization (high expect- 
ancy and low expectancy), and control. 

Fourteen graduate students and one faculty mem- 
ber in clinical or counseling psychology (10 males 
and 4 females) served as therapists. Except for one, 
they administered only T-scope therapy or systematic 
desensitization. 

T-scope therapy. The Ss were met by their thera- 
pist and a white-coated GSR technician who manned 
a polygraph console during the eight 20-min. treat- 
ment sessions. During the first session, the therapist 
explained that treatment would be based upon two 
“well-established” psychological principles: the irra- 
tional and, hence, unconscious nature of phobias, and 
the suppressibility of a response by means of a 
mildly unpleasant stimulus. The S was told that 
phobic stimuli would be presented in a tachistoscope 
at a speed “too fast for the conscious mind to per- 
ceive,” yet perceptible to the unconscious. These 
“subliminal stimuli" would evoke unconscious phobic 
Tesponses which would be followed by a mildly 
Painful electric shock. (Threshold values for pain 


3In the T-scope therapy portion, Ss were also 
given a 10-point “fear thermometer” (Lang, 1964), 
and their GSR responses to slides of neutral and 
phobic objects were obtained. Technical difficulties 
Precluded analysis of the latter data and results from 
the fear thermometer are suspect because of the 
lack of independence from the runway and its sus- 
ceptibility to situational demands (Efran & Marcia, 
1967). Hence, these two measures were not employed 
in the desensitization portion of the study. 
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were established at the first session and subsequent 
values of electric shock were kept just below this 
threshold.) For each session, S—with GSR, EKG, 
and stimulation electrodes attached—sat on the edge 
of a chair, and looked into a tachistoscope. Although 
he believed he was watching subliminal stimuli, some 
of which were pictures of the phobic object, he was 
actually looking at blank cards. Each session con- 
sisted of 100 presentations, 16 of which were shocked, 
based on a fixed-random sequence with a mean of 6. 
At the end of each session, the GSR technician 
showed S a polygraph printout to illustrate his 
"rate of improvement." The records, which had been 
prepared in advance, indicated decreasing physio- 
logical reactions (increasing improvement) from the 
first to the eighth session with plateaus at the fifth 
and sixth sessions. 

In order to create a low-expectancy group (Lo T), 
one group of T-scope therapy Ss was told that a 
"crucial element"—the phobic stimulus on the T 
scope—would be missing; therefore, the phobic re- 
sponse would not be evoked, and neither condition- 
ing nor improvement could be expected. This group 
received no polygraph feedback. High-expectancy 
T-scope Ss (Hi T) were simply administered the 
treatment as aforementioned. 

Ten Ss (four males and six females) composed the 
Hi T group with 7 Ss (three males and four females) 
in the Lo T group.* 

Systematic desensitization (DS). Each therapist 
was trained in standard systematic desensitization 
(DS) technique (Wolpe, 1958) by the second author. 
The Ss constructed hierarchies from a deck of 14 
cards containing phobia-relevant scenes by choosing 
the 7 items which most closely represented his re- 
sponse to the phobic object, and which could most 
closely be incorporated into an equal interval scale 
of fearfulness. Hence, although all Ss began with 
the same 14 items, they were able to construct indi- 
vidualized hierarchies, Each S had two 45-min. 
practice sessions during which he constructed his 
hierarchy and was trained in deep muscle relaxation 
and visualization. 

To control the time factor, all Ss spent a full 
session on each hierarchy item. This procedure devi- 
ated somewhat from traditional desensitization, where 
each S is allowed to proceed at his own pace. It 
was assumed on the basis of other studies (Davison, 
1965; Lang, Lazovik, & Reynolds, 1965) that Ss 
might be expected to complete between two and 
three items per session, and that therefore devoting 
a session to just one item would allow more than 
sufficient time for most Ss. (This assumption turned 
out to be false in a number of instances, as indi- 
cated in the results.) An equivalence in sessions was 
thus established between T-scope therapy and sys- 
tematic desensitization; actual time spent in therapy, 
although somewhat disproportionate for the two 
groups, was weighted in favor of systematic desensi- 
tization (see Table 3). 


4'Three initial Ss in the Lo T group dropped out 
claiming pressure of final exams. 
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TABLE 1 
SUMMARY OF IMPROVEMENT DATA 

Initial runway Improvement (d) 3 Inter- 
Group 7 T E E Touch | Pickup em N 
Desensitization (N — 16) 2.75 2.54 1.94 2.14 5 3 adl a 
+2 1 
High expectancy T scope (N = 9) 2.78 1.71 178 1.71 2 1 m 1 
42 2 
Low expectancy T scope (N — 7) 2.86 244 — 14 1.77 1 i id : 
+2 i ; 

- .50. 2.28 0 3.07 0 0 0 
Control (N — 12) 2.5 A 6 

+2 


An attempt was made to create a low-expectancy 
desensitization group by telling half of the desensi- 
tization Ss in their initial interview that they would 
be getting only the first Aalf of treatment and that 
the treatment could not work without both halves. 
However, posttreatment interviews showed that Ss 
promptly forgot or ignored this information, due 
partly to the competition of all the other new 
information to which they were exposed, and partly 
to their therapists being kept “blind,” and, hence, 
unable to reinforce the expectancy. (This is in con- 
trast to the successful expectancy manipulation for 
the T-scope groups where the therapist induced a 
low expectancy which was consistently reinforced 
by the therapeutic procedure itself; that is, S saw 
only a flash of light in the tachistoscope.) This group 
was combined with the high-expectancy group and 
the two collectively were referred to as systematic 
desensitization (DS), Since the low-expectancy DS 
Ss showed slightly greater improvement than the 
so-called high-expectancy desensitization Ss, com- 
bining the groups slightly raised the improvement 
scores of DS Ss, There were 19 Ss in this condition, 
2 males and 17 females. 

Control (C). Fourteen control Ss (3 males and 11 
females) were simply tested at the beginning and at 
the end of the sessions of the particular treatment 
group with whom they were paired. 

Tn spite of attempts to randomly assign Ss to the 
different treatment and control groups, when initial 
runway scores were compared, the four groups 
(Hi T, Lo T, SD, and C) were dissimilar. To equate 
them, one Hi T female, three DS females, and two 
C females were dropped from the data analysis on 
the basis of pretreatment scores. 

Three aspects of the runway situation indicate 
improvement: post- minus pretreatment position on 
the runway; difference in number of Ss who touch 
the phobic object before and after treatment; and 
difference in number of Ss who pick up the object 
before and after treatment. 

There are two ways of calculating change in run- 
way position: ratio scores (Lang, 1964) based on 


percentage of distance covered and conventional dif- 
ference scores (pretreatment minus posttreatment). 
The data were analyzed both ways with nearly 
identical results and thus only conventional scores 
are reported.5 All runway measures were adminis- 
tered by Es having no knowledge of Ss’ treatment 
groups. 

During the posttreatment interviews, conducted by 
the authors and an advanced graduate student, each 
S rated his improvement along a 5-point scale: 
—2=much worse, —1=a little worse, 0= no 
change, +1 — a little better, +2 = much better. These 
ratings were used as an additional measure of im- 
provement. (The runway test and interview were 
administered by different individuals.) 


RESULTS 
Runway Approach Measure 


Table 1 summarizes the improvement data 
for the four groups. One-way analyses of 
variance for unequal Ns were used to deter- 
mine equivalence of runway starting place 
and significance of difference on posttreatment 
Tunway position. While starting place was 
nearly the same for all four groups (F = .048, 
df = 3/40), the groups differed significantly 
at the conclusion of treatment (F = 3.75, 
df = 3/40, p < .05). 

The Mann-Whitney U was applied to 
ranked change scores between initial and final 
runway positions for the four groups. Results 
indicated that there was no significant differ- 
ence between the DS and the Hi T groups, 
that each group improved significantly more 
than controls (both #’s <.01), that each 


5For the runway measures, Pearson r= 84 
N=44, 
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improved significantly more than the Lo T 
group (DS, p< .05; Hi T, p< .06), and 
that there was no significant difference 
between the Lo T group and controls. 

Another way of looking at the runway data 
is in terms of improvement versus nonim- 
provement. Any positive change after treat- 
ment was considered as improvement; nega- 
tive or no change was considered as “no 
improvement." These data were analyzed by 
means of Fisher exact probability tests and 
are presented in Table 2. Findings here were 
similar to those in the ranked change score 
analysis except that neither the DS nor Hi T 
group was significantly different from the 
Lo T group. 

A third way of looking at the runway data 
is in terms of how many Ss could touch or 
pick up the phobic object at the conclusion 
of treatment who were not able to do so at 
the beginning (see Table 1). The only sig- 
nificant finding here was that the DS group 
had more Ss touching the phobic object at the 
conclusion of treatment than did the con- 
trol group (Fisher exact probability — .04). 
Again, there were no significant differences 
between the DS and Hi T groups on these 
measures. 


Interview Ratings 

The Ss’ ratings of change (from +2 to —2) 
appear in Table 1. No S indicated that he got 
worse. Fisher exact probabilities on these 
data for improvement (+1 plus +2) versus 
no improvement (0) indicated that there was 
no difference between the DS and Hi T condi- 
tions, that both of these groups showed sig- 
nificantly more improvement than controls, 
and, in addition, that DS Ss showed signifi- 
cantly more improvement than Lo T Ss. 


Adequacy of Systematic Desensitization 
Treatment 

Strict application of desensitization tech- 
niques would require that a scene be pre- 
sented without S’s signalling anxiety before 
moving on to the next hierarchy scene. How- 
ever, in order to make all treatment groups 
equivalent in number of sessions, DS Ss were 
€xposed to all seven hierarchy scenes, one per 
Session, whether or not the previous scene 
had been successfully desensitized. More often 
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TABLE 2 


Comparison OF Groups FOR IMPROVEMENT VERSUS 
No Improvement ON RUNWAY MEASURE 


Tao Neues Comparison 
Group | prove- | prove- 

ment | ment | aim | rom ih 
DS 31 5 +33 19 < 01 
HiT 7 2 16 < 01 
LoT 3 4 ns 
Cc 2 10 


Note.—DS = systematic desensitization; Hi T = high ex- 
pectancy T scope; Lo T = low expectancy T scope; C = 
control. 

* Fisher exact probability (levels in cells), 


than initially anticipated, Ss were presented 
with a scene without having been fully desen- 
sitized to a previous scene. If this decreased 
the effectiveness of desensitization, there 
should be a relationship between the methodo- 
logical adequacy of treatment and Ss’ im- 
provement. Those DS Ss should show greater 
improvement who (a) successfully completed 
more sessions; or (b) completed their last 
successful session higher in the hierarchy; or 
(c) had a greater “amount” of completion 
across all sessions. To assess this, the runway 
measure of improvement was correlated with 
measures of these three variables. Extent of 
methodological adequacy (Variable c) was de- 
termined by assigning a O to a session in 
which S experienced anxiety with every 
visualization of the scene, a 1 if the session 
ended with an anxiety-free visualization of 
2 sec., a 2 for a 3-sec. visualization, a 3 for 
a 5-sec. visualization, a 4 for a 10-sec. visuali- 
zation, and a 5 for a 15-sec. visualization. 
Scores, averaged across sessions, ranged from 
1.57 to the maximum possible of 5.00, with 
a mean of 3.85. Correlations of these three 
measures with runway improvement revealed 
no relationship between methodological ade- 
quacy of treatment and improvement, 

An additional analysis of the relationship 
between methodological adequacy and im- 
provement was made by splitting the DS 
group into subgroups on the basis of the 
proportion of sessions an S had successfully 
completed. If more than 50% of his sessions 
ended with his experiencing no anxiety during 
a 15-sec. visualization, he was put into the 
high group; with fewer than 50% of such 
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TABLE 3 


Comparisons OF IMPROVEMENT IN DS, Hi T, AND" 
Previous DESENSITIZATION STUDIES 


Approxi- 


i Change 

Group "in treat Gang) 1 
DS 300 min. | .39 
HiT 190 min. | .33 
Lang (1964) 720 min. | .34 


Lang, Lazovik, & Reynolds (1965) | 720 min. | .27 


Note.—DS = systematic 
expectancy T scope, 
a These figures include training sessions. 


desensitization; HiT = high 


sessions, he was assigned to the low group. 
The Mann-Whitney U was performed com- 
paring these groups with each other and with 
the controls. The high group improved sig- 
nificantly more (p< .025) when compared 
with controls than did the low group; how- 
ever, the difference between high and low 
groups was not significant. 

Table 3 presents improvement in terms of 
ratio scores (Lang, 1964) for the present 
study, Lang’s 1964 study, and the Lang, 
Lazovik, and Reynolds 1965 study, permitting 
comparison of the results of the present study 
with those of previous studies. It is apparent 
from these figures that the effects of desensi- 
tization in the current study were equal, or 
superior, to those in the previous studies cited, 
and that they were accomplished in a shorter 
time. The same was true, of course, for Hi T 
therapy. 


Discussion 


The first issue posed by this study, the 
therapeutic effectiveness of an expectancy- 
based therapy containing none of the theo- 
retically crucial elements of desensitization, 
was resolved. T-scope patients with a high 
expectancy for success improved significantly 
more than controls on all Measures, and more 
than the Lo T group on the runway measure, 
None of these Hi T patients got worse, al- 
though from a classical conditioning point of 
view, one might have expected such a result 
since if Ss happened to imagine Seeing the 
phobic object in the tachistoscope, they would 
be associating this image with anxiety aroused 
by electric shock. Classical conditioning 
theory would predict that this association 
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would enhance the phobia; certainly it would 
not predict its reduction. Either Ss did not 
imagine the phobic object or the theory is 
inapplicable in this circumstance—perhaps 
both. Clearly, this theory, upon which sys- 
tematic desensitization is based, does not 
account for the improvement found in this 
group. 

A second issue concerned the comparison 
between Hi T therapy and systematic desen- 
sitization. The difference between the effects 
of these two treatments is nonsignificant in 
every analysis. Overall, desensitization has a 
slight numerical edge; but one is certainly on 
safe scientific and statistical grounds claiming 
equivalence for the two treatments. 

Although the desensitization procedure used 
in this study deviated in form from the pro- 
cedure used clinically, the outcomes were 
comparable to those obtained in previous 
studies, Moreover, comparisons of methodo- 
logical adequacy with improvement suggest 
either that “being treated” itself is sufficient 
to produce improvement, regardless of number 
of completed hierarchy items, or, perhaps, 
that exposing Ss to items in the hierarchy for 
which they are unprepared is as efficacious as 
exposing them to only those for which they 
are “ready.” 

While this study does not demonstrate con- 
clusively that desensitization is successful 
because it manipulates patients’ expectancies, 
it certainly suggests that cognitive, as opposed 
to autonomic, factors are important compo- 
nents of the treatment. Moreover, expectancy 
manipulation, as opposed to classical condi- 
tioning, provides a theoretical rationale for 
improvement underlying T-scope therapy, sys- 
tematic desensitization, and perhaps other 
forms of behavior therapy as well. 
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CONCEPT LEARNING IN HYPERACTIVE 
AND NORMAL CHILDREN* 
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Concept learning in 65 hyperactive and 99 normal children was compared 
under three experimental conditions for two types of concepts. Both acquisi- 
tion and reversal data were obtained. Contrary to current assumptions, the 
hyperactives showed no decrements on number, as compared to naturalistic 
concepts. Comparison of the clinical and normal samples showed no significant 
group differences under a continuous reinforcement schedule, with either a 
4-sec. or an 8-sec. intertrial interval. Significant decrements in the hyperactives’ 
performance were found under partial reinforcement, and persisted on retest 
sessions occurring 2 mo. later, These results suggest an explanation of the 
hyperactives’ performance decrements in terms of attentional and motivational 
variables, rather than attributing them to global cognitive-conceptual im- 


pairments. 


Hyperactivity is a common presenting 
symptom among children seen in psychiatric 
clinics, and refers to an excessive level of 
activity which is sufficiently sustained to be- 
come a serious source of complaint. Hyper- 
activity is usually found as the main symptom 
in a syndrome which also includes some or all 
of the following: poor motor coordination, dis- 
tractibility, short attention span, and emo- 
tional instabilitly (e.g., hypersensitivity and 
low frustration tolerance), 
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Most of the information on hyperactivity 
currently available deals with questions of 
diagnosis and treatment. Little seems to be 
known of the intellectual functioning of 
hyperactive children, apart from the frequent 
observation that their school performance is 
not satisfactory (Clements & Peters, 1962; 
Laufer, Denhoff, & Riverside, 1957; Laufer, 
Denhoff, & Solomons, 1957). In one study 
(Burks, 1960), the IQs of a sample of hyper- 
active children were compared with those of 
a normal control group, but no significant 
differences were found. 

Conceptual difficulties as characteristic of 
hyperactive children have been reported by 
several authors (Burks, 1960; Clements & 
Peters, 1962; Rosenfeld & Bradley, 1948) 
without offering any supportive objective evi- 
dence. Rather, the children's poor academic 
performance is somewhat freely interpreted 
as resulting from specific learning defects in 
reading, spelling, or, in some cases, only arith- 
metic and number concepts. Burks (1960) has 
suggested that the conceptual difficulties of 
the hyperactive reflect "inefficient patterning 
and processing capabilities of the brain.’ 
Similarly, Clements and Peters (1962) refer 
to an impaired “capacity to receive, hold, 
scan, and selectively screen out stimuli in 
sequential order.” 

Generalizations of this nature, while un- 
warranted by the mere evidence of unsatis- 
factory school progress, become more under- 
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standable in light of the widely held belief 
that the hyperactive syndrome is due to some 
form of brain damage or dysfunction. Loss of 
conceptual ability or “abstract attitude” is 
of course the classical symptom reported in 
individuals known to have suffered cerebral 
lesions (Battersby, Krieger, Pollack, & 
Bender, 1953; Goldstein & Gelb, 1918). A 
critical evaluation of the literature, however, 
reveals that the evidence linking hyperactivity 
to various signs of brain damage is extremely 
inconclusive (Freibergs, 1965). Similarly, a 
preliminary investigation of the same popula- 
tion from which Ss in the present study were 
drawn suggested brain damage as only one of 
several possible causes of hyperactivity 
(Werry, Weiss, & Douglas, 1964). Most im- 
portant, the nature of the cognitive impair- 
ments, if any, should be established indepen- 
dently, by means of objective studies, what- 
ever the etiological nature of hyperactivity. 

The aim of the present study was to pro- 
vide some objective evidence on this question 
by systematically comparing the performance 
of hyperactive and normal children in a con- 
trolled learning situation involving concept 
formation tasks. Long series of different stim- 
uli were used, eliminating the possibility of 
correct responding by rote memorization of 
the correct stimuli. In order to discover the 
correct concept, Ss would first need the ability 
for abstraction in the literal sense, that is, 
the ability to single out as relevant particular 
dimensions or attributes of the stimulus com- 
plex. In addition, a symbolic categorization 
response would be required (e.g., the verbal 
label “flower”) which would help in choosing 
the correct pictures despite considerable varia- 
tion in their physical appearance. Finally, the 
task required the ability to modify responses 
on the basis of the informative feedback pro- 
vided by concrete reinforcements for correct 
responses, 

Two types of concept problems were used 
in this study, in order to verify the hypothesis 
that hyperactive children have more difficulty 
With number concepts. Variations were also 
introduced in schedule of reinforcement and 
length of intertrial interval in an attempt to 
manipulate the difficulty level of the tasks and 
increase the likelihood of detecting any deficits 
that may be present. 


METHOD 
Subjects 


Hyperactive sample. The 65 Ss included in the 
hyperactive group were selected by two staff psychi- 
atrists on the basis of criteria designed to make the 
sample as homogeneous as possible. Hyperactivity 
had to be the major presenting symptom, being 
chronic, sustained, and reported by both the parents 
and the school. Any Ss diagnosed as defective, psy- 
chotic, epileptic, or primarily neurotic were excluded, 
as were those with definite indications of brain 
damage. The EEG data and early medical history for 
part of this sample have been presented by Werry, 
Weiss, and Douglas (1964). The Ss were receiving 
neither drugs nor psychotherapy at or immediately 
preceding the time of assessment. They ranged in 
age 6-12 yr, with a mean age of 8.6. Their mean 
IQ on the WISC was 103.8, with a range of 83-127. 
All but three of the clinical Ss were boys, 

Normal sample. The 99 control Ss were students 
from schools in the Montreal area who had no indi- 
cations or known history of behavioral problems or 
emotional disturbance. The normal children ranged 
in age 6-12 yr, with a mean of 8.9. Since indi- 
vidual IQ measures were not available for these Ss, 
the Vocabulary subtest of the WISC was adminis- 
tered and used as an IQ estimate. Their mean scaled 
score on the Vocabulary subtest was 12, SD = 1.7. 
The mean Vocabulary score for the hyperactive Ss 
was 11.3, SD — 2.8. The difference between the two 
overall samples was nonsignificant on a two-tailed 
test (t = 1.80, df — 157). All but 3 of the normal 
Ss were boys. 


Apparatus 


The apparatus consisted of a 12 X 12 X 18 in. gray 
metal cabinet which allowed for presentation of two 
contiguous 21-in. square stimulus pictures on a rear- 
projection Plexiglas screen. A response key was lo- 
cated just below each stimulus. Marble rewards for 
correct responses were released through an opening 
immediately upon depression of the response key. 


Stimuli 


Four different concept problems, of a type used 
by Osler in previous studies (e.g, Osler & Weiss, 
1962), were used. Each concept was represented by 
a set of 150 unduplicated pairs of stimuli, one of 
which was always an exemplar of the concept. Two 
object concepts, “flower” and “bird,” were repre- 
sented by naturalistic colored pictures. The other two 
problem sets represented the number concept "two" 
by means of black figures of different geometric 
shapes and dots of different colors, respectively. 


Procedure 


Each S was tested individually in four separate 
sessions. Instructions were given about the nature 
of the task and rate of reinforcement to be expected 
(continuous versus intermittent). The Ss were tested 
to a criterion of 10 consecutive correct responses for 
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TABLE 1 


MEAN TIME INTERVALS IN DAys SEPARATING 
DIFFERENT TEST SESSIONS FOR EACH 
EXPERIMENTAL SUBGROUP 


PRIM Test-retest interval 
Treatment 
condition " 
Pretest | Retest Dict Ripe 
Normal Ss 
Continuous 6.9 41 53.4 50.6 
Partial 6.8 41 53.0 50.3 
Delay 4.0 = = -7i 
Hyperactive Ss 
Continuous 6.6 3.2 66.4 62.5 
Partial 5.0 4.8 68.8 1 
Delay 3.0 — = met 


a maximum of 300 trials. Whenever an S reached 
criterion in less than 300 trials, he was switched to 
a reversal stage of learning without any warning. 
Exemplars of the concept just learned (e.g., flowers) 
were no longer reinforced, but reinforcements could 
be obtained by choosing the nonexemplars of the 
concept. Testing was again continued until a reversal 
criterion of 10 consecutive correct responses or for 
a maximum of 300 trials since the beginning of 
testing, 


Design 


The basic design of the study was a 2X3X2 
factorial with replication on the third variable, The 
two types of Ss, hyperactives and normal controls, 
were randomly assigned to either of three treatment 
conditions, continuous (CR), partial (PR), or delay 
(DL), with the restriction that age and average IQ 
estimate be kept comparable among the groups. 

The Ss in the CR groups received 100% reinforce- 
ment for correct choices in a two-choice discrimina- 
tion situation, with a 4-sec, intertrial (postreinforce- 
ment) interval. The PR groups received 50% rein- 
forcement on a fixed ratio schedule (every second 
correct response was reinforced), also with a 4-sec. 
intertrial interval. The Ss in the DL groups were on 
a continuous reinforcement schedule, but had an 
intertrial interval of 8 sec. 

The first two test sessions constituted the pretest 
phase of the study, with all Ss receiving the flower 
concept, followed by the two black shapes concept. 
Retest took place about 2 mo, later, with all Ss 
receiving the bird Concept, followed by the two 
colored dots concept, The average time intervals be- 
tween test sessions are given in Table 1. The hyper- 
active Ss were placed on out-patient drug therapy 
immediately after initial assessment, and were taking 
either chlorpromazine or a Placebo at the time of 
retesting (see Freibergs, Douglas, & Weiss, 1968; 
Werry, Weiss, Douglas, & Martin, 1966). h 

Additional control samples. In the basic desi 

this study, the effects of order of Bean on ie 

two concepts are confounded with Possible differences 
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in their level of difficulty. Counterbalancing at^ the 
problem level, however, was not attempted at the 
outset, because of the limitations in the number of 
hyperactive Ss available. On the basis of a prelimi- 
nary analysis of the data, two additional control 
groups were added at the CR level. This “order 
control” sample consisted of 22 normal and 10 hyper- 
active Ss who received the two pretest problems in a 
reversed order: two black shapes, followed by 
flowers, but were not tested in the retest phase. 


RESULTS 


Pretest Data 


Response measures, Performance on each 
concept was scored in terms of the number 
of trials required to reach criterion. The Ss 
who did not reach criterion within the maxi- 
mum of 300 trials were assigned an arbitrary 
score of 300. 

The distribution of solvers (Ss reaching 
criterion) and of nonsolvers for each sub- 
group is given in Table 2. There were signifi- 
cantly more solvers among the normal chil- 
dren (x? = 14.55, df = 1, p < .001) as well 
as increasing numbers of solvers in the PR, 
DL, and CR subgroups (x? — 17.62, df — 2, 
$ < .001). The two types of concept problems 
did not yield significant differences in the 
distribution of solvers and nonsolvers. 

Basic experiment. Means and standard de- 
viations for the trials to criterion measure 
are presented in Table 3. Because of hetero- 
geneity in the group variances, all statistical 
calculations were performed after a square- 
root transformation of the data. A fixed factor 
model analysis of variance was employed 


TABLE 2 


Nomper or SUBJECTS per Group WHo REACHED 
CRITERION PERFORMANCE ON 
Pretest CONCEPTS 


Normals Hyperactives 
Category Con- | p, E Conia E 
tinu-| fal | tay |tinu- | ial | lay 
Flower concept 

Solvers 26 | 22 | 25] 19 | 7 | 11 
Nonsolvers 0| 3 1 1113| 4 
E aur 26 | 25 26 | 20 | 20 | 15 

'Wo-shapes concept 
Solvers 2 25 | 20 | 23 | 20 | 10 | 13 
Nonsolvers 1| 5| 3| 0|10| 2 
Total 26 | 25 | 26 | 20 | 20 15 

EJ 
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(2X 3 X 2 factorial with repeated measure- 
ments on the last factor), with an unweighted- 
means correction for unequal cell frequencies 
(Winer, 1962, pp. 337, 342). 

The main effect for S classification was 
significant at the 5% level. Specific normal- 
hyperactive comparisons, collapsing data 
across problems, showed that this effect was 
due mainly to the hyperactives doing more 
poorly than the control children under 
partial reinforcement (F = 3.65, df = 1/126, 
p< .05). The difference between the hyper- 
active and the control children was not signifi- 
cant under continuous reinforcement, that is, 
in either the CR (F = .09, df = 1/126) or 
in the DL condition (F = 2.80, df = 1/126). 

The main effect for experimental treatment 
was significant at well beyond the 1% level 
of confidence. Specific paired comparisons 
showed that Ss receiving 100% reinforce- 
ment for correct responses performed signifi- 
cantly better than those receiving 50% rein- 
forcement (F = 45.87 for CR versus PR; 
F= 13.42 for DL versus PR, df = 1/126, 
p<.01 in both cases). Concept learning 
under PR thus is considerably more difficult 
for all Ss, and particularly difficult for the 
hyperactive Ss. No systematic effects due to 
intertrial interval were found, the CR versus 
DL comparison being nonsignificant for both 
types of Ss. There was a trend for poorer 
performance on the first problem by the 
hyperactive DL group, especially for the 
younger Ss, but this trend did not reach 
significance, 

The overall difference between the two 
problems (i.e. the two test sessions) was 
significant at the 5% level and there was also 


TABLE 3 


MEANS AND STANDARD DEVIATIONS FOR TRIALS TO 
CRITERION ON THE Two PRETEST CONCEPTS 


Normals Hyperactives 
Measure Y 

Gon- | par- | De- | Gau Par- | De- 

ous | tial | Tay | ous y 
Flower co; 
M Beg 87.6 | 162.6 | 98.0 |122.3 | 221.4 169.3 
T2 61.8 | 86.1 | 92.0 | 87.7 | 113.7 122.2 
'WO-shape: 
A Bae RU 89.1 | 154.5 | 95.4 | 47.4 |201.9 104.7 
ED 73.2 | 100,9 | 98.0 | 37.7 | 110.2 102.8 


391 


TABLE 4 


MEAN TRIALS TO CRITERION FOR ORIGINAL AND 
ORDER CONTROL Groups UNDER CONTINUOUS 
REINFORCEMENT 


Original continuous Order control group 


group 
S Second | First 
3 T 

ERU e problem. Second 
ppa wo two Bree eee 
shapes | shapes | ower 

Hyperactive | 134.4 51.7 99.4 36.4 

N 13 13 10 10 
Normal 87.6 89.1 113.9 42.4 

N 26 26 22 22 


a significant Ss X Problems interaction. This 
reflects the fact that hyperactive Ss in the 
CR and DL groups showed a significant im- 
provement on the second problem, with a 
slight trend in that direction by the hyper- 
active PR group. All normal Ss performed at 
a comparable level on both pretest problems. 
Order control data. In order to evaluate the 
respective effects of problem type and of the 
order of presentation (practice effect), the data 
of the original CR groups and those of the 
order control samples were combined into a 
Latin-square design as shown in Table 4. 
Separate analyses were done for the hyper- 
active and the normal Ss. Since the 10 addi- 
tional hyperactive Ss obtained ranged in age 
7-9 yr., only the 7-, 8-, and 9-yr.-old Ss of 
the original hyperactive CR group (a total of 
13 Ss) were included in this analysis. 
Results of the Latin-square analyses showed 
that, first of all, the differences between the 
two separate groups were nonsignificant, indi- 
cating that the original and order control 
samples are quite comparable. For the normal 
samples, the effects of order of presentation 
and type of concept problem were both sig- 
nificant at the 5% level of confidence. The 
normal Ss thus showed positive transfer to 
the second problem, but this was obscured 
in the original CR sample by the fact that the 
two concept is apparently more difficult for 
them than the flower concept. In the hyper- 
active data, on the other hand, the differences 
due to order of presentation were highly sig- 
nificant (F = 12.91, p < .01), but there were 
no significant differences due to the type of 
concept (F = .24). Thus, under CR, hyper- 
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active Ss show a large amount of positive 
transfer which is independent of the particular 
concepts involved. s 


Retest Data 

The main original aim in gathering the re- 
test data had been to evaluate the effects of 
chlorpromazine on learning in the hyperactive 
children. Since drug-placebo comparisons had 
revealed no systematic effects attributable to 
type of medication received (see Freibergs et 
al, 1968), the data for all hyperactive Ss 
were combined, giving a sample of 19 Ss un- 
der CR and 18 under PR. The combined data 
were then used for comparisons with the nor- 
mal CR and PR subgroups. Although the hy- 
peractives were retested an average of 2 wk. 
later than the controls, this seems unlikely to 
bias the results since correlations between test- 
retest time intervals and equivalent difference 
Scores were not significant. 

'The retest data were analyzed by means 
of an analysis of covariance, with the pretest 
scores on the equivalent concept serving as 
covariates. Separate calculations were made 
for all Ss under CR and under PR. The re- 
sults showed no significant differences between 
hyperactive and normal Ss in the CR condi- 
tion (F = .99, df = 1/42), while under PR 
the difference was significant at the 1% level 
(F — 8.00, df — 1/40). The pretest deficit 
shown by the hyperactives under PR was not 
only maintained over the retest sessions, but 
actually became more evident. While the con- 
trol Ss under PR improved significantly on the 
retest sessions, the hyperactive PR Ss showed 
no apparent improvement over their pretest 
performance. The effects of problem type were 
not significant. 


Reversal Data. 


Since only Ss who had reached the acquisi- 
tion criterion could be meaningfully placed 
on a reversal schedule, the possibility of bias 
should be considered in the evaluation of these 
data. Analyses of covariance were used be- 
cause of the possibility of a systematic rela- 
tionship between the number of acquisition 
and reversal trials. Separate analyses were 
made for each of the four concepts, with ad- 
justments for unequal cell frequencies. 

Considering first the pretest data, neither 
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the main effects, nor the interaction were sig- 
nificant in either analysis. Thus, with number 
of acquisition trials controlled, the hyper- 
active Ss showed no indications of "rigidity," 
or difficulty in switching responses under re- 
versed reinforcement contingencies. Similarly, 
the differences between the CR and the PR 
groups were nonsignificant, giving no indica- 
tion of an equivalent to the partial reinforce- 
ment effect (PRE) at this stage of testing. 

Summaries of the analyses of covariance 
for both retest concepts are presented in 
Table 5. Here both main effects were signifi- 
cant, more trials to reach reversal criterion 
being required by the hyperactive, as com- 
pared to the normal, children and by Ss under 
PR as compared to those under CR. The in- 
teraction term is also highly significant on 
the second concept, reflecting the fact that 
the differences between hyperactive and nor- 
mal Ss were markedly larger under the PR 
condition. 


Discussion 


The main aim of the present study was to 
gather objective evidence on the presence or 
absence of cognitive deficits of a conceptual 
nature in hyperactive children. The first main 
finding is that hyperactives were just as effi- 
cient as the normal controls in discovering 
the correct concept in a standard (CR) con- 
cept-learning situation. This argues quite 
strongly against the currently held opinion 
that they suffer from basic deficits in the 
fundamental cognitive operations involved in 
the concept formation process, namely, ab- 
straction of stimulus dimensions, production 
of symbolic categorization responses, and 
modification of response strategies on the basis 
of informative feedback. 

The experimental situation, it must be 
noted, had several features which would be 
expected to facilitate learning by helping to 
arouse interest, focus attention, and reduce 
boredom. None of these should have been of 
any help, however, had the hyperactive Ss 
been truly unable to function at a conceptual 
level. 

One feature of the experimental situation 
which may have made a difference is the fact 
that the task was self-paced, since stimuli re- 
mained in view until S had made his response. 
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TABLE 5 
ANALYSES OF COVARIANCE OF RETEST REVERSAL SCORES WITH 
EQUIVALENT ACQUISITION SCORES AS A COVARIATE 
Concept problem Source of variation df MS F 
Birds (first retest session) Groups (normal versus hyperactive) 1 173.75 S97" 
Reinforcement (CR versus PR) 1 147.93 4.57* 
OE X Reinforcement 1 57.43 1.78 
Residual 76 32.34 
Two colored dots (second CE (normal versus hyperactive) 1 296.71 22.79** 
retest session) Reinforcement (CR versus PR) 1 154.32 11.85** 
Groups X Reinforcement 1 109.07 8,38** 
Residual 71 13.02 
Note.—CR = continuous reinforcement; PR = partial reinforcement. 
* p <05. 
Dd <01. 


With stimuli presented for a fixed period of 
time and paced at a predetermined rate, there 

- is a possibility that hyperactive Ss might do 
more poorly than they did here. 

The general assumption that hyperactive 
children have particular difficulty with arith- 
metic and number concepts is also contra- 
dicted by the present data. There was no indi- 
Cation that hyperactives had any difficulty 
with the two number concepts included in the 
present study. Indeed the contrary was found, 
Since for the normal children the two black 
Shapes problem was significantly more difficult 
than the flower problem. The hyperactive 

children did significantly better on the second 
“test session than on the first, regardless of 
Which concept was presented. The reason for 
the discrepancy between the present findings 
“and previous literature seems simply due to a 
different interpretation of what is meant by 
“number concepts." So far as can be ascer- 
tained, the difficulty with number concepts 
has been inferred on the basis of poor arith- 
—metic performance in school. Traditional pri- 
mary school armithmetic, however, deals 
mainly with the rote, mechanical, application 
Of memorized information such as tables of 
“Multiplication, rules for “carrying,” etc., and 
typical tasks (addition, long division) do not 
Yequire any particular understanding of con- 
_ Cepts of numerosity and the like. What they 
40 require is concentrated and sustained at- 
tention, qualities in which a hyperactive child 
May indeed be lacking. 
_A further discrepancy with general descrip- 
tions of hyperactive children arises with re- 
SPect to the question of perseveration. No 


doubt because of the belief in an organic 
etiology, hyperactives have been said to be 
more stimulus-bound and to perseverate with 
the same response, much as brain-injured pa- 
tients are known to do. In the present study 
this hypothesis was tested in two different 
ways. First, the presence of any primitive 
position habits was checked by analyzing 
sequences of left-right response choices for all 
Ss (see Attneave, 1959, p. 24 for the method). 
A comparison of the mean results (not re- 
ported here) showed no indication of system- 
atic position preferences by either group. 

The other measure of perseveration was 
given by the reversal data. Here again, on 
the two pretest sessions, the hyperactive Ss 
showed no tendency to persevere with the 
previously correct response when the rein- 
forcement contingencies were reversed after 
reaching criterion. In view of this, the signifi- 
cant differences which appeared in the retest 
reversal data cannot be taken as evidence of 
perseveration, but must be explained in terms 
of the differences in test-retest transfer that 
they represent. 

The second main finding is that under par- 
tial reinforcement, the hyperactives not only 
showed significant decrements on the two pre- 
test sessions, but also failed to improve on the 
retest sessions, while all other Ss (including 
hyperactives under CR) showed considerable 
positive transfer. 

The crucial, defining event of the PR situ- 
ation is the occurrence of nonreinforcements 
(NR) for correct as well as incorrect re- 
sponses. The best strategy for solution under 
PR thus would be to memorize the character- 
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istics of the reinforced pictures and to search 
for possible commonalities betwen them. 

Considering the greater time interval which 
separates successive reinforced responses un- 
der PR, such a strategy could be more diffi- 
cult for Ss with more rapidly fading short- 
term memory, or with a difficulty in rapid 
coding of information for memory storage. In 
this case, however, some decrements would 
have been expected for the hyperactive DL 
group as well, where the postreinforcement 
interval was twice as long as in the CR con- 
dition. While there was a tendency in that 
direction, no systematic group differences were 
obtained. 

Another way of considering the PR situa- 
tion is in terms of Amsel's theory (1958, 
1962), according to which the occurrence of 
NR under PR produces frustration, a pri- 
mary aversive motivational condition. While 
frustration theory is based strictly on runway 
experiments using rats as Ss, the frustrative 
effects of NR have also been demonstrated in 
token-reward situations with children (Kend- 
ler, Kendler, Pliskoff  & D’Amato, 1958; 
Lambert, Lambert, & Watson, 1953; Long- 
streth, 1960). According to Amsel (1962), a 
secondary form of frustration, rp, or fractional 
anticipatory frustration, develops over a series 
of learning trials through a process of classi- 
cal conditioning. This, in turn, produces two 
different effects which are in competition dur- 
ing the intermediate phase of learning: acti- 
vating, or drive effects, and inhibitory effects 
producing a partial decrease in strength of 
the instrumental response. 

On the basis of the above formulation, the 
performance decrements of hyperactive Ss 
under PR could be explained as being due to 
a higher than average sensitivity to the frus- 
trative effects of NR. The inhibitory com- 
ponent of rp would favor abandoning any 
hypothesis that had not resulted in reinforce- 
ment. Discarding hypotheses after one, or 
even a few, NR would make it impossible to 
reach the correct solution under PR. Similarly, 
the activation component of rp, by raising Ss’ 
level of arousal above an optimum limit, 
would decrease the likelihood of task-relevant 
discriminations by interfering with the cue 
function of stimuli (Hebb, 1955). This would 
retard the discovery of the correct concept by 
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interfering with the search for elements com- 
mon to all reinforced pictures. 

That many hyperactives under PR found it 
difficult to continue searching for hypotheses 
can be seen from the comments noted in their 
test records. Some children clearly abandoned 
any attempts at rational solution, saying “the 
rule changes all the time," or "there is no 
rule.” A few others claimed they had “the 
right answer," despite the absence of a con- 
firmation from E. These reactions, inciden- 
tally, are almost classical examples of denial, 
one of the defense mechanisms used in at- 
tempting to deal with frustration (Douglas, 
1965). 

This motivational explanation of the PR 
deficits of hyperactive Ss is consistent with 
the absence of any truly cognitive deficits sug- 
gested by their “normal” performance in the 
CR and DL conditions. It may also help to 
explain the retest reversal results. The frus- 
tration involved in abandoning a solution as 
soon as it has proved successful might have 
prevented the hyperactives (and all PR Ss) 
from taking maximum advantage of their dis- 
covery of the reversal rule during pretest. 
Finally, this explanation is consistent with 
what appears to be the fundamental difficulty 
underlying the hyperactives’ symptoms. Dis- 
tractibility, hypersensitivity, and low frustra- 
tion tolerance (cf. Werry, Weiss, & Douglas, 
1964) can be seen as all stemming from the 
same fundamental difficulty—a chronically 
excessive level of arousal. This causes the 
hyperactive child to have abnormally low 
response thresholds, whether the response be 
skeletal (hyperactivity), autonomic (irrita- 
bility, hypersensitivity), or cognitive (dis- 
tractibility). 

The notion of low response thresholds 
leading to response competition helps to 
explain the apparently crucial importance of 
CR for effective concept learning in the hy- 
peractive. At the beginning of a test session, 
especially the first one, the hyperactive child 
typically reacts to a wider range of stimuli 
than a control, as indicated by the observa- 
tions noted during testing. The hyperactive 
children’s overt reactions to E, the apparatus, 
objects in the room, or their own bodies would 
obviously compete with their attention to the 
task at hand. Normal children, on the other 
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hand, were remarkably quiet and businesslike 
during testing. On each reinforced trial under 
CR, however, the hyperactive child’s atten- 
tion is drawn back to the task by the immedi- 
ate appearance of a visible, audible, and 
tangible reinforcement. Thus the advantage 
of the CR schedule is that it serves to rein- 
force task orientation (i.e. attention) in the 
hyperactive S, and acts as a continuing 
counterforce to his distractibility and physical 
restlessness. In addition, the positive motiva- 
tional aspect of reinforcement helps to avoid 
the appearance of maladaptive responses such 
as overanxiousness or giving up. 

The present findings have implications for 
the schooling of hyperactive children. So far, 
the best recommendation that could be made 
in severe cases was for a special environment 
with outside stimulation reduced to a mini- 
mum (eg. Strauss & Lehtinen, 1947). Ac- 
cording to the findings of the present study, 
favorable results may also be obtained by 
providing positive reinforcements at a con- 
tinuous rate. Thus learning materials pro- 
grammed so as to minimize the pupil’s oppor- 
tunities for error could be used for school 
subjects which create the most difficulty 
(spelling, arithmetic). An important proviso is 
that the situation should provide as much 
immediate sensory feedback as possible. This 
could be implemented, for example, by haying 
children work at computer consoles linked to 
a real time system with central program 
storage, 
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Two Es with markedly different person 


ality characteristics and styles of in- 


teracting with Ss tested male undergraduates (N —22 and 25) for pupillary 


dilation to pictures of nude and partially 


clothed women, as well as to pictures 


of men, and to control pictures. The Ss of E;, an aloof and businesslike gradu- 
ate student, dilated equally to male and female pictures. The Ss of E; a 
casual outgoing undergraduate, dilated more to pictures of women than to 
those of men. These results indicate that pupillary dilation to sexually provoca- 


tive stimuli reflect, in part, S's relationshi 
cerning permissible emotional responses to 


As Loewenfeld (1966) has recently pointed 
out, research workers have known for almost 
two centuries that emotional and cognitive 
arousal produces pupillary dilation. Never- 
theless, Hess and his co-workers (Hess, 1965; 
Hess & Polt, 1960, 1964, 1966; Hess, Seltzer, 
& Shlien, 1965) are responsible for the recent 
upsurge of interest in the effects of psycho- 
logical events on pupil size. These investi- 
gators have reported, among other things, that 
the pupil of the human eye changes in size as 
a function of either the observer's interest or 
his positive evaluation of the stimuli that he 
observes. Hess and Polt (1960) reported that 
the pupils of 4 male Ss were larger when Ss 
observed pictures of nude women than when 
they viewed either control slides or pictures 
of nude men. Nunnally, Knott, Duchnowski, 
and Parker (1967), using a group of 26 Ss, 
have confirmed Hess and Polt's finding that 
dilation occurs in males as a response to pic- 
tures of seminude females. Hess, Seltzer, and 
Shlien (1965) have further reported “pilot” 
data indicating that pupillary response may 


1 Requests for reprints should be sent to Loren J. 
Chapman, Department of Psychology, University of 
Wisconsin, Madison, Wisconsin 53706, 

? The authors are indebted to Eckhard Hess and 
James Polt for manufacturing the stimulus slides 
used in the present study and for assistance and 
advice in copying their equipment. The authors are 
also indebted to Neil Tafeen, Thomas Stern, and 
Lynn Kahn for assistance in conducting the study, 
and to Richard McFall and Rafael Klorman for 
their critiques of an earlier draft of the manuscript. 
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ip to the examiner and his view con- 
the stimuli. 


be a useful measure of heterosexual and homo- 
sexual interests, These investigators reported 
that they used as their heterosexual Ss five 
“Students or workers in our laboratory . . . 
well known to us over a period of several 
years [p. 165]." Their homosexual Ss were 
five other students who admitted their homo- 
sexual activity. The investigators used each 
S's pupil size in response to nonsexual art as a 
base line from which to compute for him a 
score of change in pupil size in response to 
female pictures and to male pictures, as well 
as a score of the difference between these 
two scores. Hess et al. found no overlap be- 
tween their heterosexual and homosexual Ss 
on this difference score, and they concluded, 
“In the study of some aspects of personality 
. . . this technique appears to us to open up 
entirely new dimensions [p. 168].” 
Enthusiasm for the promise of the technique 
should be tempered by the evidence from other 
psychological research that S’s response in an 
experiment or assessment situation may be 
influenced by his perception of his relationship 
to the examiner, and his anticipation of what 
the examiner expects him to do. Howes and 
Solomon (1950) made these points dramati- 
cally in their now classic critique of McGin- 
nies’ (1949) study of perceptual defense 
against taboo words. Masling (1960) reviewed 
a number of studies that demonstrate such 
effects in psychodiagnostic testing. Recently 
Rosenthal (1966) has marshaled considerable 
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PUPILLARY DILATION: EXAMINER INFLUENCE 


evidence that the expectations of E concerning 
the behavior of his Ss may influence their re- 
sponses. Contemporary investigators of the 
pupillary response have ignored the possibility 
that E may influence the outcomes of his ex- 
periments. 

In the course of a pilot study on pupillary 
response to sexually arousing pictures, the 
present writers had noticed that two Es work- 
ing on the study were obtaining different re- 
sults. Therefore, a study was designed to in- 
vestigate more systematically the effects on 
the pupillary response of E's personality and 
his style of interacting with his Ss. 


METHOD 
Experimenters 


The two Es each tested half the Ss in a study of 
pupillary response to sexually provocative pictures. 
The design was such that the data of the two Es 
could be treated as two independent but parallel 
studies. Each E was told to behave in his accus- 
tomed way in his interaction with his Ss, but in 
presenting the experiment, to follow the written in- 
structions to S precisely. The following description of 
the two Es represents a consensus of several ob- 
servers. 

Ei was a 28-yr.-old married male graduate student. 
He wore a coat and tie and maintained with his Ss 
a businesslike, somewhat formal, and aloof relation- 
Ship. His manner appeared to communicate the mes- 
Sage that the experiment was a serious affair in 
which Ss were expected to work carefully and behave 
Properly, much as they are expected to conform and 
behave properly in a classroom situation. 

Es was a 21-yr.-old male unmarried undergraduate 
who came to the experiment dressed in khaki trousers 
and either a sport shirt or pullover sweater. He was 
an energetic, buoyant, casual young man with a very 
friendly but “breezy” approach to interactions with 
Others, His style of deportment communicated fairly 
Immediately an image of a carefree, fun-loving under- 
graduate, with a lively interest in the opposite sex. 
He interacted with his Ss as equals, rather than as a. 
Superior in the academic hierarchy. 

The authors believe that these two styles of inter- 
action with S developed differing expectations by Ss 
Concerning the study, and differing views as to per- 
Tissible emotional responses to the stimuli presented 
in it. E, established an atmosphere which, the au- 
thors believe, permitted, or even encouraged, his 
Tale Ss to partake freely of the enjoyment of view- 
ing sexually provocative pictures of women. E: on 
the other hand, probably developed an atmosphere 
for the study in which his Ss would not be likely 
9 view such a response as appropriate. 

The two Es alternated in running blocks of Ss so 
55 to avoid any possible changes in pupillary response 
4s a function of the time of day or the stage in the 
“periment in which S was tested. 
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Apparatus 


The apparatus was a near duplicate of the appa- 
ratus used by Hess and his collaborators and de- 
scribed by Hess (1965) and by Hess and Polt 
(1966). This apparatus produced photographs of 
the pupil of one eye on infrared film, two frames 
per second, while S viewed for 10 sec. an image 
projected onto the back of a translucent screen. For 
each slide, the total incident light at S's eye was ap- 
proximately 169 ítc. Since sudden sounds produce 
pupillary dilation (Loewenfeld, 1958), Hess’ appa- 
ratus was modified by placing the projector and all 
relays in an adjacent room and by projecting through 
a double-pane glass window so that the noise of both 
the projector and the relays was not audible in the 
testing room, In an additional modification of Hess’ 
apparatus, the film was marked when the stimulus 
slide was changed by directing a light beam into 
the focus lens of the camera. This made it possible 
to determine with certainty which slide was being 
viewed for each photograph of the pupil. 

The photographs of the pupil were projected 
and magnified on a ground glass screen mounted in 
the top of a table. A research assistant measured 
with a ruler the diameter of the projected image of 
each pupil. This assistant was not aware which S's 
pupils he was measuring or which slide S had been 
viewing. 

The first two and last three frames of each series 
of 20 photographs were omitted from the analysis of 
data to reduce effects from preceding slides, the 
effects of the momentary change in illumination 
when slides were changed, and any effects of Ss 
anticipating the change of slides near the end of a 
series. Also omitted were any photographs on which 
S blinked or moved his eyes. A mean diameter was 
computed for the remaining photographs of each 
S’s pupil for each slide. 


Stimulus Materials 


Twenty-four slides were prepared in Hess’ labora- 
tory, from magazine pictures selected by the present 
investigators. Five experimental slides were pictures 
of attractive women, nude or seminude, from Play- 
boy or Esquire magazines. Another five experimental 
slides were pictures of attractive men. Two of these 
were from physique magazines. These pictures of 
men, unlike those of the women, were “proper” 
enough to appear in a family oriented evening news- 
paper. 

There were also 12 control slides. Six of these each 
contained five numbers, one in each of the four 
corners and one in the middle (as previously de- 
scribed by Hess, 1965), and 6 were photographs of 
landscapes, taken from National Geographic and Life 
magazines. Hess’ group has used both number slides 
and pictorial slides as control stimuli. The landscapes 
were used in the present study because they, like 
the experimental slides, were photographs of objects 
with greater detail than the number slides. (There 
were also two experimental slides showing bloody 
mutilated bodies. These slides were not germane to 
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the concerns of the present paper, but were instead 
inserted into this study to provide evidence on the 
unrelated issue of whether pupillary constrictions or 
dilation occurs in response to repugnant stimuli. 
Data concerning them will not be reported here.) 
The control slides were alternated in their presenta- 
tion with the experimental slides. Each experimental 
slide was preceded by either a number slide or a 
landscape slide. Both the female and male slides were 
preceded and followed as often by number slides as 
by landscape slides. Six different orders of slides 
were used so as to equate within the group of Ss as 
a whole the ordinal position of each type of slide. 


Instructions and. Procedure 


Both Es recited the same instructions concerning 
the nature of the apparatus and the experiment, 
with the exception of a single line specified below. 
The Ss who came to the experiment wearing glasses 
other than contact lenses were asked to remove 
them. The Ss were told that the experiment was 
concerned with how a person looks at a picture. 
Pupillary response was not mentioned either during 
the experiment or afterwards, and no S indicated 
any suspicion that this was the topic of investiga- 
tion. The E pointed out the screen and the lights in 
the box and told S, “Your task will be to look at 
each picture. Continue to look at each picture until 
the next one appears." The lights were turned off, 
and a number slide appeared on the screen. The S 
was asked to read the numbers, each of which was 18 
in. high on the screen 40 in. from his eyes. Three 
prospective Ss who were unable to read the num- 
bers, were excluded from the experiment. The Ss 
were then shown three additional buffer slides each 
for 10 sec. One slide showed a baby, a second a girl 
on a boat, and a third was another number slide. 
The E then stopped the apparatus and checked the 
focus of his camera, and reminded S to keep his head 
as still as possible. 

Es, but not Zi, said at this point, in a “suggestive” 
tone of voice, “I think you will find some of the 
pictures interesting.” This single line was the only 
difference between Es programmed into the script. 
This line was inserted for E» on the assumption that 
it was consistent with his other less explicit com- 
munications to S concerning his attitudes toward 
the study and toward S. Neither E spoke while Ss 
viewed the slides. 


Subjects 


The Ss were 51 male introductory psychology stu- 
dents at the University of Wisconsin. Four Ss were 
dropped because of poor quality of the photographs 
of their pupils. This resulted in a final group of 22 
Ss for E; and 25 for Es. 

Like Hess and Polt (1960) and Nunnally et al. 
(1967) the present investigators did not scrutinize 
the heterosexuality of their Ss, but instead relied on 
the presumption that the majority of male college 
students have heterosexual interests, 
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RESULTS 


For both Hs’ Ss, the mean pupil sizes for 
the landscape and number slides were almost 
identical. Therefore, these two types of con- 
trol slides were combined in order to provide a 
stable base line with which to compare pupil 
size in response to the male and female slides, 

Most recent investigators have analyzed 
pupillary response in terms of percentage 
change in pupil size. This mode of analysis 
seems inappropriate. The correlation, for both 
groups combined, between size of pupil diam- 
eter on the control slides and dilation to fe- 
male slides (pupil size to female slides minus 
pupil size to control slides) was —.28 (p= 
.05). This means that the larger the pupil, the 
smaller the dilation, a finding closely analo- 
gous to the Law of Initial Values. This law, 
that the higher the basal level of autonomic 
activity the smaller will be the upward reac- 
tivity to a stimulus, is commonly found to 
hold for other psychophysiological measures, 
but has not received attention in recent re- 
ports on the pupillary response. An analysis 
of results expressing dilation as percentage 
change from pupil size for control slides would 
further depress differences that are already 
depressed by their relationship to a large ini- 
tial pupil size. Mean percentage changes will 
be reported here in the interests of compara- 
bility with other research reports, but the sig- 
nificance tests were computed using values for 
pupil size adjusted for initial level. Percentage 
change was computed by dividing the differ- 
ence between the pupil sizes for the control 
slides and each set of experimental slides by 
the pupil size for the control slides. 

For both Zs, Ss’ pupils dilated to both the 
male and female slides. All of Ej's Ss dilated 
to both female and male slides; however three 
of E»'s Ss failed to dilate to the male slides, 
and two of these three Ss failed to dilate to 
the female slides. Mean dilation for Ey’s Ss 
was 3.2% to the female slides and 2.9% to 
the male slides. Mean dilation for E;'s Ss was 
3.8% to the female slides and 2.7% to the 
male slides. 

Of central interest to this study is the com- 
parative dilation obtained by each E for male 
and female slides. A regression model was 
used, as suggested by Benjamin (1963), to 
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adjust each individual’s scores by the amount 
attributable to the correlation of the pupil 
size for experimental slides with pupil size for 
control slides. The formula used is a modifi- 
cation of the familiar formula for predicting 
scores on Y using the coefficient of regression 
of Yon X: £ Y — b, (X — X). 

The Y is the pupil size for experimental 
slides, X the pupil size for control slides, and 
Y the adjusted pupil size for the experimental 
slides. These adjustments were performed 
separately for each S's response to male and 
female slides, using regression coefficients 
computed for each E. The regression coeffi- 
cients for E, were .945 for the female slides 
and .951 for the male. Comparable values for 
E were .923 and .965. 

Fourteen of E,'s 22 Ss showed larger pupil 
size while viewing female slides than male. 
The difference for the group as a whole was 
not significant, as shown by a direct difference 
t test using the adjusted scores (¢ = .92). In 
striking contrast, 20 of Exs 25 Ss showed a 
larger pupil size viewing female slides than 
male, and the difference for the group was 
significant (t= 2.54, p< .01) as indicated 
by a one-tailed direct difference ¢ test. The 
difference between the difference scores of the 
two Es’ groups fell short of significance (t = 
1,53, p = .06),* 


Discussion 


Although both Zs found pupillary response 
to both male and female pictures, they found 
quite different results in the comparative di- 
lation. If the two Es had been working in dif- 
ferent laboratories, they would have reported 
contradictory findings. This result is consistent 
with the hypothesis that differing modes of 
interacting with S produce differing pupillary 
responses. These findings do not, however, 
identify the specific cues by which the two Es 
influenced their Ss’ responses. 

The finding of pupillary dilation by 44 out 
of 47 male Ss in response to male slides ap- 


3 When the comparable significance tests were 
computed without the correction for initial level, the 
difference between pupil size for male and female 
slides was significant for E» (f= 2.37, p< 02) but 
Not for E, (t= .92, p=.36), and the difference be- 
tween the difference scores of the two Es was not 
Significant (t = 1.43, p= 15). 
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Pears divergent from that of Hess et al. 
(1965) who found dilation for only two of 
five heterosexual Ss. Hess et al., as previously 
mentioned, suggested that a differential re- 
sponse to the male and female slides may be 
a useful index of heterosexuality. Neverthe- 
less, the present finding of dilation by male 
Ss to male slides may be consistent with Hess’ 
broader contention that pupillary response 
measures interest and positive evaluation. 
Heterosexual men in our society do many 
things that may be interpreted as reflecting 
both a strong interest in and a positive evalu- 
ation of other masculine men, An outstanding 
example is their lionization of athletes. Ad- 
vertising agencies apparently exploit this in- 
terest in other men by using very masculine 
appearing men as models in their advertise- 
ments. Not all interest in other people need be 
erotic! 

In evaluating Hess et al.’s results one must 
bear in mind that their heterosexual Ss worked 
in Hess’ laboratory and presumably were fa- 
miliar with the hypotheses being tested. Any 
heterosexual young man in this circumstance 
would probably feel constrained to demon- 
strate his manhood by responding with lively 
interest to the female slides, but not to the 
male slides. The Ss knew that E expected it of 
them, and they surely also expected it of 
themselves. In contrast, Hess et al.’s, homo- 
sexual Ss had admitted their homosexual be- 
havior to one of the investigators, and prob- 
ably, therefore, felt under less constraint to 
demonstrate heterosexual interests. 

Hess and his co-workers assume that the 
pupillary response is entirely beyond volun- 
tary control. If the response can be altered by 
E’s behavior, as in the present study, it can 
probably also be influenced by self-instruction 
to attend with interest, or by self-expecta- 
tions. Pupillary response is not isolated from 
the rest of S’s behavior, and apparently re- 
flects, among other things, the general level of 
activation of the organism (Nunnally et al., 
1967). Activation level is affected by many 
intrapsychic and interpersonal variables. In 
future research on pupillary response, inves- 
tigators should control such variables as the 
personality of E, his interaction with S, the 
social atmosphere of the experimental sessions, 
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and S's knowledge and expectations of the 
study. 
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PSYCHIATRIC PATIENTS? 
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AND SIROON P. SHAHINIAN 
Hillside Hospital, Glen Oaks, New York 


Although Zigler and Phillips contend that an identical relationship between 
premorbid social competence and prognosis is found in both a schizophrenic 
and nonschizophrenic hospital population, supporting data are not presented. 
To test this hypothesis adequately the 3-yr. posthospital outcomes of 81 
Schizophrenic and 85 nonschizophrenic previously hospitalized patients were 
separately related to the Zigler and Phillips’ Social Competence Scale. The re- 
sults indicate that among schizophrenic patients social competence is positively 
related to two out of five indexes of posthospital outcome: incidence (5 < .001) 
and frequency of rehospitalization (p< .025). Among nonschizophrenic pa- 
tients, the relationship between incidence of rehospitalization and social com- 
petence is not statistically significant. With regard to frequency of rehospital- 
ization a significant negative relationship (p< .05) was obtained. These results 
question the validity of the theoretical model of psychopathological develop- 


ment advocated by Zigler and Phillips. 


Zigler and Phillips (1962) contend that 
psychopathology is a unitary phenomenon 
with the various diagnostic categories rep- 
resenting arbitrary segments derived from a 
maturity continuum. Central to their position 
is the hypothesis that the relationship be- 
tween level of maturity and prognosis, gen- 
erally accepted as existing among schizo- 
phrenic patients, is not unique to schizo- 
phrenia, but instead, cuts across all functional 
mental disorders. 

Zigler and Phillips (1962) support their 
position by referring to the results of a previ- 
ous study (Zigler & Phillips, 1961) which 
they contend indicates the existence of an 
identical relationship between level of pre- 
Morbid maturity and prognosis in both a 
schizophrenic and nonschizophrenic hospital 


lThis study was supported in part by a grant 
Provided by the Board of Directors, Hillside Hos- 
pital, and by Grants MH-10191 and MH-12273, 
and MH-08004 from the National Institute of 
Mental Health, United States Public Health Service. 
The authors would like to thank Robert E. Patton, 

irector of the Biostatistics Division of the New 
York State Department of Mental Hygiene, for 
his cooperation in making records available to us. 

? Requests for reprints should be sent to Bernard 
Rosen, Hillside Hospital, 75-59 263rd Street, Glen 
Oaks, New Vork. 


population. Although the relationship of social 
competence to outcome is presented for a 
heterogeneous group of schizophrenic and non- 
schizophrenic patients, as well as for a sub- 
group of the schizophrenic patients in their 
sample, the relationship between premorbid 
social competence and posthospital outcome 
among nonschizophrenic patients is not re- 
ported. 

The lack of specific analyses relating pre- 
morbid social competence to posthospital 
outcome for nonschizophrenic patients raises 
doubts as to the general prognostic efficacy 
of the Social Competence Scale as well as the 
validity of Zigler and Phillips’ unitary theory 
of psychopathology. 


3A reanalysis of the data presented in Study II 
(Zigler & Phillips, 1961, pp. 267-268) rejects the 
hypothesis that social competence is significantly 
related to posthospital outcome for either schizo- 
phrenic or nonschizophrenic patients. The recom- 
puted probabilities using a Fisher's exact test are 
for nonschizophrenic patients .11, and for schizo- 
phrenic patients .098. (The probability value for 
schizophrenic patients is twice as large as that 
reported by Zigler and Phillips, 1961, p. 268; 
p<.05. This discrepancy may be due to an in- 
correct assumption made by Zigler and Phillips that 
the Fisher's exact test is tabulated as a two-tailed 


test). 
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METHOD 


The sample studied consisted of 166 psychiatric 
patients described previously (Levenstein, & Pollack, 
1966; Levenstein, Pollack, & Klein, 1966). Of the 
original sample of 176 patients 10 could not be 
followed-up and were excluded from this study. 

Patients were diagnosed by the research psy- 
chiatrist according to criteria presented previously 
(Klein, 1967). The sample was diagnostically hetero- 
geneous, 81 patients were diagnosed as schizophrenic; 
the remainder (W = 85) consisted primarily of pa- 
tients with affective and character disorders. 

Follow-up interviews were conducted after the 
second and third year following discharge from 
Hillside Hospital, Glen Oaks, New York. Psy- 
chiatric rehospitalizations were further checked 
through the Biostatistics Division of the New York 
State Department of Mental Hygiene. 

The social competence measure employed is the 
same as that specified by Zigler and Phillips (1961, 
1962). The distribution of the social competence 
scores was divided as close to the median (1.17) of 
the entire sample as possible. 

To assess posthospital outcome, the following 
measures, similar to those used by Zigler and Phillips 
(1961), were used: (a) incidence of psychiatric re- 
hospitalization during the 3-yr. follow-up period;* 
(b) the number of times the patient was rehospital- 
ized during the follow-up period (frequency of 
rehospitalization) ; (c) the amount of time a re- 
hospitalized patient had spent in the community 
between the time of discharge from Hillside Hos- 
pital and his first rehospitalization; (d) total length 
of psychiatric rehospitalization during the entire 
3-yr. period; and (e) the average length of each 
rehospitalization. 


RESULTS 


The relationship between social competence 
and each outcome measure will first be pre- 
sented for the entire sample and then sepa- 
rately for the schizophrenic and nonschizo- 
phrenic subgroups. 

With regard to the incidence of rehospital- 
ization among the entire sample, significantly 
more low-competence patients (54; 62.196) 
were rehospitalized than high-competence 
patients (26; 32.9%; p< .001; one-tailed 
tests of significance were used where ap- 
propriate). 

Table 1 contains the relationship between 
premorbid social competence and incidence 
of psychiatric rehospitalization for the two 
diagnostic groups. With regard to the schizo- 
phrenic subgroup, the results were consistent 
with those for the entire sample (5 « 001). 


4 Two patients who had never been rehospitalized, 
but who had committed suicide were included in the 
rehospitalized group for this variable only. 
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TABLE 1 


INCIDENCE OF REHOSPITALIZATION AS A FUNCTION 
OF PREMORBID SOCIAL COMPETENCE 


Schizophrenic Nonschizophrenic 


Compe- | Hospitalized | Total | Hospitalized | Total 
N | % N N % | N 
Low 40 | 72.7 55 14 43.8 | 32 
High 9 | 346 | 26 17 32.1 | 953 
Note,—Total group: x? = 14.10, df — 1, p <.001, one- 
tailed ; schizophrenic group: x? = 10.73, df = 1, p < .001, one- 
tailed | nonschizophrenic group: x? = 1.17, df = "1, = ns, one- 


tailed, 


However, among nonschizophrenic patients 
the incidence of rehospitalization was statis- 
tically similar for the two competence groups. 
The possibility exists, however, that although 
the incidence of rehospitalization for high 
and low-competence nonschizophrenic pa- 
tients was similar, the nature of the re- 
hospitalization experience differed. It can be 
hypothesized that among rehospitalized pa- 
tients the frequency, total length, and average 
length of rehospitalization was greater, and 
that time from discharge to rehospitalization 
was shorter for low-competence patients, as 
compared with high-competence patients. 

Table 2 examines the relationship between 
social competence and the aforementioned 
four measures of posthospital outcome for re- 
hospitalized patients only. With regard to fre- 
quency of rehospitalization (Table 2), for the 
combined sample, the two competence groups 
could not be differentiated statistically. How- 
ever, when the sample was divided into diag- 
nostic subgroups, significant differences were 
observed. The rehospitalized low-competence 
schizophrenic patients were rehospitalized a 
significantly greater number of times than 
the rehospitalized high-competence schizo- 
phrenic patients (p< .025). Unexpectedly, 
among the rehospitalized nonschizophrenic 
patients the reverse was true (p < .05). 

Table 2 contains the relationship between 
social competence and time between dis- 
charge and first rehospitalization. The two 
competence groups could not be differentiated 
statistically on this measure. This was true 
for the entire sample as well as for the two 
diagnostic groups. 
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TABLE 2 
POSTHOSPITAL OUTCOME BY PREMORBID SOCIAL COMPETENCE FOR REHOSPITALIZED PATIENTS ONLY 
Total Schizophrenic Nonschizophrenic 
Competence 
Low High Low High Low High 
N 53 26 39 9 14 16 
Frequency of rehospitalization 
M 2.19 2.6 2.38 1.44 1.64 
SD 1.13 1.29 1.17 0.68 0.81 15 
t 0.11° 2.2805 2.13% 
Time from discharge to first rehospitalization (in mo.) 
M 8.96 10.24 8.08 7.78 11.43 11.62 
SD 9.46 10.04 8.72 10.00 10.88 9.79 
D 0.578 0.099 0.05" 
Total length of rehospitalization (in mo.) 
M 12.30 8.76 12.74 9.00 11.07 8.62 
SD 9.14 8.75 7.99 10.04 11.68 7.93 
t 1.70** 1,189 0.66" 
Average length of each rehospitalization (in mo.) 
M 7.40 4.96 6.87 7.89 8.86 3.31 
SD 8.43 6.98 6.91 10.41 11.52 275 
t 1.249 0.51" 1.80* 


a One-tailed test of significance. 

b Two-tailed test of significance. 

iia etd value corrected for unequal variances, 
* b <.025. 


The total length of rehospitalization (Table 
2) was significantly greater among low-com- 
Petence patients when compared with the 
high-competence patients in the combined 
sample (p < .05), but there were no signifi- 
cant differences within either of the two 
diagnostic groups. 

When the two competence groups were 
compared with regard to average length of 
each rehospitalization (Table 2), there was no 
Significant difference between the two groups 
for the entire sample, or either of the two 
diagnostic subsamples. 


DISCUSSION 


The present study contradicts Zigler and 
Phillips (1962) as to the existence of an 
identical relationship between pre-morbid 
Social competence and prognosis in both a 
Schizophrenic and non-schizophrenic hospital 
Population [p. 216]." This contradiction can 


be understood if one examines those sig- 
nificant findings common to the two studies, 
that is, the incidence of rehospitalization for 
the total group and the schizophrenic sub- 
sample. Here, the present study is in complete 
agreement with Zigler and Phillips (1961). 
In both studies a significant relationship was 
reported between social competence and in- 
cidence of rehospitalization for the total 
heterogeneous patient group, and for the 
schizophrenic subgroup. One might then er- 
roneously infer that since the hypothesized 
relationship exists for the schizophrenic pa- 
tients, and still holds when this group is 
combined with a nonschizophrenic group, 
that the relationship must also hold for non- 
schizophrenic patients. Methodologically, this 
study emphasizes the error involved in apply- 
ing this type of reasoning to psychiatric re- 
search. 

The results obtained with the variable 
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“frequency of rehospitalization,” not included 
in Zigler and Phillips’ (1961) study, suggests 
an inverse, rather than an “identical” rela- 
tionship between premorbid social competence 
and -posthospital outcome among schizo- 
phrenic and nonschizophrenic patients. The 
lack of a significant positive relationship be- 
tween premorbid social competence and post- 
hospital outcome for nonschizophrenic patients 
questions the theoretical model of psy- 
chopathological development advocated by 
Zigler and Phillips. 
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an inverse, rather than an “identical” rela- 
tionship between premorbid social competence 
and -posthospital outcome among schizo- 
phrenic and nonschizophrenic patients. The 
lack of a significant positive relationship be- 
tween premorbid social competence and post- 
hospital outcome for nonschizophrenic patients 
questions the theoretical model of psy- 
chopathological development advocated by 
Zigler and Phillips. 


REFERENCES 


Kier, D. F. Importance of psychiatric diagnosis 
in prediction of clinical drug effects. Archives 
of General Psychiatry, 1967, 16, 118-126. 


B. Rosen, D. F. KLEIN, S. LEVENSTEIN, AND S. P. SHAHINIAN 


LEVENSTEIN, S., Portack, M., & Kuz, D. F. 
Follow-up study of formerly hospitalized psy- 
chiatric patients: Procedural considerations in 
data collection. Journal of Hillside Hospital, 
1966, 15, 152-164. 


LrvEeNsTEIN, S, Krem, D. F., & Porrack, M. 
Follow-up study of formerly hospitalized voluntary 
psychiatric patients: The first two years. American 
Journal of Psychiatry, 1966, 122, 1102-1109. 


ZIGLER, E. & Pures, L. Social competence and 
outcome in psychiatric disorder. Journal of Ab- 
normal and Social Psychology, 1961, 63, 264-271. 


Zicter, E. & Pures, L. Social competence and 
the processes-reactive distinction in psychopa- 
thology. Journal of Abnormal and Social Psy- 
chology, 1962, 65, 215-222. 


(Received March 25, 1968) 


(Continued from page 400) 


Phasic Electrodermal Orienting Response in Chronic Schizophrenics: II. Response to Auditory Signals of Varying 
Intensity: Alvin S. Bernstein*: Downstate Medical Center, 450 Clarkson Avenue, Brooklyn, New York 11203. 


Anxiety Level of Schizophrenic and Control Patients and Their Parents: Amerigo Farina* and Jules D. Holzberg: 
Department of Psychology, University of Connecticut, Storrs, Connecticut 06268. 


Affective Experience in Psychotherapy: Kenneth I. Howard,* Davic E. Orlinsky, and James A. Hill: Department 
of Psychology, Northwestern University, Evanston, Illinois 60201. 


Increases in Hypnotizability Resulting from a Prolonged Program for Enhancing Personal Growth: Charles T. 


Tart*: Department of Psycholo, 
Effects of a “Reformed Sinner” and a “Lapsed Saint": 


; University of California, Davis, California 95616. 
Thomas Harford and Leonard Solomon*: Department of 


Psychology, Boston University, 111 Cimmington Street, Boston, Massachusetts 02215. 
Determinants of Institutional Release and Prognosis in Mental Retardates: William Vogel,* Karoly J. Kun, and 


Edward Meshorer: Department of Psycholo, 


» Worcester State Hospital, Worcester 1, Massachusetts 01604. 


Effect of Hypnotic Age Regression on the Magnitude of the Ponzo and Poggendorff Illusions: Michael Parrish, 
Richard M. Lundy,* and Herschel W. Leibowitz: Department of Psychology, Pennsylvania State University, 


University Park, Pennsylvania 16802. 


Strategies within Hypnosis for Reducing Phobic Behavior: Suzanne Larsen Horowitz*: Child Psychiatry Clinic, 


Stanford University Hospital, Stanford, California. 


Changes in Mental Status of Elderly Patients in Age-Integrated and Age-Segregated Hospital Milieus: Boaz 


Kahana* and Eva Kahana: Department of Psychology, Washington University, St. Louis, Missouri 63130. 


pinal Cord Lesions: James W. McDaniel* and Alan W. Sexton: Depart- 
ilitation, University of Colorado Medical Center, 4200 East Ninth Ave- 


Psychoendocrine Studies of Patients with S; 
ment of Physical Medicine and Rehabi 
nue, Denver, Colorado 80220. 


* Asterisk indicates author for whom address is supplied. 


Volume 74 


August 1969 


Journal of | Lo 


ore 
Editor: Howarp F. Hunt, New York State pec. 


and. Columbia University 


E. 


Associate Editors; WitLiAM N. THETFORD, College of Physicians and Surgeons, 
Columbia. University 


ARTHUR C. Carr, New York State Psychiatric Institute and Columbia University 


CONSULTING EDITORS 


THEODORE X. BARBER 
Medfield Research Foundation 
Harorp BASOWITZ 
New York University 
RıcmarD L, BLUMENTHAL 
New York State Psychiatric Institute 
and Columbia University 
Evcene I. BURDOCK 
New York University 
Joun M. BUTLER 
University of Chicago 
Loren J. CHAPMAN 
University of Wisconsin 
W. Crawrorp CLARK 
New York State Psychiatric Institute 
and Columbia University 
Jonn H. CUMMING 
New York State Department of Mental 
Hygiene 
W. GRANT DAHLSTROM 
University of North Carolina 
Josrem L. Frerss 
New York State Department of Mental 
Hygiene and Columbia University 
Jerome D. Franx 
Johns Hopkins University 
Sor L. GARFIELD 
Columbia University 
ANN MAGARET GARNER 
University of Nebraska 
ISRAEL GOLDIAMOND 
Institute for Behavioral Research 
Harrison G. GOUcH 
University of California, Berkeley 
Starke R. HATHAWAY 
University of Minnesota 
Raren HENE 
University of Michigan 
Ernest R. HILGARD 
Stanford University 


Lawrence C, Kors 
New York State Psychiatric Institute 
and Columbia University 


SHELDON J. KORCHIN 
University of California 

WirtrAM G. LAWLOR 
Fordham University 


ARDIE LUBIN 
United States Naval Hospital, San Diego 


SALVATORE MADDI 

University of Chicago 2 
Martin T. ORNE 

Institute of the Pennsylvania Hospital 

and the University of Pennsylvania 


Ernst PRELINGER 
Yale University 


Davin ROSENTHAL 
National Institute of Mental Health 


ALAN K. ROSENWALD 
Illinois State Psychiatric Institute 


Jacques RUTSCHMANN 
New York State Psychiatric Institute 
and Columbia University. 


KURT SALZINGER 
New York State Psychiatric Institute 
and Brooklyn Polytechnic Institute 


is SCHACHTER 
New York State Psychiatric Institute 


and Columbia University 


. Joun SHLIEN 


University of Chicago 


Morris I. STEN J 
New York University 


Hagorp L, WILLIAMS A 
University of Oklahoma School of Medicine 


OSEPH Z 
: New York S State Department of Mental 
Hygiene and. Columbia University 


PUBLISHED BY 


THE AMERICAN PSYCHOLOGICAL ASSOCIATION, Inc. 


The Journal of Abnormal Psychology is devoted to basic research and theory in the 
broad ‘field of abnormal behavior, its determinants, and its correlates. The following 
general topics fall within its area of major focus: (a) psychopathology—its develop- 
ment, treatment or remission, and its symptomatology, and course; (b) normal proc- 
esses in abnormal individuals; (c) pathological or atypical features of the behavior of 
normal persons; (d) experimental studies, with human or animal subjects, relating to 
emotional behavior or pathology; (e) social or group effects on adjustment and patho- 
logical processes; (f) tests of hypotheses from psychoanalytic or other psychological 
theory. Thus, case histories, experiments on hypnosis, theoretical papers of scholarly 
substance on personality or emotional abnormality, studies of patient populations, 
analyses of abnormal behavior and motivation in terms of modern behavior theories, 
studies of therapy and behavior change, to cite a few illustrative items, all would fall 
within the boundaries of the Journal’s interests. t 

The Journat is issued bimonthly, one volume per year. The subscription price is 
$10.00 annually (foreign, $10.50). Single issues are $2.00. 


Because of the large number of manuscripts submitted, a publication lag of up to 
24 months can occur. Authors may avoid this delay if they are prepared to pay the 
costs of early publication ($45.00 per Journal page). 

Manuscripts must be submitted in duplicate and should be sent to the Editor: 

Howard F. Hunt 

Journal of Abnormal Psychology 

Department of Psychology 

N. Y. State Psychiatric Institute & Columbia University 
722 West 168th St. 

New York, N. Y. 10032 


Manuscripts must be accompanied by an abstract of 100-120 words. The abstract 
should conform to the style of Psychological Abstracts. Detailed instructions for prepa- 
ration of the abstracts appeared in the American Psychologist (1961, 16, 833), or may 
be obtained from the APA Central Office. 


, Authors submitting manuscripts are requested to use full journal titles in reference 
lists; do not abbreviate. 


The following policies govern reprintin i i 

" I s E g or other forms of reproduction of APA-copyrighted 

journal materiali. Permission by the American Psychological Association must be requested for 

Reprinting all tables, figures, and text of more than 500 words (total from one article) in length. 
ere will be a charge of $5.00 per page for such permission except when the requester is the 


author of the article to be reprinted, or the article i E à 
i i 2s ber of 
copies solely for instructional rns À to be reproduced in a limited number 


Heren Orr Myrtie Henpricks Henry S. SALMON Rosert D. GORMAN 
Managing Editor ^ Technical Editor Advertising and Business Manager Subscription Manager 


_ Subscriptions, address changes, and business communications should be sent to: 


THE AMERICAN PSYCHOLOGICAL ASSOCIATION, INC 
1200 Seventeenth St., N. W. : n 
Washington, D. C. 20036 


Changes of address must reach the Subscripti 

5 T 1 iption Office by the tenth of th th to take effect 

the Voce ie Undelivered copies resulting from Sites oiana will pot be replaced; sub- 

Uu ers should notify the post office that they will guarantee second-class forwarding postage. Other 
aims for undelivered copies must be made within four months of publication, 


PUBLISHED BY 
THE AMERICAN PSYCHOLOGICAL ASSOCIATION, Ixc. 
Prince and Lemon Sts., Lancaster, Pa. 17604 
and 1200 Seventeenth St., N. W., Washington, D. C. 20036 


Second-class postage paid at Lancaster, Pa. 
Copyright © 1969 by the American Psychological Association, Inc. 


| 


Journal of Abnormal Psychology 
1969, Vol. 74, No. 4, 405-412 


SOURCES OF VARIANCE IN RESPONSES TO 
1 QUESTIONNAIRES AND IN BEHAVIOR 


RUDOLF H. MOOS: 
Stanford University School of Medicine 


Sixteen psychiatric patients were observed in and were asked to describe their 
reactions to six psychiatric ward subsettings. The findings, which replicated 
and extended earlier results, indicated (a) that persons, settings, and Person X 
Setting interactions generally accounted for statistically significant and impor- 
tant proportions of the total variance, both in responses to questionnaires and 
in actual behavior; and (b) that these proportions remained remarkably stable 
over a 3-mo. interval, except for a consistent tendency for the proportions 
attributable to settings to increase. Implications discussed include that different 
psychiatric ward subsettings may be differentially therapeutic to different 
patient groups, and that the findings bear on the upper limit of validity indexes 
obtainable with standard interview or rating predictions of patient behavior. 


Several recent studies have demonstrated 
that persons, settings, and Person X Setting 
interactions each account for statistically 
significant and important proportions of the 
total variance in responses to different ques- 
tionnaires and in inferential ratings of behav- 


ior. The relative amounts of variance ac- 


counted for by different sources of variance 
vary importantly, depending upon the par- 
ticular sample of persons and settings chosen 
for the study; however, it is generally true 
that the variance accounted for by the inter- 
action between settings and persons is greater, 
sometimes substantially greater, than the vari- 
ance accounted for by either persons or 


' settings alone. 


For example, Endler, Hunt, and Rosenstein 
(1962) and Endler and Hunt (1967) found 
that persons, settings, and Person X Setting 
interactions each contributed significantly to 
behavioral variance in both anxiety and hos- 


„`The author is also at the Veterans’ Administra- 
tion Hospital, Palo Alto, California. The author 
Wishes to express his appreciation to the patients 
and staff who participated in this research; to 
Eleanor Levine and Phyllis Nobel, who served as 
behavior observers; to William Lake, who provided 


| Valuable help in data analysis; to Bert Kopell and 


William Wittner, who importantly facilitated the 
Work; and to Mark Abramson, John Adams, and 
Sheldon Starr, who permitted observations of their 
individual and group therapy sessions. 

Requests for reprints should be sent to Rudolf H. 
Moos, Department of Psychiatry, Stanford Uni- 
E School of Medicine, Palo Alto, California 


tility. Raush, Dittman, and Taylor (1959) 
and Raush, Farbman, and Llewellyn (1960), 
studying hyperaggressive and control children 
in a psychiatric ward milieu, found that inter- 
actions between child and setting were far 
more important in accounting for behavior 
than was either the child or the setting alone. 

In a series of earlier studies (Moos, 1967, 
1968b; Moos & Daniels, 1967), patient and 
staff responses to a number of different 
psychiatric ward subsettings, for example, 
individual therapy, group therapy, and com- 
munity meeting, were analyzed for each of 
five different sets of adjectives reflecting trust, 
extroversion, security, involvement, and socia- 
bility. The results indicated that consistent 
individual differences among patients ac- 
counted for between 24 and 40% of the 
total variance; consistent setting differences 
always accounted for less than 5% of the 
total variance; and the Patients X Settings 
interaction effect, with only one exception, 
accounted for a significant percentage of the 
total variance (between 17 and 26%). Re- 
sults for staff were generally similar, except 
that consistent differences among individuals 
accounted for less variance than setting 
differences. 

For example, one group of patients was more 
trusting in settings in which only one other 
person was present than in settings in which 
several others were present, whereas the re- 
sults for a second group of patients were 
exactly the reverse. These findings need to be 
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replicated using different samples for re- 
sponses, persons, and settings. They are of 
particular importance for both the assessment 
and prediction of behavior and for the plan- 
ning of maximally therapeutic environments, 
since they suggest that different types of 
settings may be maximally beneficial to dif- 
ferent types of patients, and that psycho- 
logical assessment procedures must system- 
atically sample both persons and settings in 
order to predict behavior accurately. 

The extent to which these results would be 
replicated if the data used were observations 
of actual behavior in different settings is im- 
portant. Setting and Person X Setting inter- 
actions might account for consistently greater 
or consistently less of the total variance when 
actual behavior is observed than when ques- 
tionnaire responses are obtained. The extent 
to which the proportions of variance ac- 
counted for by different sources of variance 
change depending upon the particular time 
at which the sample is studied is also impor- 
tant. Does the percentage of variance attrib- 
utable to persons or settings change with 
increasing familiarity with the environment 
and/or with successful psychiatric treatment? 


PURPOSE 


There were three major purposes for the 
present study: (@) to replicate earlier find- 
ings (Moos, 1968b) indicating that for psy- 
chiatric patients, differences among persons 
and Person X Setting interactions account for 
much larger proportions of variance than 
setting differences; (b) to compare the per- 
centages of variance accounted for by dif- 
ferent sources of variance in questionnaire 
responses and in actual behavior; (c) to esti- 
mate the direction and extent to which the 
percentages of variance attributable to dif- 
ferent sources of variance change with in- 
creasing familiarization with the environment 
and/or change in psychiatric status. 


METHOD 
Settings and Subjects 


The research was conducted on an open-door, 30- 
patient psychiatric ward located in a large Veterans’ 
Administration Hospital. This was a mixed ward 
with approximately 18 male and 12 female patients 
at any one time. The treatment was based upon 
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patient responsibility, decision making, and self- 
regulation within the framework of the ward com- 
munity. There was a maximum emphasis on active 
coping with the milieu, on behavior change, and on 
the process of making practical plans for release, 
for example, obtaining a job, handling financial 
crises, and reconstituting family relationships. Open 
communication between staff and patients and active 
patient participation were strongly encouraged. 

During the study, community meetings and small- 
group therapy were each held 2 days a week. Indi- 
vidual therapy was formally arranged for the major- — 
ity of the patients; for some of these, it was 
centered around frequent but brief contacts, The 
remainder of the treatment program consisted of 
recreational activities and various hospital workshop 
and industrial work assignments. 

The patients who were initially admitted to the 
ward essentially constituted a representative sample 
of male and female veterans with diagnoses of 
neuropsychiatric disorders; that is, there were no 
specific criteria which were applied for admission 
to the ward, 

Each of 16 patients (8 male, 8 female) was 
selected on the basis of several criteria: (a) their 
conversation had to be easily understandable; (b) 
they had to be able to cooperate in wearing a 
wireless radio transmitter, and also to be willing to 
answer a questionnaire about their reactions in each 
setting; (c) they had to be on the ward long 
enough for systematic behavioral change to occur. 
The average age of the patients was 38.8 yr. 
(SD — 7.1), and their average education was 134 
yr. (SD — 11). The median length of hospitaliza- 
tion was approximately 3 mo. and the number 
of previous hospitalizations ranged from none to five. 
Eight of the patients carried a primary diagnosis of 
neurosis; 7 carried a primary diagnosis of schizo- 
phrenia, and 1 carried a primary diagnosis of 
character disorder. 


Procedure 


The research was completed as a part of a larger 
project studying the behavior of psychiatric patients 
by the use of behavior observers and a miniature 
wireless radio transmitter. The plans for the use 
of the radio transmitter were discussed in both 
staff and patient-staff meetings, and the patients 
participated in putting up television antennae around 
the ward, including bedrooms, dayroom, visitors’ 
room, dining room, etc. 

The 16 Ss were each observed twice in each of 
six different ward subsettings. These subsettings Were: 
(a) an intake meeting (or discharge-planning meet- 
ing), in which the entire ward staff observed the 
patient being interviewed by one of the resident 
psychiatrists; (b) an individual therapy session; 
(c) a group therapy session in which the patient 
was a member of a group of approximately eight 
patients; (d) a community meeting in which all pa- 
tients and staff participated; (e) lunch time, whi 
occurred in a dining room on the ward; and (f) free 
time, which was unscheduled time on the war 


Sources OF VARIANCE 


during which the patient was free to engage in any 
activity he chose. These six settings constituted a 
representative sample of regularly scheduled ward 
activities in which all patients participated. 

The initial set of observations occurred during 
the first and second weeks of hospitalization. A 
second set of observations was made on 12 of these 
patients approximately 3 mo. later, the week 
before the patient had improved enough to leave 
the hospital. Four of the patients could not be ob- 
served a second time, either because they had left 
the hospital against medical advice or because they 
had to be discharged without adequate prior warning. 

Three sets of observations were obtained on each 
patient in each setting: (a) questionnaire responses 
relevant to how patients were feeling; (b) behav- 
ioral observations of what patients were actually 
doing; and (c) recordings of what patients were 
saying. Only the results of the first two sets of ob- 
servations are discussed in this report. 

Patients were given a questionnaire immediately 
after their participation in each of the six settings. 
This questionnaire consisted of nine dimensions mea- 
Sured by three items each, on each of which patients 
tated their experience on a 1-5 scale. Four affect 
dimensions were measured, for example, anxiety 
(uneasy, fearful, anxious), depression (sad, lonely, 
blue), vigor (lively, active, energetic), pleasantness 
5 (pleased, lighthearted, elated). Two of the dimen- 

sions measured the perceived worth (worthwhile, 
important to me, glad I went) and the perceived 
therapeutic benefit of the setting (made me feel 
better, helped me gain self-confidence, helped me 
get well). The other three dimensions measured 
affiliation (I worked closely with others, people were 
friendly to me, I was included), participation (I 
Worked hard, I was active, I talked freely and 
openly), and leadership (I was needed, I played 
àn important part, I helped decide what to do). 

The items for the first four dimensions were taken 
Partially from the Nowlis (1965) Mood Adjective 
Check List and partially from other items used in 
- the earlier study (Moos, 1968b). The items for the 
other five dimensions were directly related to Barker 
and Gump’s (1964) notions of the possible effects 
of different types of behavior settings. 
E. Each of the patients was also actually observed 

in each setting. These behavior observations were 
made by two Os. The Ss knew that they would be 
Observed by one of these Os during the time they 
had the microphone on. The Os, who had spent sev- 
eral months on the ward and were integrated with 
the staff and familiar to all the patients, had been 
trained to categorize and note patient behavior in 

terms of 16 relatively simple behavior categories, 
_ ‘or example, smiling, talking, arm movement, etc., 
I taken from those used by Zinner (1963). During the 

Intake and community meeting settings, O sat in 
the same room as Ss and staff, and during the group 
therapy and individual therapy settings O sat behind 
One-way vision screen. During the lunch and free- 
me settings O followed the patient around as 
“nobtrusively as possible. 
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In order to estimate the effects of these observa- 
tions, Ss were observed both with and without the 
microphone on in both community meeting and 
group therapy. Results indicated that the average 
effects of wearing and being observed with the wire- 
less radio transmitter microphone were very small, 
and that the differences that did occur suggested a 
slight increase in purposeful and a slight decrease in 
purposeless activity while wearing the transmitter. 
These results, which are similar to those of other 
studies in showing that observational effects may be 
quite small, are more fully discussed in another 
paper (Moos, 1968a). 


RESULTS 


The reliability of the behavioral observa- 
tions was established by having the two be- 
havior Os observe and rate for 16 reliability 
sessions after a period of initial training. The 
task was to note the occurrence of each coded 
behavior during each minute of a 20-min. 
observation session. The reliability correla- 
tions were very acceptable, ranging from a 
low of .77 to a high of .99. 

The data were analyzed by using an analy- 
sis of variance design. Separate analyses of 
variance were calculated for each of the nine 
questionnaire scales and each of eight behav- 
ior categories in each of the two sets of 
observations. With this technique it is pos- 
sible to specify, for each of the questionnaire 


2The analysis of variance employed a model 
which assumed that each of the three sources of 
variance was sampled randomly, even though this 
was not strictly the case. The logic of this approach 
was discussed by Endler, Hunt, and Rosenstein 
(1962) and by Moos (1968b). The statistical signifi- 
cance of these results is considered to be less impor- 
tant than either the results relevant to the propor- 
tions of variance accounted for by different sources 
of variance or the reproducibility of the results. The 
analyses of variance calculations used for the F tests 
follow those discussed on pages 199-202 in Winer | 
(1962). For example, in a three-way random model, 
in which the interactions cannot be pooled, quasi 
F ratios with estimated degrees of freedom for both 
numerator and denominator must be calculated. 
These analyses indicated that in the first set of 
observations, person effects were statistically signifi- 
cant (p <.05) for all questionnaire variables and all 
but one behavior variable (nodding yes); person- 
setting interactions were significant for all variables; 
and setting effects were significant for five of the 
questionnaire variables (anxiety, vigor, participation, 
affiliation, leadership) and for all but two (smiling, 
general movement) of the behavior variables. Signifi- 
cance levels were generally similar in the second set 


of observations, 
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TABLE 1 
PERCENTAGES OF VARIANCE ACCOUNTED FOR BY DIFFERENT SOURCES OF 
VARIANCE IN QUESTIONNAIRE SCALES—FIRST OBSERVATIONS 
Source 
Scale 
A P s AXI AXS PXS AXP XS 
Anxiet; 5.6 25.4 74 8.3 14 20.8 31.0 
e aa 0.0 26.7 0.0 4.0 0.5 29.2 39.0 
Vigor 2.0 25.0 5.0 5.0 1.9 29.2 32.1 
Pleasantness 12.8 12.8 1.5 6.7 3.8 15.0 47.4 
Worth 0.0 21.0 0.0 0.9 Q5 24.6 51.7 
"Therapeutic benefit 0.3 13.0 3.0 4.1 0.9 37.9 40.8 
Participation 0.0 5.3 10.5 19.4 3.7 19.7 41.6 
Affiliation 0.0 18.0 4.6 15.8 2.2 9.2 51.0 
Leadership 12 11.5 0.8 10.5 1.6 244 50.0 


Note,—Abbreviated: A= adjectives, P = persons, S = settings. 


and behavior dimensions, the proportion of 
the total variance which is accounted for by 
persons, by settings, and by their interaction. 
The percentages of variance accounted for 
by each source of variance were calculated 
for these random-effects analysis of variance 
models using the rationale and equations 
given by Gleser, Cronbach, and Rajaratnam 
(1965) and Endler (1966). These authors 
have pointed out that the analysis of vari- 
ance technique can be used to estimate the 
relative magnitude of each individual compo- 
nent of variance, expressed as a percentage 
of the sum of the different variance compo- 
nents. The general logic of this technique 
involves breaking the expected mean squares 
into their various variance components and 
solving separately for each component. 
Tables 1 and 2 give the percentages of 
variance accounted for by different sources 


of variance in the questionnaire scales for 
the first and second sets of observations. 
Consistent person differences accounted for 
between 0 and 45%, consistent differences 
between settings for between 0 and 18%, and 
the interactions between persons and settings 
for between 9 and 38% of the total variance. 
Differences between adjectives, Adjective X 
Person, and Adjective X Setting interactions 
generally accounted for less than 10% of the 
total variance. 

In both sets of observations the variance 
accounted for by Person X Setting interac- 
tions was greater than the variance accounted 
for by settings for all nine dimensions, and 
was approximately the same as the variance 
accounted for by persons. Also, the proportion 
of variance accounted for by persons was Con- 
sistently greater than the proportion ac 
counted for by settings. The amount of vati- 


TABLE 2 


PERCENTAGES OF VARIANCE ACCOUNTED FOR BY DIFFERENT SOURCES OF 
VARIANCE IN QUESTIONNAIRE SCALES—SECOND OBSERVATIONS 


| 


Source 
Scale 
A P s AXP A XS PXS |AXPXS 
Anxiety 5.2 25.3 11.0 4. ET 
Depression 0.0 45.1 0.0 33 16 172 32.0 
Vigor 75 10.6 73 41 33 34.9 325 
Pleasantness 13.3 9.7 44 11.5 0.0 15.0 45.1 
Worth — 0.0 222 33 02 34 24.5 47.5 
Therapeutic benefit 1.0 26.8 98 1.6 0.0 32.4 29.5 
Participation 15.0 6.6 17.8 92 23 18.9 302 
Affiliation 40 97 52 91 0.0 184 53.9 
Leadership 42 0.1 99 172 2.2 23.3 43.1 


Note.—Abbreviated: A = adjectives, P = persons, S = settings, 
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TABLE 3 


PERCENTAGES OF VARIANCE ACCOUNTED FOR BY 
DirFERENT SOURCES OF VARIANCE IN BEHAVIOR 
CATEGORIES—FIRST OBSERVATIONS 


Source 
Category 
Persons | Settings | P X S | Within 

Hand and arm move- 

ment 16.8 119 | 319 | 394 
Foot and leg move- 

ment 274 10.0 | 26.7 | 35.9 
Scratch, pick, rub 30.7 13.1 | 24.5 | 31.6 
General movement 

and shifting 17.3 14 | 471 | 341 
Nod yes 4.2 42.9 | 33.5 | 19.4 
Smile 35.3 3.6 | 35.4 | 25.6 
Talk 10.5 683 | 139 | 74 
Smoke 41.9 7.1 | 20.7 | 302 


ance accounted for by settings increased from 
the initial to the final set of observations for 
eight of the nine response dimensions, whereas 
the changes for person differences and Person 
X Setting interactions showed no consistent 
patterns. 

Tables 3 and 4 show the percentages of 
variance accounted for by different sources 
of variance for the first and second set of 
behavior observations. The 16 behaviors had 
been classified into four categories on the 
basis of their intercorrelations, and the 8 be- 
haviors shown in Table 3 represent the behav- 
lors in each category which showed the most 
frequent occurrence. The behaviors of hand 
and arm movement; foot and leg movement; 
Scratching, picking, or rubbing; nodding yes; 
and general movement and shifting fell into 
the general movement category; smiling fell 
into a category called expressiveness; and 
Smoking and talking each fell into separate 
Categories, 

The percentage of variance accounted for 
by different sources of variance varied greatly 
depending upon the particular behavior being 
Considered. Talking and nodding yes were 
more a function of the setting than of the 
Person, although the Person X Setting inter- 
actions were substantial, especially for nod- 
ding yes, The other six behaviors showed gen- 
erally similar results, in that the variance 
accounted for by persons and by Person X Set- 
ting interactions were both always greater 
han that accounted for by settings. The 


409 


variance accounted for by Person X Setting 
interactions generally tended to be as great 
as, or somewhat greater than, that accounted 
for by consistent person differences. 

The findings indicate that there is much 
greater variation in the proportion of variance 
attributable to different sources of variance 
in actual behavior than in responses to ques- 
tionnaires, especially in the proportion of vari- 
ance attributable to settings, which varied 
from 1 to 68%. Most of the variance in talk- 
ing was related to setting differences; that is, 
persons consistently tended to talk more in 
some settings than in others. On the other 
hand, most of the variance in smoking was 
related to person differences; that is, some 
people consistently smoked more than others. 
Furthermore, the amount of Person X Setting 
interaction variance was consistently high, 
indicating that, for example, some persons 
tended to smile more in Setting A than in 
Settting B, whereas other persons showed just 
the reverse tendency. 

The changes from the initial to the final set 
of observations were very similar to those 
for the questionnaire responses. The variance 
attributable to settings showed an increase for 
seven of the eight behaviors (all except talk- 
ing, which initially showed an extremely high 
setting variance), whereas the variance attrib- 
utable to persons and to Person X Setting 
interactions showed no consistent changes. 

Figures 1 and 2 show examples of inter- 
action effects for the questionnaire variable of 


TABLE 4 


PERCENTAGES OF VARIANCE ACCOUNTED FOR BY 
DIFFERENT SOURCES OF VARIANCE IN BEHAVIOR 
CATEGORIES—SECOND OBSERVATIONS 


Source 
Category 
Persons | Settings Within 
Hand and arm move- 
BUND fa 17.2 13.8 39.3 
Foot and leg move- 
rcs SNNT 27.3 13.0 28.6 
Scratch, pick, rub 26.3 18.2 27.9 
General movement 
a shifting 23.1 44 24.3 
Nod yes 46 | 565 18.5 
Smile 334 8.3 22.3 
Talk 74 60.1 12.5 
Smoke 36.5 122 41.1 


410 
5 Basra T T T T 
= Patient *3 
A == Patient *10 
4 
è 
$5 
$ 
= 2 


penn Lt 
9 1c AT OE $ 
MER E ue 
7 E 0E fg 
£ ó 


Fic, 1. Example of Person X Setting interaction for 
the variable of therapeutic benefit of setting. 


therapeutic benefit and the behavioral variable 
of the frequency of scratching and rubbing 
parts of the body. Figure 1 indicates that 
Patient 10 stated that he obtained more treat- 
ment benefit from intake meeting and less 
from group therapy than Patient 3, whereas 
there were no differences between these two 
patients in either individual therapy or com- 
munity meeting. Figure 2 indicates that Pa- 
tient 10 did more scratching and rubbing 
parts of his body in intake meeting and less 
in group therapy than Patient 3, whereas 
there were no differences between the two pa- 
tients in individual therapy or community 
meeting. To the extent that scratching and 
rubbing parts of the body may be indicative 
of anxiety, an O would make very different 
estimates of the anxiety of these two patients, 
depending upon the setting in which they 
were observed. 


Discussion 


These results essentially replicate and sub- 
stantially extend those of previous studies. 
The questionnaire results for the two sets of 
observations were very similar to those ob- 
tained in an earlier study (Moos, 1968b), 
despite a different patient sample, the use 
of a different psychiatric ward, the sampling 
of different settings, and the measurement of 
different response dimensions. 

The results also indicate that persons, 
settings, and Person X Setting interactions 
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^ constant in the two sets of observations, exe 


generally account for significant and impor 
tant proportions of the total variance in bes 
havior. These findings are consistent with 
those of Dittman (1963) and Ekman (1965), |) 
who have previously discussed the important | 
influence of situational factors on nonverbal 
behavior, and Zinner (1963), who recorded | 
observations of 56 specific behaviors on 90 | 
airmen in basic training in 30 different real” 
life settings and found that there were de 
pendencies between settings and behaviors; 
for example, there were settings that eithel 
facilitated or inhibited smoking and talking to 
varying degrees. { 

The proportion of variance accounted for | 
by different sources both in responses to questi 
tionnaires and in behavior were remarkablj 


cept that the variance attributable to setti 
differences generally increased over ti 
These findings probably indicate that settings 
tend more consistently to elicit particulat 


with the general milieu. As the 12 patients 

tested the second time had all improved in | 
psychiatric condition, this change may De 
related to changes in psychiatric status. The} 
current data do not allow a test between these 
alternative explanations, since psychiatric imi 
provement and increasing familiarization with 
the milieu are confounded. 
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Fic. 2. Example of Person X Setting interaction 
for the variable of scratching and rubbing parts 
the body, 


SOURCES OF VARIANCE 


These findings replicate those of Raush et 
al. (1959, 1960) in indicating that situa- 
tional factors (setting differences) may play 
a more potent role in behavior after treatment 
and/or experience with the general milieu. 
The two studies were so different (age of Ss, 
length of time between initial and final ob- 
servations, range of settings, types of response 
dimensions studied, etc.) that these similari- 
ties indicate that the replicated findings may 
be highly general. 

On the other hand, the specific sample of 
settings, persons, and response indicators used 
may affect importantly the relative propor- 
tions attributable to different sources of vari- 
ance. For example, the results for the variable 
of anxiety were generally similar to those that 
Endler et al. (1962) obtained for their anx- 
ious questionnaires; however, the percentage 
of variance accounted for by modes of re- 
sponse (adjective differences) was less, 
whereas the percentages of variance accounted 
for by consistent individual differences were 
greater in this study. The former difference 
is probably due to the fact that Endler et al. 
sampled a wider range of anxiety indicators, 
whereas the latter difference may be due to 
the fact that psychiatric patients are generally 
high-anxiety individuals and, as such, would 
be expected to be more consistently anxious 
across settings than college student Ss. 

Moos and Clemes (1967) studied the 
amount of actual talking (by word count) 
of therapists in different individual therapy 
settings, and found that 18.5% of the vari- 
ance was attributable to setting differences. 
This indicates that the variance related to 
Settings is still statistically significant, even 
When all the settings sampled are similar; 
however, the actual proportion attributable to 
Settings may decrease markedly when the 
range of settings is curtailed. : 

Further studies need to explore systemati- 
cally these effects by experimentally or sta- 
tistically (through Monte Carlo methods) 
Sampling different populations of persons, 
Settings, and response indicators. The degree 
9f variation in the proportion of variance ac- 
Counted for by different sources of variance 
Clearly is much greater than has been assumed 
heretofore. This is particularly important 
since the percentage of variance accounted 
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for by consistent differences between indi- 
viduals is related to the upper limit of the 
validity of predictions. If 9% of the total 
variance is accounted for by consistent indi- 
vidual differences, then the upper limit of 
validity is .3; if 16% of the variance is 
accounted for, then the upper limit is .4; and 
if 25% of the variance is accounted for then 
the validity may go as high as .5; etc. Thus, 
no predictions made from individual differ- 
ences on the nine questionnaire response di- 
mensions for the initial set of observations 
could have a validity better than .5. Predic- 
tions made from the behavior categories could 
be somewhat better, and might be as high as 
.6 for the category of smoking. 

The general implications of these results 
for the measurement and prediction of behav- 
ior and for the planning of therapeutic milieus 
have been more fully discussed in an earlier 
paper (Moos, 1968b). It is perfectly clear 
that as long as there are large percentages 
of variance which are due to Person X Setting 
interactions, the particular behavioral poten- 
tial of the person-setting unit must be more 
fully studied. The most important aspects of 
this research area are that the percentages of 
variance accounted for by different sources of 
variance determine the extent of validity pos- 
sible with the use of standard questionnaire 
and interview or rating measures of person- 
ality, and that different psychiatric ward sub- 
settings may be differentially beneficial to 
different patient groups. 
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A BRIEF COMMENTARY ON THE USEFULNESS OF 
STUDYING FEARS OF SNAKES 
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Therapy analogue studies using Ss with fears of snakes or rats have been used 
by several investigators to evaluate types of behavior therapy. The method 
had appeared to be an extremely convenient way of evaluating various thera- 
pies. In a small pilot study the authors were able to desensitize four Ss to 
fears of snakes or laboratory rats. This was consistent with reports in the 
literature. However, the two control Ss also overcame their fears of snakes, 
They had been instructed to hold the snake when it was presented to them and 
to act unafraid. They were also told that they would overcome their fears in 
this manner, The authors suggest that any “technique” that helps a person 
confront the fear of snakes may “cure” him. It is also suggested that studying 
the treatment of snake fears may be irrelevant to the understanding of treating 


clinical neuroses, 


In this brief note, the authors suggest that 
evidence for the effectiveness of desensitization 
as a clinical technique should not be evaluated 
in studies of snake and rat phobias. The use of 
these animal studies initially appeared to be a 
brilliant strategem. It provided a conveniently 
available phobic object that could be presented 
to a large pool of available Ss such as college 
students. The phobia was the same for all Ss. 
Change could be operationally defined in terms 
of distance from the snake that S could tolerate. 
However, despite all these apparent advantages, 
the usefulness of this type of study as an indi- 
cator of the effectiveness of desensitization in 
clinical practice was questioned. It is the authors’ 
impression that both the type of S and the course 
of treatment are markedly different in college 
students who are fearful of snakes, and in people 
who are seen in actual clinical practice. 

As part of a pilot study the authors success- 
fully desensitized four students to their fear of 
harmless snakes or laboratory rats. These results 
teplicated the findings of Lang, Lazovik, and 
Reynolds (1965) that desensitization can effec- 
tively cure a person’s fear of snakes. Two other 
students served as control Ss. They were told 
that for the sake of science it was important to 
see if they could fool trained observers. They 
Were instructed to act unafraid when a harmless 
Shake was presented to them to hold. They were 
also told that by going through this procedure 
again and again, they would actually overcome 
their fears, 


Requests for reprints should be sent to Allan 
Cooper, Department of Psychiatry, Albert Einstein 
College of Medicine, Eastchester Road and Morris 
Park Avenue, Bronx, New York 10461. 


After an average of eight sessions the two 
control Ss were able to handle the snake and 
report that they were not afraid. Monitoring of 
their physiological reactions was in accord with 
their self-report. 

Other investigators have reported results simi- 
lar to those obtained with the control Ss in the 
present study. Strahley (1965) demonstrated that 
16 Ss receiving “counterphobic” treatment were 
less afraid of a harmless snake than 16 Ss who 
received desensitization. Hogan and Kirchner 
(1967) report that significantly more of the 21 
Ss treated with “implosion” were able to handle 
a laboratory rat after only one session than 
untreated control Ss. 

One is impressed with all the various tech- 
niques that seem to be able to “cure” snake or 
rat phobias. However, the authors now suspect 
that the success of these techniques is due to 
“the natural fear extinction process” that one 
sees in human beings when they learn new skills. 
That is, when people learn to drive a car or 
dive into water, they soon overcome their fears 
with repeated exposure whether or not any 
“therapeutic” procedure is used to help them. 
It now appears that animal phobias in general 
are different from other clinical phobias. Marks 
and Gelder (1966) suggest that animal phobias 
have a much earlier onset, a more continuous 
course, come less frequently to psychotherapy, 
are associated with few or no other psychiatic 
symptoms, and respond better to behavior ther- 
apy. Moreover, the fear of snakes, in particular, 
is a fear that is not exposed to “the natural fear 
extinction process” since the person has little 
opportunity to encounter them in his daily life. 
In the case of fear of snakes, the therapeutic 
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aspects of various procedures may be irrelevant 
to the cure. The therapy may simply operate by 
providing repeated contact with the feared object. 

The therapeutic procedure may also operate 
by providing explicit or implicit suggestion that 
the person will overcome his fears. Agras, Leiten- 
berg, and Barlow (1968) report that Ss treated 
with desensitization but not given therapeutically 
oriented instructions or social reinforcement did 
not differ significantly from an untreated control 
group, while a group which did receive these vari- 
ables was able to make progress in overcoming 
their fear of snakes. Thus, suggestion seems to 
play a role in the curative process. 

Ordinarily, when one invokes the concept of 
“suggestion” it often appears as if what is meant 
is that S has simply been fooled or tricked into 
getting better. A more complex statement as 
applied to the fear of snakes and rats is that the 
belief that a person will get better may be suf- 
ficient motivation for him to face the fear and 
thereby expose the fear to the natural fear 
extinction process. 

Studying the effectiveness of "treatments" for 
curing fears of snakes and rats may be irrelevant 
to a knowledge of the treatment of clinical 
phobias. Any “technique” that helps a person 
confront the fear of snakes may “cure” him. 
However, clinical experience suggests that this 
procedure of confrontation by itself will not be 
effective in the treatment of clinical phobias. It 
is also the authors’ experience that desensitization 
is not nearly as effective with patients in clinical 
practice as it is with college students who are 
afraid of snakes. 

The authors therefore question studying tech- 
niques for the treatment of fear of snakes, if 
this is being done for the purpose of understand- 
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ing the efficacy of these procedures in treating 
clinical phobias. It is suggested that despite the 
greater difficulty in procedures, only actual clini- 
cal phobias be studied to obtain data about tech- 
niques that are ultimately designed to treat 
clinical phobias. 


SUMMARY 


Therapy analogue studies, using Ss with fears 
of snakes or rats have been used by several in- 
vestigators to evaluate types of behavior therapy. 
This technique has appeared to be an extremely 
convenient way of evaluating various therapies, 
However, as a result of curing two control Ss of 
their fear of snakes, the authors have come to 
the conclusion that studying the treatment of 
snake fears may be irrelevant to the under- 
standing of treating clinical neuroses. Supporting 
evidence from other studies is cited. 
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. Fifty-one consecutive male admissions to a psychiatric receiving hospital who 


had a staff diagnosis of schizophrenia were kept drug free for a period of 
testing. The rod-and-frame test (RFT) showed a curvilinear correlation 
coefficient of .28 with the Phillips Prognostic Rating Scale (PRS). The 
midrange on the RFT showed a good prognosis, with the process cases being 
at the extremes of the RFT scores. There was a significant relationship between 
RFT and duration of hospital stay at 1 and 2 yr. but not after 3 yr. of 
follow-up. The readmission pattern suggested that intermediate cases on the 
RFT leave the hospital sooner but return more frequently. The data failed to 
demonstrate a relationship between RFT and measures of conceptual plasticity, 
abstract attitude, severity of schizophrenic thought disorder, and severity of 
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reality disruption. 


The persistent problems in diagnosis of 
Schizophrenia necessitated an improvement in 
classification. The concept of process-reactive 
Schizophrenia was just such an effort to im- 
prove the nomenclature. The empirical finding 
that this distinction was useful has led to the 
attempt to clarify the underlying basis for the 
distinction. Many authors (Becker, 1956; 
Fine & Zimet, 1959; Herron, 1962; Kantor 
& Herron, 1965; Kantor & Winder, 1959; & 
Zimet & Fine, 1965) have suggested that 
classification is an expression of differences in 
the degree of psychological differentiation be- 
tween schizophrenics. The process cases are 
Viewed as less differentiated than the reactive 
Cases. 

There are no more reliable or better vali- 
dated experimental measures of differentiation 
than those developed by Witkin, Dyk, Fater- 
son, Goodenough, and Karp (1962). These 
measures have been subjected to rigorous use 
in a theoretical framework of differentiation 
for a number of years. Two efforts (Bryant, 
1961; Gibeau, 1965) to examine the process- 
reactive dimension in schizophrenia with these 
measures have produced slightly different 

lInitial data collection was done at the State 
University of New York, Downstate Medical Center, 
With grant support from the Commonwealth Fund. 
The authors wish to thank Grant Dahlstrom and 
Herbert E. Spohn for their helpful comments and 
Suggestions, 
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results. A major source of variance has been 
the use of drugged, as well as chronically 
institutionalized patients. Differences reported 
between process and reactive cases could 
easily be related to group differences in types, 
dosage, duration, or action of drugs. The ef- 
fect of differences in the length of hospitaliza- 
tion between process and reactive patients 
could also account for some findings, Recog- 
nizing the essential need to control for the 
effects of drugs and duration of current hos- 
pital stay, the authors obtained a population 
that fulfilled these requirements. The attempts 
to demonstrate a linear relationship between 
position on the process-reactive continuum 
and the rod-and-frame test (Sugerman, 1962) 
and the sophistication of body concept test 
(Cancro, 1962) were unsuccessful. 

An increasing number of reports by various 
investigators suggest that a linear relationship 
is not to be expected between these variables 
(Silverman, 1964; Voth, 1947; Witkin, 1965; 
Witkin, Lewis, Hertzman, Machover, Meiss- 
ner, & Wapner, 1954). These reports stimu- 
lated the present reexamination of this drug- 
free population for the presence of a non- 
linear relationship between psychological 
differentiation and the process-reactive classifi- 
cation. The secondary purpose was to seek 
out possible relationships between psycholog- 
ical differentiation, duration of hospitaliza- 
tion, and certain signs of the illness, specifi- 
cally, ability to shift conceptual set, abstract 
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attitude, severity of thought disorder, and 
degree of reality disruption. 


METHOD 
Subjects 


Fifty-one consecutive male admissions to the 
wards of a large city psychiatric observation hos- 
pital, who had a senior staff diagnosis of schizo- 
phrenia, ranged in age from 18 to 44, and were 
fluent in English were admitted to this study.* 
While the use of the staff diagnosis introduced an 
indeterminate amount of variability, it was chosen 
over the authors’ diagnosis in order to avoid the 
problem of unconscious bias in the selection of 
cases. The age range was selected in order to screen 
out early adolescent reactions, involuntional states, 
and premature senility. The patients whose clinical 
picture was complicated by organic, neurological, or 
extraneous factors such as alcoholism, were rejected. 
An important methodological feature of this study 
was that patients received no tranquilizers or long- 
acting sedatives until after the period of testing and 
examinations, so that the clinical picture would not 
be contaminated by drug effect. 


Procedure 


Toward the end of the first week of hospitaliza- 
tion the patient received the rod-and-frame test 
(RFT, body erect), in Witkin’s laboratory, without 
the technician knowing the patient’s clinical history 
or mental status findings. The RFT score was used as 
the measure of psychological differentiation. 

The patients were interviewed, and the material 
obtained was used to fill out the Prognostic Rating 
Scale (PRS, Phillips, 1953), which served to classify 
the patients on the process-reactive dimension. The 
patients were not divided into nominal categories on 
the basis of the PRS, but rather assigned their actual 
numerical score. 

Patients were followed for a period of 3 yr., and 
the outcome measure was the total number of nights 


5 The age range of the population was 18-44, 
M 7295, SD — 74. The mean number of years of 
education was 11.3, SD —2.. The mean scaled 
WAIS vocabulary score was 9.3, SD=3. Twenty- 
eight patients (55%) had no prior psychiatric hos- 
pitalization. Of the 23 patients (45%) who were 
previously hospitalized, the mean duration of total 
prior hospitalization was 15.9 mo. with only 2 
cases being hospitalized for a lifetime total of more 
than 36 mo. and no case for over 5 yr. Twelve 
patients with a history of previous hospitalization 
were classified as reactive so that patients with prior 
hospitalization were represented more highly in the 
reactive group (60%) than in the process group 
(40%). Thirty-four patients (66.7%) were in Hol- 
lingshead’s Group V; 12 (23.5%) were in Group IV; 
4 (7.8%) were in Group III; and only 1 patient 
was in Group I. There was no significant relation- 
ship between social class and the process-reactive 
classification in this sample. 
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spent in any mental institution during the specific 
time period under examination. The patients were 
sent to state and veterans’ hospitals for treatment, 
without the treating hospitals being informed of 
their research status. There were no significant 
interinstitutional differences in duration of hospital 
stay for this sample. 

The Metalog Test (Zubin & Windle, 1951) was 
used as the measure of ability to shift conceptual 
set and was scored blindly and independently. A 
Proverb List (Benjamin, 1944) was administered and 
independently scored for abstraction, using a scale 
developed by Meadow (Meadow,  Greenblatt, 
Funkenstein, & Solomon, 1953). The severity of the 
thought disorder was clinically quantified by two 
separate and independently scored measures. The 
first of these was a rating of the severity of the 
formal signs on a 4-point scale. A score of O was 
given if no thought disorder was manifested during a 
l-hr, mental status interview, done during the first 
few days of hospitalization. A rating of 1 was as- 
signed if the patient showed circumstantiality, literal- 
ness, or concreteness. A rating of 2 was given if the 
patient showed autistic intrusions, predicative identi- 
fications, or loosening of associations. A rating of 3 
was assigned if the patient showed perseverations, 
echolalia, blocking, neologisms, or incoherence. The 
patient was given the score of the most severe 
formal sign demonstrated. For example, if he showed 
loosening of associations and echolalia he was given 
a rating of 3. The second measure was the fre- 
quency of intrusions of formal signs of schizophrenic 
thought disorder in the proverb responses. 

A clinical measure, designed to measure the de- 
gree of reality disruption on a 7-point ordinal scale 
of severity, was used (Cancro, 1962). Curvilinear 
correlation coefficients were used so that nonlinear 
relationships would be allowed to emerge. The RET 
was treated as the nominal variable with the 51 | 
cases divided into three equal groups of 17 each, 
representing the low, middle, and high range of 
differentiation. Pearson correlations were also calcu- 
lated and F tests were done on the basis of the dif- | 
ference between the curvilinear and the Pearson 
correlations. 


RESULTS 


The RFT correlated with the PRS at .28, 
F = 4.0816, p < .05. The intermediate group 
on the RFT consisted primarily of reactive 
cases as determined by the PRS. The most 
differentiated cases included most of the | 
process patients and a very few reactives. The 
most undifferentiated group contained both 
process and reactive cases. In summary, the 
process cases tended to be at the RFT & 
tremes, although more often at the highly 
differentiated rather than at the undiffer- 
entiated end, while the reactive cases were | 
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mostly in the midrange and to a lesser degree 
in the undifferentiated end of the continuum. 
The duration of hospital stay was correlated 
with RFT at four specific times during the 
3-yr. follow-up period—6, 12, 24, and 36 mo. 
The RFT correlated with 6-mo. duration at 
.24, p < .20; with 12-mo. duration at .33, 
< .05; with 24-mo. duration at .30, p< 
.05; and with 36-mo. duration at .22, p < .20. 
At all four of these points it was the midrange 
group on the RFT that had the shortest dura- 
tion of hospital stay. The least differentiated 
cases had an intermediate duration, and the 
most differentiated cases had the longest dura- 
tion of hospital treatment. The susceptibility 
of the curvilinear correlation coefficient to 
variations in the manner of grouping patients 
into low, medium, and high RFT scores sug- 
gested the wisdom of grouping the patients 
by other criteria to confirm the relationship 
between RFT and outcome. Taking those 13 
patients whose RFT score was within plus or 
minus .5 sigma from the mean as the mid- 
range group produced a second grouping with 
14 patients in the least differentiated and 24 
patients in the most differentiated groups. 
This second grouping of the RFT correlated 
with 6-mo. duration at .19; with 12-mo. 
duration at .32, p < .05; with 24-mo. dura- 
tion at .29, p < .05; and with 36-mo. duration 
at .23, p<.20. Witkin and associates 
(Witkin, Lewis, Hertzman, Machover, Meiss- 
ner, & Wapner, 1954) reported in a study 
drawn from the same observation hospital that 
only 20% of their cases represented the mid- 
range and that the remainder were equally 
split between high and low scores on the RFT. 
Grouping this patient. population in a similar 
manner with the middle 20% (m= 11) rep- 
resenting the intermediate cases still gave 
essentially similar results. The third group- 
ing of the RFT correlated with 6-mo. dura- 
tion at .28, p < .05; with 12-mo. duration at 
40, p< .01; with 24-mo. duration at .32, 
Ż < .05; and 36-mo. duration at .22, p < .20. 
The Metalog Test correlated with the RFT 
at .32 but the curvilinearity was not sig- 
nificant. The Pearson correlation between the 
Metalog Test and the RFT was .27 with the 
Most differentiated patients showing the most 
Conceptual plasticity. There was a strong 
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TABLE 1 


CURVILINEAR CORRELATIONS BETWEEN RFT 
AND DURATION oF HosPITAL STAY 


Mo. 
Grouping 
6 12 24 36 

First 24 .33* 30* 22 
Second 19 -32* 29* 23 
Third 28* 40** 32* 22 
Note—N = 51. 

*p«.05 

**$ <01 


linear correlation between the Metalog Test 
and Wechsler Adult Intelligence Scale 
(WAIS) Vocabulary score of .46 in this 
sample. There was a marginal linear correla- 
tion between RFT and WAIS Vocabulary 
score and when the effect of the Vocabulary 
score was removed, the correlation between 
the RFT and the Metalog Test became non- 
significant. 

There were no significant correlations be- 
tween the RFT and the measures of abstract 
attitude, severity of thought disorder, and de- 
gree of reality disruption. 


DISCUSSION 


The importance of performing perceptual 
tests on a drug-free population cannot be 
overemphasized. Differences between patient 
groups, such as process and reactive, on a 
variety of psychological measures can be 
solely an expression of degree of tranquiliza- 
tion. It is also essential to control for length 
of hospitalization, especially since the process 
patients in any given study are likely to have 
been hospitalized longer than the reactive 
group to which they are compared. 

The tendency to see process cases as Sicker 
and more primitive than reactive cases, 
coupled with the tendency to see less differ- 
entiated people as more primitive and less 
stable, may have contributed to the establish- 
ment of a mental set that the relationship be- 
tween these variables must be linear. 

The finding that the midrange of differ- 
entiation, as measured on the RFT, consisted 
almost exclusively of reactive schizophrenics 
suggests that an intermediate degree of dif- 
ferentiation was correlated with good prog- 
nosis. The relationship between RFT and 
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duration of hospital stay tended to support 
the conclusion that differentiation contributes 
to the predictive power of the PRS. The cog- 
nitive stance and defensive preference of the 
extreme groups on the RFT differed sharply. 
Both extremes of differentiation were seen as 
overly committed to a particular ego style 
and lacking in the flexibility to shift accord- 
ing to the demands of the situation. From this 
point of view, patients in the midrange of 
differentiation would have a better prognosis, 
since they are less firmly committed to a 
particular style and can change more readily 
(Haronian & Sugerman, 1967). While this 
change may signal a true improvement it may 
only reflect the ability of the patient to 
comply with what the treating institution ex- 
pects. One cannot help but be reminded of 
the warning about patients who improve 
rapidly only to lose their gains just as easily 
(Freud, 1964). This admonition, coupled 
with the failure of the RFT to significantly 
correlate with 6- and 36-mo. duration, sug- 
gested an examination of admission and dis- 
charge patterns over the 3 yr. of follow-up. 
The curvilinear correlation between RFT 
and total number of admissions was .24, 
p< .20, while that between RFT and the 
total number of discharges was .23, p < .20. 
While not significant, the results were in the 
direction of the midrange on the RFT, having 
a higher number of readmissions, with the 
most differentiated cases having the fewest 
readmissions and the least differentiated cases 
an intermediate number. This was the same 
pattern that emerged with the discharges as 
well. The difference in duration of hospital 
stay between various RFT groups becomes 
nonsignificant because the higher readmission 
rate of the midrange and least differentiated 
patients begins to compensate for the longer 
initial hospital stay of the most differentiated 
patients at about 36 mo. This pattern of 
hospitalization suggests the long-term predic- 
tive elements of the PRS are not those that 
are related to psychological differentiation. 
The failure of the RFT to correlate with 
any of the clinical variables studied suggests 
that these measures are not related to psycho- 
logical differentiation. The restricted range of 
the Ss tested leaves open the possibility that 
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a relationship between various clinical 
and differentiation may exist in other t; 
of patient samples. The clinical measures u: 
in this study were affected to varying degr 
by the severity of the schizophrenic proce 
and the failure to demonstrate a relations! 
to the RFT was consistent with the obser! 
tion that psychological differentiation 
tributes to the form of the pathology a 
not its severity. 
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COMMENT ON “ARTIFICIAL INDUCTION OF 
POSTHYPNOTIC CONFLICT” 


JOSEPH REYHER! 
Michigan State University 


Sheehan’s reason for discounting a previous investigation using posthypnotic 
conflict is factually incorrect, and his concept of “successful simulation” is 
challenged in this paper. His own procedures are questioned for several reasons: 
(a) he used a relatively weak posthypnotic conflict; (b) he did not incorporate 
any of the variables in his design which have been associated with the 
production of psychopathology in previous research; (c) he violated his own 
criteria for inducing posthypnotic conflict; and (d) his interpretation of his 
findings appears to be inconsistent. Recent investigations are cited which 
further document the pathogenic properties of the present author’s procedures 


for inducing posthypnotic conflict. 


In the course of developing a basis for his 
investigation of posthypnotic conflict, Sheehan 
(1969) discounted, on the basis of an im- 
proper control group, the results of investi- 
gations by Reyher (1961) and Moore (1964). 
Reyher reported the spontaneous repression 
of posthypnotically activated anger-aggres- 
sion and the production of a wide range of 
psychopathology, including many psychoso- 
matic symptoms. Using a modification of 
Reyher’s procedures, Moore found that high 
achievers tend to repress the induced anger- 
aggression more than low achievers. He also 
reported the production of psychopathology, 
but the incidence was lower. 

The control group described by Reyher 
(1961) was really a separate study in which 
the same Ss were given the experimental treat- 
ment under simulating and hypnotic condi- 
tions. The design was similar to one reported 
by Hilgard (1965, pp. 39-43) for sorting out 
order effects from main effects. The design was 
that highly susceptible Ss were given, in ran- 
dom or counterbalanced order, an experi- 
mental and a simulating condition. On one 
day S was hypnotized by E who left, and was 
awakened by a co-E who asked him to behave 
for E as if he were really hypnotized. On the 
other day, the co-E did not awaken S but 
merely gave a few simple suggestions that wer 
executed; E returned and gave S the experi- 
mental treatment. The S was not advised as to 


1 Requests for reprints should be sent to the 


author, Department of Psychology, Michigan State 
University, East Lansing, Michigan 48823. 


the second day until the completion of the 
condition on the first day. The main effects 
were determined through a comparison of con- 
trol and experimental conditions occurring on 
the first day. In contrast to the high reliability 
(87%, n — 23) of producing symptomatic 
reactions with this procedure, the simulating 
condition produced no symptoms at all for 
five consecutive Ss; consequently, it was 
considered pointless to run more Ss (Inter- 
ocular Traumatic Test, Edwards, Lindman, 
& Savage, 1963), even though not enough Ss 
had been run to sort out statistically the 
order effects from the main effects. 

It was the order effect in combination with 
another point that led Sheehan to discount 
Reyher’s and Moore’s results. Sheehan (1969) 
contended: 


The E, however, was unconcerned about motivating 
Ss to simulate successfully, and this procedure carried 
the strong implication that Ss should respond dif- 
ferently on the second day by changing their behav- 
E from the responses they had shown previously 
p. 16]. 


Although the word “unconcerned” does not 
reflect E's true attitude at the time, his use 
of the vague phrase “successful simulation 
is not an improvement in design unless an 
operational definition or at least some sort 
of objective guidelines or criteria are pro: 
vided to show how this is done by instruction 
to S. One of the best known facts of psycho- 
logical research is the sensitivity of dependent 
variables to type of instruction. Quite as! id 
from this lack of operational definition fot 
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successful simulation, the instruction to simu- 
late already alters the demand characteristic 
of the research design (Reyher, 1967) for 
control Ss; additional motivating instructions 
further confound the issue. Even Orne 
(1959), who was a consultant to Sheehan's 
study, has recognized the intrinsic and con- 
siderable motivating properties of simulating 
instructions. The opportunity for a college 
freshman or sophomore to contribute to sci- 
ence and to put one over on the professor or 
E is quite sufficient, although as already 
noted, this alters the demand characteristics 
of the research for the simulators in unknown 
ways. Their unusual and high motivation 
probably accounts for why they overplay their 
roles (Reyher, 1968), scoring higher on the 
dependent variables than hypnotic Ss. 

Sheehan's inclusion of Moore's study in his 
criticism of Reyher is incorrect, because 
Moore, unlike Reyher (1961), used separate 
groups of highly susceptible Ss for the simu- 
lating and experimental conditions. The con- 
trol Ss were unknown to E and were asked to 
simulate by a co-E. Moore's results cannot be 
brushed aside so lightly. 

No mention was made of Perkins’ (1965) 
successful replication and extension of 
Reyher’s procedures. He used the same con- 
trol procedures as Moore and found that 
the experimental and simulating Ss were 
grossly disparate in number and kind of 
symptoms as well as GSR activity in response 
to the recognition of tachistoscopically pre- 
sented conflict words. The GSR lends consid- 
erable objective support to the validity of the 
obtained psychopathology. 

Sheehan’s investigation is vitiated by his 
departure from Reyher’s paradigm for in- 
ducing pathogenic posthypnotic conflicts, 
Which he did not acknowledge, although its 
essential features were included in his own 
Criteria. He used a reproachable act, which 
is a relatively weak drive or emotional state, 
instead of the sudden and intense posthyp- 
notic activation of a basic drive. Conse- 
quently, his results were similar to the minor 
disturbances of other studies (Bobbitt, 1958; 
Huston, Shakow, & Erickson, 1934; Luria, 
1932; Pruesse, 1967) which also used a 
"forgotten" reproachable act with posthyp- 
notic suggestions that it will continue to 
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influence S in some way. He did not include 
any of those variables that put S's defenses 
under great pressure and thereby produce 
many symptoms representing a wide range 
of psychopathology (Perkins, 1965; Reyher, 
1958, 1961, 1967, 1968; Sommerschield, 
1969). These variables are (a) a sudden 
surge of overwhelming rage toward an au- 
thority figure (5) coupled with an overwhelm- 
ing impulse to destroy property belonging to 
said authority figure, and (c) activated sud- 
denly by a posthypnotic cue (conflict word). 

Modifications of the basic procedure sub- 
stantially diminish its pathogenicity. Moore 
(1964) modified the procedure by reducing 
Variable 2 from overwhelming rage to nearly 
overwhelming and by eliminating Variable 3 
entirely. Although he obtained a substan- 
tial amount of psychopathology, this was 
markedly less than what has been obtained 
by Perkins (1965), Reyher (1958, 1961, 
1968), and Sommerschield (1969). Another 
investigation by  Rokeach, Reyher, and 
Wiseman (1968) substitute reversals in be- 
liefs for anger-aggression (Variables 1 and 2), 
resulting in even a lower incidence of symp- 
tors. Veenstra (1969) eliminated Variable 2 
and found that there was no spontaneous 
repression of the induced anger and, accord- 
ingly, there was no psychopathology. 

The clinical relevance of Sheehan's data is 
weakened by his unintended departure from 


` the proper paradigm (Reyher, 1962, 1963). 


Although he apparently endorses that part of 
the paradigm which stipulates that the hyp- 
notically induced process must spontaneously 
produce S's posthypnotic behavior, his in- 
duced parammesia is incompatible with his 
intent because it contains specific posthyp- 
notic suggestions, namely, you will feel “un- 
easy,” “anxious,” “terrible,” “bad,” and “wor- 
ried.” Aside from noting any differences be- 
tween the experimental and simulating Ss, 
which there were, implications in regard to 
clinical phenomena cannot be drawn. 

This author is puzzled by the way in which 
the differences between the experimental and 
simulating Ss were handled. On the one hand, 


Sheehan opined: 


The pattern of results suggests that previous findings 
on the effect hypnosis has on personality after the 
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induction of conflict (e.g, Reyher, 1961) can be 
equally well explained by the demand characteristics 
of the experimental situation as by appeal to the 
effects of hypnotic treatment [p. 23]. 

On the other hand, he said of the hypnotic Ss: 
‘The subjective reality of their feelings argues for 
changes and distortions in their behavior which 
accord with individual differences in the personality 
dynamics of Ss. In support of this, the evidence 
showed that the hypnosis had no single effect on Ss’ 
personalities even though the same paramnesia was 
suggested to all Ss. Subjective reactions ranged from 
expression of somatic symptoms such as headaches 
to symptoms of unspecified distress, paranoid ideas 
and aggressive urges. Hypnotic Ss all reported that 
they experienced amnesia for the conflict material yet 
showed varying signs of breakdown of ego defenses. 
The most extreme reaction came from a hypnotic S 
who was agitated enough to hurl the TAT card at 
the E and say that he felt as if the police were 
watching him through a one-way screen which was 
placed in the room [p. 24]. 

These observations are peculiar to Sheehan’s 
experimental Ss and are precisely the kinds 
of reactions which our (Moore, 1964; 
Perkins, 1965; Reyher, 1958, 1961, 1967, 
1968; Sommerschield, 1969) experimental Ss 
have produced in great quantity, and which 
we consider to be of cardinal significance. 
Here is a method, finally, for spontaneously 
producing repression and psychopathology in 
the laboratory. We (Perkins, 1965; Reyher, 
1958, 1961, 1967; Sommershield, 1969) al- 
ready have strong evidence from this line of 
research that the frequency of symptoms is 
an inverse function of the degree of repres- 
sion and that the type of symptom is a 
function of degree of repression. Using an 
experimental design which included a rep- 
lication group, Basch (1968) has verified 
the relationship between degree of repression 
and frequency of psychosomatic complaints 
using a physical health questionnaire and an 
inventory of items scored in terms of degree 
of drive representation. 
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CONCEPT OF “PATHOGENESIS” IN PARENTS OF 
SCHIZOPHRENIC AND NORMAL CHILDREN * 


KEVIN M. MITCHELL ? 


Arkansas Rehabilitation Research and Training Center, Fayetteville 


In this cross-validation study the Thematic Apperception Test was ad- 
ministered to five pairs of parents of normal and schizophrenic children. 
Parents by diagnostic category were matched on age, education, and social 
class. Each story was judged pathogenic, benign, or unscorable and a patho- 
genic score was tabulated for each S from the formula: pathogenic/pathogenic 
plus benign, It was found that mothers of the normal children told signifi- 
cantly fewer pathogenic stories than mothers of the schizophrenic children. 
Although fathers did not differ significantly, their mean pathogenic scores 
were distributed according to the hypotheses of the study. 


Karon (1963) has conceptualized the 
schizophrenogenic mother as one who, when 
the needs of mother and child conflict, will 
indirectly satisfy her needs by manipulating 
her child in such a way that his behavior 
satisfies her needs, The covert nature of the 
mother’s demands and her ambivalence lead 
to her child’s inability to distinguish her mo- 
tives from his and to seriously impaired ego 
functioning. Following this lead, Meyer and 
Karon (1967) and Mitchell (1968) found 
that mothers of schizophrenic children pro- 
duced significantly more Thematic Appercep- 
tion Test (TAT) stories than mothers of nor- 
mal children in which the dominant individual 
either ignored expressed needs of the de- 
Pendent individual, or took from the de- 
pendent individual when their needs con- 
flicted. The authors termed these stories 
“pathogenic,” and concluded that, regardless 
of the direction of causation, the concept of 
the schizophrenogenic mother is valid. 

The purpose of the present study was to 
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cross-validate these findings on different 
samples of mothers of normal and schizo- 
phrenic children. Since available Ss included 
the fathers of the normal and schizophrenic 
children, a second hypothesis was that the 
fathers of the schizophrenic children would 
also tell significantly more pathogenic stories 
than the fathers of the normal children. 


MeEtTHopD 


The Ss were five sets of parents of normal and 
schizophrenic children who had been matched on 
age, education, and social class. Diagnosis of the 
schizophrenic children was based on interviews by 
two or more psychiatrists, as well as psychological 
tests. Normal children were nominated by the staff 
of a local junior college and a local high school. 

Fathers were presented Cards 1, 2, 3BM, 6BM, 
6GF, 7BM, 16, and 18BM. Mothers were presented 
Cards 1, 2, 3GF, 6BM, 6GF, 7GF, 16, and 18GF. 
Stories were typed on separate sheets of paper and 
were presented randomly to a clinical psychologist 
with approximately 4-yr. experience who blindly clas- 
sified all stories as (a) pathogenic, (b) benign, or 
(c) unscorable. Classification was based on Karon’s 
(1963) general formulation of “pathogenesis” which 
has been described in the preceding paragraph. How- 
ever, if the dominant individual acknowledged and/ 
or met the needs of the dependent individual, the 
story was scored benign (B). If there was no inter- 
action, the story was judged unscorable (U). 

In addition to the general criterion of meeting 
or not meeting the needs of a dependent individual, 
specific themes reflecting the pathogenic (P), B, 
and U categories had been collected from the two 
earlier studies (Meyer & Karon, 1967; Mitchell, 
1968) and were used by both judges as guidelines 
in the present study. Half of the randomized stories 
were presented to a second judge, an advanced 
graduate student in clinical psychology, who also 
classified all the stories as P, B, or U. A pathogenic 
score was tabulated for each S using the formula: 
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P/P +B, where P is the number of stories judged 
pathogenic and B the number of stories judged 
benign. 


RESULTS AND DISCUSSION 


Interscorer reliability for the two judges as 
measured by a product-moment correlation 
was .79. Since both hypotheses were direc- 
tional and differences in the opposite direction 
could not be explained within the framework 
of Karon's theory, all tests were one-tailed. 

The ¢ tests for matched pairs indicated 
that the mothers of the normal children told 
significantly fewer P stories than the mothers 
of the schizophrenic children (¢ = 2.595, p < 
.05). Mean P scores of the fathers of the 
normal and schizophrenic children were .43 
and .61, respectively, but this difference was 
not significant (¢ = 1.636, p < .10). 

The present data offer further cross-valida- 
tion of Karon’s concept of the schizophreno- 
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genic mother. Although the difference between 
the fathers was not significant, it was in the 
predicted direction. Significant differences 
might have been found between fathers if the 
N had been larger, or it may be that TAT 
themes specific to the “schizophrenogenic 
father” need to be identified. 
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PHYSIOLOGICAL EFFECTS OF RELAXATION TRAINING 


AND HYPNOTIC SUGGESTION. 


GORDON L. PAUL 1 
University of Illinois 


Comparative effects of hypnotic suggestion and brief relaxation training were 
evaluated with regard to reduction of subjective tension and distress (Anxiety 
Differential) and physiological arousal (heart rate, respiratory rate, tonic 
muscle tension, skin conductance). Three groups (N —20 each) of under- 
graduate female Ss participated individually for two sessions, 1 wk. apart, 
receiving (a) abbreviated progressive relaxation training as used in systematic 
desensitization therapy, (b) a hypnotic induction emphasizing direct sugges- 
tions of relaxation, heaviness, warmth, drowsiness, and sleep, or (c) a self- 
relaxation control procedure, included to evaluate the effects of merely resting 
quietly for an equal period of time with instructions to relax. In general, 
both relaxation training and hypnotic suggestion produced significantly greater 
effects than controls, and relaxation training resulted in significantly greater 
effects than hypnotic suggestion. The superiority or relaxation training was 


most pronounced in response systems not under direct voluntary control. 


While it is assumed that multiple factors 
are involved in the application of systematic 
desensitization therapy (Paul, 1966, 19692, 
1969b; Lang, 1969) the “working hypoth- 
esis” has been that the major effective 
mechanism follows the  countercondition- 
ing paradigm which Wolpe (1958) termed 
the "reciprocal inhibition principle." Accord- 
ing to this principle, the ability of given 
stimuli to elicit anxiety will be permanently 
weakened “If a response antagonistic to anx- 
lety can be made to occur in the presence of 
anxiety-evoking stimuli so that it is accom- 
panied by a complete or partial suppression 
of the anxiety responses . . . [Wolpe, 1958, 
P. 71]. Current research suggests that 
"anxiety" may best be thought of as a de- 
Scriptive term for a complex pattern of re- 
Sponse characterized by subjective feelings of 
apprehension and tension associated with 
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heightened physiological arousal, especially 
of the sympathetic branch of the autonomic 
nervous system (Spielberger, 1966). Thus, 
one basic requirement of the principle from 
which the systematic desensitization package 
was derived is that the abbreviated relaxation 
training used in systematic desensitization 
result in a reduced level of physiological 
arousal and subjective distress. 

Even though controlled investigation has 
not found hypnotic suggestion to result in 
significant physiological change over that 
achieved by control procedures (Barber & 
Hahn, 1963), hypnotherapists make strong 
claims in this regard (e.g., Corley, 1964), to 
a point of attributing the success of sys- 
tematic desensitization to “hypnosis” (see 
Cautela, 1966). The abbreviated relaxation 
training used with systematic desensitization 
does, in fact, share factors in common 
with hypnotic-induction procedures. Gill and 
Brenman (1959) noted that all standard-in- 
duction techniques (a) place extensive limits 
on sensory intake, (b) limit bodily activity, 
(c) restrict attention, (d) provide narrow and 
monotonous stimulation, and (e) alter the 
quality of bodily awareness. Barber and 
Calverley (1965) further noted that most 
hypnotic-induction procedures in the experi- 
mental literature include five components: 
(a) defining the situation as hypnosis, (5) 
instructing S to close his eyes, (c) administer- 


425 


426 


ing suggestions of relaxation, drowsiness, and 
sleep, (d) administering motivational instruc- 
tions, and (e) suggesting that it is easy to 
respond to further suggestions. Cautela 
(1966) correctly pointed out that only two 
of the characteristics noted by Barber and 
Calverley are regularly included in brief re- 
laxation training. However, all five elements 
mentioned by Gill and Brenman are also 
found in relaxation training. The major dif- 
ference between relaxation training and hyp- 
notic procedures appears to lie in the focus 
and effects involved in tension and release of 
gross muscle groups, defining the task as a 
passive “learning” situation in which S gains 
control rather than a “hypnotic” situation 
implying operator control, and the use of 
indirect suggestions of warmth, relaxation, 
etc, with instructions to maintain alertness 
rather than direct suggestions of relaxation, 
drowsiness, and sleep. 

Two specific questions thus appear of major 
importance to the practice of systematic de- 
sensitization therapy, and to further under- 
standing of the mechanisms involved in the 
effectiveness of the total treatment package: 
(a) Are reductions in physiological arousal 
and subjective distress producible within one 
to two sessions by either abbreviated progres- 
sive relaxation training or hypnotic sugges- 
tion? (b) Does progressive relaxation training 
produce effects which differ from hypnotic 
suggestion? Data relevant to these questions 
were obtained in the present study, where 
three groups of Ss participated individually 
for two sessions. The Ss of one experimental 
group received abbreviated relaxation train- 
ing, while those of a second experimental 
group underwent a hypnotic induction focus- 
ing on suggestions of relaxation, drowsiness, 
and sleep. The Ss of the third group served 
as self-relaxation controls in order to assess 
the results of the two experimental procedures 
against the effects attributable to merely 
resting quietly for the same period of time 
with instructions to relax. The effects of the 
procedures were then evaluated on four 
physiological measures (forearm-muscle ten- 
sion, heart rate, respiratory rate, skin con- 
ductance) and a self-report anxiety scale. 
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METHOD 
Subjects 


A total of 60 female Ss (17-23 yr. old) participated 
in the experiment as part of an introductory psychol- 
ogy research requirement. All Ss were screened for 
physical health and absence of drugs, but were 
otherwise unselected. No prior information was pro- 
vided about the experiment, except that they were 
scheduled for two 1-hr. sessions 7 days apart, and 
that physiological responses would be monitored, The 
Ss were assigned to the three experimental condi- 
tions (N = 20 each) according to the time of arrival 
at the laboratory, following a repeating Latin square 
in blocks of three. The latter procedure, while 
maintaining random assignment, also counterbalanced 
for possible confounding due to time of day, season, 
or E fatigue. 


Procedure 


All Ss were run individually by one 
married male advanced graduate student in clinical 
psychology, assisted by a polygraph operator—both 
under the regular supervision of the investigator. 
Upon entering the experimental room for the first 
session, S completed a set of paper-and-pencil scales, 
requiring a maximum of 10 min., the results of which 
will be reported elsewhere. She was then introduced 
into an electrically shielded enclosure, and placed in 
a semirecumbent position in a recliner chair. During 
the next 5 min, E and the polygraph operator at- 
tached a strain gage and electrodes for physiological 
recording, being careful to explain the purpose 0 
each, and assuring S that no shock or other un- 
pleasantries would be involved. The polygraph oper- 
ator then retired to the adjoining control room while 
S completed a pretreatment anxiety scale—the Anx- 
iety Differential (Husek & Alexander, 1963; Paul, 
1966). The E then related the following genera 
introduction to Ss in all conditions: 


E, a 24-yr.-old 


This experiment is being done in order to learn 
more about the clinical treatment of anxiety. 
Several of the procedures which we will use are 
taken directly from the clinical setting. Of special 
interest in the treatment of anxiety are the speci c 
physiological manifestations of tension or physio- 
Jogical arousal. We suspect that normal subjects, 
such as yourself, with differing personality charac- 
teristics will show differing physiological systems 
involved in relaxation and stress, even of very 
mild sorts. ^ 

The tests you just took are designed to indicate 
your major personality characteristics; and 
you're interested in their results you can contact 
us later in the semester and we'll be happy t° go 
over them with you. We hope that this experiment 
will lead to a technique whereby psychologists can 
quickly localize the areas where a given persons 
tension and anxiety are expressed, and the 
way of overcoming them. í 

Needless to say, we consider this work to be 9' 
utmost importance. Your cooperation will be neces- 


| 


RELAXATION TRAINING AND HYPNOTIC SUGGESTION 427 


sary in order to insure the validity of the results. 
As we indicated in your appointment letter, we'll 
have you come back again next week at the same 
time. We'll be asking you to take the Blue Test 
[Anxiety Differential] again at the end of today's 
session and at the beginning and end of the session 
next week, but you won't have to take any of the 
other tests again. 


The specific introduction for each experimental 
condition (described in the following paragraph), 
followed without interruption. After questions were 
answered, a 10-min. silent adaptation period began, 
with Ss instructed to sit quietly with eyes open 
“while we finish calibrating the instruments.” The 
last minute of this adaptation period was marked on 
the polygraph chart paper to serve as the pretreat- 
ment basal level (Period 1) for all physiological 
measures. All Ss were then instructed to close their 
eyes, and the next 2 min. were devoted to presenta- 
tion of imaginal stimuli in conjunction with another 
study, The treatment or control procedures were 
then undertaken for 24-28 min. (M = 27 min.) per S, 
matched across conditions with the progressive relax- 
ation Ss. Three 20-sec. time periods (Periods 2, 3, 
and 4) were marked on the chart paper of relax- 
ation Ss, corresponding to 4, 4, and 1 through the 
training procedure, to provide scorable samples of 
physiological activity. Identical time periods for 
scoring were then determined for the hypnotic and 
control Ss within the same block. The last minute 
of recording after completion of training, but prior 
to termination instructions, provided the equated 
time interval for posttreatment assessment (Period 
5). Immediately following termination instructions, 
a posttreatment Anxiety Differential was obtained. 

On Ss arrival for the second session, she was im- 
mediately placed in the recliner chair, and strain 
gage and electrodes again attached. The second pre- 
treatment Anxiety Differential was obtained, followed 
by a brief review of the introduction and query 
about the instructions of the previous session. As in 
the first session, a 10-min. adaptation period followed 
with the last minute again serving as a common time 
for assessment of pretreatment basal level (Period 
1). All Ss were then instructed to close their eyes, 
and the same treatment or control procedures in- 
volved in the first session were undertaken for 22-25 
min. (M —23 min.), equated across groups. Periods 
2, 3, 4, and 5 were marked on the chart paper as 
in the first session. After Period 5, 2 min, were 
devoted to presentation of imaginal stimuli in con- 
junction with another study, followed by an addi- 
tional 5 min. of the treatment or control procedure. 
Termination instructions were then given, and a post- 
treatment Anxiety Differential again obtained before 
disengaging S from the apparatus. 

Brief relaxation training. The progressive relaxation 
Procedures were those previously used with system- 
atic desensitization in controlled outcome studies of 
individual (Paul, 1966, 1967) and group (Paul, 1968; 
Paul & Shannon, 1966) treatment. Immediately after 
the general introduction, Ss assigned to this experi- 


mental condition were presented with the following 
specific introduction: 


The rest of the period will be spent training 
you to learn how to relax—probably to become 
more relaxed than you've ever been before. In 
order for you to learn this, we will use a clinical 
technique which was first developed by Dr. 
Edmund Jacobsen of the University of Chicago 
back in the 1930s. Basically, the procedure hinges 
on the fact that complete relaxation is the absence 
of tension. If you are completely relaxed, it is 
physiologically impossible to be tense or anxious, 
Since the modified Jacobsen technique can be 
learned readily, the real advantage is that you can 
use relaxation to counter any anxiety or tension 
you might feel in the future. 

The way in which we'll do this is to have you 
systematically focus your attention on the various 
gross muscle groups throughout your body, first 
tensing each group for a few seconds—holding 
them long enough for you to identify exactly where 
you feel tension, and what it feels like. Then, 
when I say "relax," I want you to immediately 
let go—to stop tensing—and merely focus your 
attention on what those muscles feel like as relaxa- 
tion takes place. By first tensing your muscles, the 
level of tension increases over your current oper- 
ating level, such that when you release the muscles, 
the level of tension drops below the point where 
you started, Each time you tense and release a 
muscle group, the resulting level of tension becomes 
progressively lower and lower—to the point where 
no tension is present at all. By focusing your at- 
tention on this process as it takes place, with 
practice, you can eventually reach the point where 
just thinking or recalling the experience of release 
is sufficient to bring about relaxation itself. 


The order in which muscle groups would be taken 
was then briefly described, demonstrating those 
exercises frequently found difficult. 


It’s important that you remember to focus your 
attention only on the specific muscle group we're 
working with at any given time, Especially so 
that you may learn the procedure, since TIl be 
asking you to practice relaxation before you come 
back next week. Actually it will be to your 
advantage to master this procedure, since, if used 
correctly, it can help a lot in reducing tension 
around exam times, give you a “second wind” when 
you're tired from studying, and even help you fall 
asleep at night when you may be tense or have 
many things on your mind. 

Since I'll ask you not to talk during the. train- 
ing procedure, except to answer my questions ith 
a simple yes or no, head nod, or "umhum," do 
you have any questions at this time?—All right, 
just sit quietly with your eyes open for a few 
minutes while we finish calibrating the instruments. 


‘After completion of the intervening procedures 
described in the preceding paragraphs, progressive 
relaxation training was undertaken following the 
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operations detailed in Paul’s (1966) experimental 
manual,? with S first instructed to focus her atten- 
tion on the dominant hand and forearm, to tense 
and hold these muscles, and then to release the ten- 
sion immediately upon E's instruction to “relax.” 
During the release period, instructions to focus 
attention with indirect suggestions of warmth, relax- 
ation, etc., were quietly repeated, for example: 


Relax—attending only to the relaxation process in 
your right hand and forearm—becoming more and 
more relaxed—warmer and heavier—nothing for 
you to do but notice the pleasant sensations in 
your right hand and forearm as they become more 
and more relaxed. 


The same procedure of tension, release, and focus 
with indirect suggestions was systematically followed 
through the other gross muscle groups (dominant 
upper arm, nondominant hand and forearm, non- 
dominant upper arm, forehead, eyes and nose, cheeks 
and mouth, neck and throat, chest, back, and 
respiratory muscles, abdomen, dominant upper leg, 
calf, foot, nondominant upper leg, calf, and foot). 
Through completion of the neck and throat muscles, 
a minimum of two tension-release cycles were insti- 
tuted, moving on to the next muscle group as soon as 
the present group was reported to be as completely 
relaxed as the previous one. Throughout the remain- 
ing muscle groups, S was asked whether the current 
group was as completely relaxed as the previous one 
following a single tension-release cycle. In all in- 
stances, however, the next muscle group was not 
undertaken until the current group was reported to 
be as completely relaxed as the previous, or until 
four tension-release cycles had been attempted. After 
all muscle groups had been completed, S was in- 
structed to simply attend to the relaxation taking 
place in her entire body (for an additional 30 sec.), 
and then asked if she felt amy tension at all. If 
tension was reported, further tension-release cycles 
were instituted in the particular muscle group. After 
a report of *no tension anywhere," S was instructed 
to "simply enjoy the state of relaxation—becoming 
even more relaxed—just listening to the sound of 
your own breathing," for an additional minute. 
Termination instructions were then given: "In a 
few moments I'm going to count backwards from 
four to one. On the count of four, start moving your 
legs; three, your hands and arms; two, your head; 
and one, sit up and open your eyes" The Anxlety 
Differential was obtained immediately following the 
count of one. The S was then asked about any diffi- 
culties during training and reassured that even bet- 
ter results would be obtained the following session. 
While S was being disengaged from the apparatus, E 
instructed her to practice relaxation twice per as 
between sessions, for about 15 min.; indicating that 


?'The timing in relaxation training involved 5-7- 
sec. tension periods with 20-30-sec. release periods 
(correction of a misprint in the 1966 manual, page 
119, in which release time is incorrectly givi 
10-20 sec.). ne 
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she could become even more skilled with practice. 

For the second session, preliminary instructions 
consisted of a query regarding the success of prac- 
tice during the previous week, clarification of any 
problems involved, and a brief statement that the 
procedures would be the same as the previous ses- 
sion, but were likely to take less time as S had some 
practice. 

During both sessions, the timing followed that 
presented in the experimental manual (Paul, 1966, 
see Footnote 2), except for the introduction of 
recording periods to allow systematic sampling of 
the polygraph record, without the activity generated 
by the active tension of muscles. Thus, an addi- 
tional 20 sec. of attention to relaxation, with indirect 
suggestions, was introduced three times during the 
training procedure, upon completion of the non- 
dominant upper arm (Period 2), the neck and 
throat (Period 3), and the dominant foot (Period 
4). 

Hypnotic suggestion. The hypnotic-induction pro- 
cedure consisted of a composite previously developed 
by the investigator to serve as an “all purpose” re- 
laxation induction for initial training of clinical stu- 
dents (Paul, 1965). Immediately after the general 
introduction Ss assigned to this experimental condi- 
tion were presented with the following specific intro- 
duction: 


The rest of the period will be spent training you 
to learn how to relax—probably to become more 
relaxed than you've ever been before. In order 
for you to learn this, we will use a clinical tech- 
nique which was first developed by Dr. M. V. 
Kline in New York in the early 1950s. Basically, 
the procedure utilizes hypnotic imagery to induce 
a state of deep relaxation. Since you can readily 
learn parts of the technique, the real advantage 15 
that you can use relaxation to counter any anxiety 
or tension you might feel in the future. 

The way in which we'll do this is to have you 
systematically focus your attention on a visual 
image of yourself. As I describe various sensations 
for your image, you'll find that you actually 
respond in accordance with the suggestions as We 
gradually induce deep relaxation with hypnotic 
suggestion. 

I am sure that you will find hypnosis a most 
interesting experience. Your ability to be hypno- 
tized depends on your willingness to cooperate. It 
has nothing to do with your intelligence or your 
willpower—if you pay close attention to what 
say, and follow what I tell you, you can easily 
fall into a hypnotic sleep and experience the in- 
teresting things I will tell you to experience. Hyp- 
nosis is nothing fearful or mysterious— your MIS 
ingness to cooperate and your interest are what 
ask for. 


if she had been hypnotized 
before, and told that the use would probably be 4 
bit different than other procedures she might I 
acquainted with. The order of events, from visus 


The S was then asked 


1 
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imagery, eye fixation of the image, and numerical 
deepening procedures were then described briefly: 


Of course we wouldn’t ask you to do anything 
which would make you appear foolish, nor is 
there any danger of your revealing things about 
yourself. In fact, I'll ask you mot to talk during 
the training period. You'll be completely aware of 
everything that’s going on at all times, and you'll 
be able to remember everything we do. I'll simply 
be helping you to learn how to relax completely. 
Actually, it will be to your advantage . . . [etc., 
exactly as in relaxation training instructions]. 


The standardized hypnotic induction was then un- 
dertaken, after the intervening procedures described 
in the preceding paragraphs. The induction was 
patterned after Kline's (1953) visual imagery tech- 
nique, with an eye fixation induction emphasizing 
suggestions of heaviness, drowsiness, sleep, and relax- 
ation, following Weitzenhoffer (1957) and Barber 
and Hahn (1963), but directed to S's image of her- 
self. Direct suggestions of heaviness, warmth, and 
relaxation were then focused on Ss' own specific 


. muscle groups, followed by numerical deepening pro- 


cedures, patterned after induction techniques used in 
previous research by London and Davison. A test 
challenge for arm immobilization from the Stanford 
Hypnotic Susceptibility Scale (Weitzenhoffer & Hil- 
gard, 1959) was included, after Period 4, for com- 
parative purposes. The entire procedure was ad- 
ministered in a lulling, soothing, "hypnotic tone," 
emphasizing focus on E's voice, and direct suggestions 
of drowsiness, sleep, warmth, comfort, heaviness, 
relaxation, slow and regular breathing. 

The termination instructions and procedure were 
the same as those used for relaxation training, with 
the additional statement that "When I reach the 
count of one you will be wide awake. You will feel 
alert, and well rested, as if you had taken a short 
nap." The S was instructed to practice the imagery 
procedure twice per day between sessions, for about 
15 min.; indicating that she could become even more 
skilled with practice. For the second session, pre- 
liminary instructions exactly paralleled those of the 
relaxation group, and the hypnotic induction followed 
the same procedures as the first session. 

Self-relaxation control. Immediately after the gen- 
eral introduction, Ss assigned to the control condi- 
tion were given the following specific introduction: 


The rest of the period will be spent with you 
merely sitting quietly and relaxing while we re- 
cord your physiological responses on the dyno- 
graph. 

We're especially interested in the different rates 
at which people become relaxed in various physio- 
logical systems, and the way in which you relax 
yourself. I'll just sit in the room with you while 
you rest quietly with your eyes closed. The only 
thing you'll have to do for this session is to be 

—— 


3P, London and G. C. Davidson, personal com- 
munication, 1964. 
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sure you don't fall asleep, however, I'll have to 
ask you not to talk during that period, so do you 
have any questions now?—All right, just sit 
quietly with your eyes open for a few minutes 
while we finish calibrating the instruments. 


After the intervening procedures were completed, 
Ss in this condition were told, *Now just get com- 
pletely comfortable and rest quietly with your eyes 
closed for the remainder of the time. Try to become 
completely relaxed, but be sure you don't go to 
sleep." Termination procedures were the same as 
those for the relaxation group, with the explanation 
given “so as not to disturb the electrodes.” The con- 
trol procedures for the second session were identical 
to those of the first session. 

Apparatus. All treatment and control conditions 
were conducted in an air-conditioned experimental 
room below ground. The shielded enclosure was a 
room within a room, measuring 7 X 5 X 64 ft., made 
of completely connected copper screening attached to 
a 2X2 in. wooden frame, with an earth ground. 
The polygraph, also with an earth ground, was 
located in an adjoining room, separated from E and 
S by a one-way glass. All electrodes were fed through 
a junction box within the enclosure, with a common 
S ground from the dominant leg, through shielded 
cable to the control room. 

Physiological measures were continuously re- 
corded on a 4 channel Beckman Type RB dyno- 
graph, equipped with 474A amplifiers, on Offner No. 
V654 chart paper driven at 5mm/sec. All electrodes 
were cleaned and recoated, where appropriate, after 
each S was run. Commercial electrode cream (Grass 
EC2) was used for all connections. The skin area 
for electrode placements was first cleaned with 
alcohol, and then rubbed vigorously for 1 min. with a 
gauze pad coated with electrode cream. The elec- 
trodes for recording muscle tension and skin con- 
ductance were first filled with cream and then taped 
in place within Dr. Scholl’s No. 312 foam-ease corn 
pads, also filled with cream. Electrodes for record- 
ing heart rate were filled with cream and taped di- 
rectly to the skin area, as was the ground. All 
electrodes, except the ground, were then lightly 
wrapped with elastic bandage to prevent movement 
artifact. 

Muscle tension was recorded from the dominant 
forearm in order to obtain maximal amplitude of 
tonic muscle action potentials, Silver cup electrodes 
(Grass E-1B) were attached, with the active elec- 
trode centered on the common extensor and the 
reference electrode on the flexor. The signal was fed 
through a Beckman 9852 EMG integrator. Muscle 
tension, in mean microvolts/second, was then deter- 
mined from the average of the greatest amplitude of 
integrated muscle action potential within each second 
of each recording period (60 sec. for Periods 1 and 
5; 20 sec, for Periods 2, 3, and 4), excluding heart- 
rate artifact. 

Heart rate was recorded from Grass E5G electrodes 
placed over the radial artery of each wrist, fed 
through a Beckman 9857 cardiotachometer coupler. 
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TABLE 1 
OVERALL MEANS AND CORRELATIONS FOR INITIAL RESTING LEVEL, PERIOD 1, 
AND FINAL PERIOD, PERIOD 5, ON ALL MEASURES 
Anxiety Differential Heart rate Respiratory rate Muscle tension Skin conductance 
Period 
M SD M SD M SD M |r SD M SD 
Session I 
iod 1 62.90 | 12.5654 | 86.22 | 12.0030 | 17.08 | 3.6305 22.04 | 18.9379 | 3.246 | .58022 
Period 5 50.17 | 11.9464 | 81.17 | 11.4687 | 14.30 | 3.6622 | 19.66 | 17.4039 | 2.533 57472 
Pearson r A646 .8882 .3750 9743 .6981 
Seer ii 56.40 | 15.4176 | 84.83 | 11.6723 | 17.57 | 3.6677 | 24.26 | 18.7846 | 2.899 | .84625 
Period 5 A705 | 11.5310 | 79.33 | 11.2652 | 15.58 | 4.0171 | 21.99 | 17.6015 | 2.098 | .64234 
Pearson r .5052 .8700 .4598 .9657 .1535 


Note.—Heart rate in beats/minute ; respiratory, rate in cycles/minute; muscle tension in mean microvolts/second ; skin con- 


ductance in micromhos; M = 60; all Periods 1-5 dii 
a Nonsignificant correlation; however, part-score 
analysis. 


Heart rate, in beats/minute, was then determined by 
counting pulses from cardiotachometer recordings, or 
R-wave peaks from electrocardiogram (EKG) re- 
cordings, within each recording period with appropri- 
ate conversions for the 20-sec. periods. 

Skin conductance measures were obtained through 
a Beckman 9892A skin resistance coupler from 
Grass E-1B silver-silver chloride electrodes placed on 
the dorsal and volar (just in front of the meta- 
tarsals) surface of the dominant foot. The lowest 
resistance in K ohms was then determined for each 
recording period, and converted to conductance in 
micromhos. 

Respiratory rate was obtained from a 10-in. 
mercury strain gage attached to an adjustable elastic 
belt which was fitted around the chest at the bottom 
of the sternum. The strain gage activated a Parks 270 
plethysmograph, which fed directly through a Beck- 
man 9853 strain gage coupler. Within each record- 
ing period, data were first transcribed according to 
respiratory period (number of seconds to complete 
10 respiratory cycles), and then converted to respira- 
tory rate in cycles/minute. 4 

All data reduction was carried out by two trained 
clerks, overlapping scoring on 20 occasions for each 
measure, with interscorer reliability exceeding .999 on 
all measures, and additional spot checks maintained. 
Any missing data, due to equipment artifact (less 
than two for any measure), were replaced by the 
unweighted mean for the treatment group. Conver- 
sions and data analyses were performed by the IBM- 
7090 computer of the University of Illinois Computer 
Science Laboratory. 


RESULTS 


Randomized groups analyses of variance 
performed on all Period 1 physiological mea- 
sures and pretreatment scales obtained in the 
first session found no significant between- 
groups differences for any measure (all p’s > 


ferences significant at p < .001. X ‘ 
analysis included for consistency-identical results obtained with raw score 


.20), indicating that the assignment procedure 
had adequately randomized and equated 
groups prior to the introduction of treatments. 
Examination of response to the arm immobi- 
lization challenge for the hypnotic group found 
“pass” rates of 90% for the first session and 
85% for the second session, as compared to 
14% and 36% on standardization samples 
(Hilgard, 1967), indicating a high degree of 
effectiveness, by usual standards, for the in- 
duction procedure. 

The unweighted means for initial resting 
levels and final period of each session for the 
total sample are presented in Table 1 for gen- 
eral orientation. Table 1 reveals a change in 
the “relaxed” direction in each session, which 
was found to be highly significant (all 's < 
.001) for all measures by £ test of the differ- 
ence between correlated means. However, in- 
spection of the average change from pretreat- 
ment resting levels during Session I and II 
for each group, presented in Figures 1 and 2, 
respectively, shows that the relaxation group 
was the only one to consistently produce such 
changes across measures and occasions. 

While within-session trends are of interest, 
the most relevant data concern the compara- 
tive change between groups from Period 1 to 
Period 5 during each session. Tn order to rule 
out possible influence of basal levels on the 
magnitude of response, as is frequently foun 
with physiological data, a base-free measure 
of change is desirable (Benjamin, 1967; 
Tucker, Damarin, & Messick, 1966). There- 
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Fic. 1, Mean change from initial resting level on all measures during Session I. 
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Fic. 2. Mean change from initial resting level on all measures during Session II. 
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fore, residual change scores, or “part scores,” 
consisting of the difference between the ob- 
tained Period 5 score and that predicted by 
linear regression from the Period 1 score, were 
computed for each S on all measures before 
further analyses were undertaken. The overall 
evaluation of differential treatment effects was 
then carried out by analyses of variance on 
final period part scores for each measure in 
each session. 

Highly significant treatment effects (5 < 
01, df = 2/57) were found for the Anxiety 
Differential in both sessions (F = 11.083 and 
8.921) and for all physiological measures, 
except skin conductance (Session I; F= 
12.423, 8.208, 5.716, for heart rate, respira- 
tory rate, and muscle tension, respectively; 
Session II: F = 9.786, 6.190, and 6.688, re- 
spectively). Although the combined change 
for skin conductance was significant in both 
sessions, no significant differences were found 
between treatments on either occasion (F = 
1.696 and < 1, for Session I and II, df= 
2/57). These analyses indicate that differen- 
tial degrees of reduction in physiological 
arousal and distress occurred among groups. 

A finer grained analysis was then under- 
taken on all measures except skin conduc- 
tance to determine the specific effectiveness of 
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relaxation training and hypnotic suggestion, 
and whether relaxation training produced ef- 
fects which differed from those attainable 
with hypnotic suggestion, The results of these 
multiple comparisons are summarized in Table 
2. As seen by joint inspection of Figure 1 and 
Table 2, in the first experimental session, re- 
laxation training produced significantly greater 
changes in the relaxed direction than controls 
on all measures tested. Additionally, relaxa- 
tion training produced significantly greater 
decreases in heart rate and tonic muscle ten- 
sion than hypnotic suggestion, while hypnotic 
suggestion resulted in significant decreases in 
respiratory rate and Anxiety Differential over 
controls. While the latter two measures fol- 
lowed the same trend as others, showing 
greater decreases for the relaxation group 
than for the hypnotic group, these differences 
did not achieve statistical significance. 

In the second experimental session (see 
Table 2, Figure 2), relaxation training was, 
again, found to produce significantly greater 
decreases than controls on all measures tested. 
As in the first session, significantly greater 
reductions in heart rate and tonic muscle ten- 
sion were found for relaxation training than 
for hypnotic suggestion. In the second ses- 
sion, however, the hypnosis group was also 


TABLE 2 


Duncan MurrIPLE-RANGE TESTS OF SIGNIFICANCE OF BETWEEN-GROUP DIFFERENCES ON FINAL 
PERIOD Part Scores ror EACH MEASURE 


Comparison (N = 20/group) 
Measure Relaxation vs. control Relaxation vs. hypnosis Hypnosis vs. control 
F Obtained SSR Obtained SSR Obtained SSR 
p 5,381 
Anxi i i 14.311 > 5.674 2.812 (ns) 5.381 11499 > S 
erie NN 5.464 > 2.358 6.575 > 2492 1.111 (ws) ds 
Respiratory rate 3.884 > 1.700 1.293 (ns) 1.609 Eg. (e) AER 
Muscle tension 3462 » 2031 3.267 1.922 E. 92 
w Saat conductance* — — — — 
‘SSion 
Anxiety Differential 10715 > 3.620 1.039 (ns) 3.426 26/8 2 de 
HUE $5 2 18i 145 Gs) 158 221] S 1.738 
espirat t 3.632 > 1. ` j i 
fate teint 3 4836 » 2337 2.602 > 2.211 2234 > 2.211 
Skin conductance* = — = ae 


Note.—SSR = shortest significant range at p < .05 and joint protection level of 90%. 
^ Nonsignificant F; therefore; multiple comparisons not computed. 
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TABLE 3 


PERCENTAGE, OF SUBJECTS SHOWING SIGNIFICANT 
REDUCTIONS IN THE RELAXED DIRECTION OVER 
COMBINED MEASURES 


Treatment | $$ 
0-1/4 2/4 3/4 4/4 
Session I* 
Relaxation training 5 10 85 
Hypnotic suggestion 20 50 30 
Control 5 35 50 10 
Session II^ 
Relaxation training 10 20 70 
Hypnotic suggestion 10 65 25 
Control 35 35 25 5 


Note.—Skin conductance not included as all Ss showed 
significant reductions, 

ax? = 26.12, df = 4, p <.001, N = 20 per treatment. 

by? = 35.90, df = 4, p <.001, N = 20 per treatment. 


found to produce significantly greater de- 
creases than controls on all measures tested, 
suggesting a training effect. Overall compara- 
tive changes from the beginning of Session I 
to the end of Session II were identical with 
those found in Session II, with even greater 
between-groups differences in magnitude of 
response. Thus, it seems clear that both hyp- 
notic suggestion and relaxation training did 
produce significant cognitive and physiologi- 
cal changes within the brief training sessions, 
and that relaxation training resulted in effects 
over and above those attributable to hypnotic 
suggestion. 

As is typical of physiological data, signifi- 
cant positive but low intercorrelations (.20s 
and .30s) were found among part scores on at 
least one occasion for all measures, with the 
exception of skin conductance. All correlations 
between the latter part scores and other mea- 
sures were negative, both within and between 
occasions, with about half reaching statistical 
significance (|r| > .25, p < .05). In contrast, 
the part-score stability coefficients for each 
measure from Session I to Session II were 
higher and all significant (r = .76, .62, .69, 
.62, and .58, respectively, for Anxiety Differ- 
ential, heart rate, respiratory rate, muscle ten- 
sion, and skin conductance, p < .01). These 
data suggest that the “relative response 
stereotypy” (Lacey & Lacey, 1958) frequently 
found in stress research, might also operate 
when physiological arousal is decreased as 
well as increased. However, excluding skin 
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conductance, highly significant coefficients of 
concordance (Siegel, 1956) were obtained for 
the relative consistency across measures of § 
response (W = .37 and .30 for Sessions I and 
Il, p< .001) and relative treatment effect 
(W = .81 and 1.00 for Session I and II, p< 
01). 

While the magnitude of response on any 
given measure may be partially a function of 
each S's relative reactivity in the specific sys- 
tems, the direction of change in each system 
should be the same such that differences in 
the degree of decreased arousal should also 
appear in the number of systems showing 
change in the relaxed direction, irrespective of 
magnitude, These data are summarized in 
Table 3, where the comparative figures on 
number of measures agree entirely with the 
earlier analyses on magnitude. 


Discussion 


The combined findings show that both the 
brief relaxation training previously used with 
successful systematic desensitization therapy, 
and, the hypnotic suggestion procedure were 
effective in reducing subjective reports of 
tension and distress, even within a single 
session. By the second induction, hypnotic 
suggestion was found to produce significantly 
greater decreases in physiological arousal than 
self-relaxation controls, as measured by 
changes in heart rate, muscle tension, and 
respiratory rate; although, only the system 
under voluntary control (respiration) showed 
such effects during the first induction. In 
contrast, relaxation training was found to 
result in significanty greater decreases than 
controls on all of the above measures from 
the first training session, with effects main- 
tained in the second. Further, the relaxation 
training procedure produced significantly 
greater reductions than hypnotic suggestion In 
systems not under direct voluntary control 
(heart rate, tonic muscle tension) in both ses 
sions. Thus, with regard to the questions 
which led to the present study, it is clear that 
(a) both hypnotic suggestion and abbrevi- 
ated progressive relaxation training do pro 
duce decreased physiological arousal and sub- 
jective distress within one to two sessions; 
(b) progressive relaxation training is more 
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effective than hypnotic suggestion in produc- 
ing desired physiological changes, whether 
considered in terms of efficiency, intensity, or 
extent. 

The findings with skin conductance, in 
which all Ss showed reductions over time in 
both sessions, without between-groups dif- 
. ferences raise important questions regarding 
this measure. Higher conductance levels have 
recently been found under conditions of at- 
tention, novelty, or unfamiliarity than under 
actual stress (Flanagan, 1967; Roessler, 
Burah, & Childers, 1966), in some cases 
changing in the opposite direction to other 
physiological measures (MacNeilage, 1966), 
thus suggesting greater independence from 
sympathetic control. While autonomically 
mediated sweat gland activity has been as- 
sumed, historically, to form the primary basis 
for tonic changes, a number of factors have 
now been found to influence skin conductance 
(Darrow, 1967; Martin & Venables, 1966; 
Wilcott, 1967). Especially important is recent 
evidence that the permeability, or “water 
load,” of the cornium and underlying layers 
of the epidermis between the surface electrode 
and deeper tissues influences the level of con- 
ductance, particularly under conditions of “low 
reactivity at high resistance,” such as those 
of the present study (Darrow, 1964; Edel- 
berg, 1966). Additional studies varying local 
blood supply by drugs or arterial occlusion 
indicate that, in the relative absence of sweat 
Bland activity, increased vasodilation results 
in increased water load, and as a consequence, 
higher skin conductance (see Juniper, Blan- 
ton, & Dykman, 1967). Thus, it seems possi- 
ble that skin conductance might gradually 
decrease along with sweat gland activity; 
however, as greater reductions in sympathetic 
activity occur, peripheral vasodilation could 
Paradoxically effect the level of skin conduc- 
tance, This hypothesis seems sufficiently ten- 
able at present to recommend extreme caution 
ìn the interpretation of skin conductance in 
any investigation expected to reduce physio- 
logical activity below normal resting levels. 
Until additional clarification regarding skin 
Conductance has been obtained, future inves- 
tigations of the latter type might gain more 
relevant data through direct measurement of 
Sweat production and blood volume. 
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Hypotheses regarding the influence of fac- 
tors noted by Gill and Brenman and Barber 
and Calverley, described earlier, appear on 
examination of intrasession trends (Figures 1 
and 2) combined with the significant differ- 
ences in extent and magnitude of response. 
Such examination suggests that the limita- 
tions on sensory input and bodily activity 
plus motivational instructions, and eye closure 
—common factors in all three conditions— 
were not actively involved in reducing arousal 
level. These common factors may have been 
sufficient to avoid an increase in arousal, but 
apparently exerted little further effect after 
the adaptation period. Restricted attention, 
narrow and monotonous stimulus input, and 
suggested ease of response were additional 
factors common to both relaxation training 
and hypnotic conditions, The results obtained 
on the Anxiety Differential suggest that the 
latter factors in combination with either di- 
rect or indirect suggestions of relaxation, are 
sufficient to alter the cognitive experience of 
distress, while the results on physiological 
measures suggest that the differential content 
of stimulus input and attentional focus were 
of prime importance in the alteration of so- 
matic responses, Thus, defining the situation 
as hypnosis, followed by direct suggestions of 
sleep, drowsiness, relaxation, etc., and arm 
immobility challenge, resulted in erratic 
physiological responses during the first ses- 
sion, with ultimate changes only in the volun- 
tarily controlled response systems. During the 
second session, physiological response became 
less erratic, with ultimate change in systems 
not under direct voluntary control as well. On 
the other hand, defining the situation as pas- 
sive learning, followed by systematic tension 
release of gross muscle groups, with indirect 
suggestions contingent upon Ss’ own behavior, 
resulted in consistent response with ultimately 
greater changes in systems not under direct 
voluntary control in both sessions. 

The findings for hypnotic suggestion add 
support to the growing body of evidence that 
hypnotic phenomena represent nothing dis- 
continuous from other forms of behavioral in- 
fluence. Hypnotic studies frequently report 
greater changes on self-report measures and 
voluntarily controlled response systems (Bar- 
ber, 1965). These data are in agreement with 
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the interpretation of hypnotic effects within a 
learning framework in which suggestions may 
first lead to voluntary compliance, but, as 
restricted attention results in operator-emitted 
stimuli becoming more central and relevant, 
previously learned responses would be more 
likely elicited upon continued presentation of 
the verbal stimuli (Paul, 1963). In contrast 
to the hypnotic procedure, relaxation training 
appears to have induced changes as a conse- 
quence of abrupt release after tensing muscle 
systems, without relying solely on past learned 
responses to verbal or ideational stimuli. The 
indirect suggestions used in focusing attention 
on each muscle group during the release phase 
might, then, actually strengthen the associa- 
tion between these stimuli and physiological 
responses, such that verbal and ideational 
stimuli could become more adequate elicitors 
of relaxation responses, even in the absence of 
prior tension, as well as contributing to alter 
cognitive experience (see Paul, 1966; Paul & 
Shannon, 1966). 

The results obtained in the present investi- 
gation reveal brief relaxation training to pro- 
duce subjective and physiological changes in 
the direction opposite to anxiety, as assumed 
by the reciprocal inhibition principle from 
which systematic desensitization therapy was 
derived. A necessary requirement for inter- 
preting the major mode of operation of sys- 
tematic desensitization as response suppres- 
sion or modified counterconditioning is thus 
met. Grossberg and Wilson (1967) have re- 
cently documented increased physiological re- 
sponse to the imaginal presentation of fearful 
stimuli, as compared to neutral stimuli ap- 
propriately counterbalanced across Ss for ac- 
tivity. Additionally, Davison (1968) found 
significantly greater improvements for Ss 
treated by systematic desensitization than for 
Ss treated by the same procedures without 
relaxation training. Several studies (Lang, 
1969) have also reported fewer “signals” of 
anxiety during image presentation following 
relaxation training than in the absence of such 
training. Thus, while not excluding the im- 
portance of other factors, several converging 
lines of evidence support the contention that 
relaxation training produces effects which op- 
erate antagonistically to inhibit anxiety re- 
sponses, as the underlying principle predicts. 
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EFFECTS OF VARIATIONS IN THE PREPARATORY 


INTERVAL ON THE REACTION TIMES OF 
RETARDATES AND NORMALS* 
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University of Alabama 


Two studies were conducted to evaluate the effects, on simple reaction times, 
of warning signal duration (WSD) and preparatory interval (PI) with normal 
and mentally retarded Ss. In the first study PIs of 2, 4, 8, and 12 sec. were 
presented in an unpredictable sequence. For half of the Ss the warning signal 
remained on throughout the entire PI (filled condition) and for the re- 
mainder of the Ss, the warning signal was present for only 1.5 sec. of the PI. 
Significant effects of intelligence, PI, and WSD were found. The interaction 
Intelligence X PI was also significant. The unfilled condition produced faster 
RTs for both normals and retardates. The interaction resulted from the par- 
ticularly poor performance of the retardates at the short PIs. The second study 
employed a 1.5-sec. WSD with PIs of 2, 4, 8, 16, and 32 sec. An interaction 
again was found involving intelligence and PI. The results were discussed in 


terms of temporal uncertainty. 


The procedures employed in the reaction- 
time (RT) task typically involve presentation 
of a discrete warning signal followed by a 
blank interval. The entire period set off from 
the beginning of the warning signal to the 
appearance of the reaction signal is known 
as the preparatory interval (PI). An implicit 
response sequence, sometimes referred to as 
“prepartory set,” is assumed to occur during 
PI as S readies himself to respond quickly 
to the reaction stimulus. 

Usually such “sets” are indirectly inferred 
from the effects that manipulations of the 
PI produce upon reaction time. If the inter- 
vals within a session are unpredictably vari- 
able in duration and S is therefore uncertain 
about the onset of the reaction signal, he 
must maintain his readiness throughout the 
entire interval in order to be consistently fast 
in his responding. The ability to maintain this 
set has been the focus of a number of re- 
searchers working with abnormal persons. One 
implication of this concern is that certain 
types of disturbances are reflected in an im- 
paired capacity to hold or direct attention. 

Recently several studies have been con- 


1Requests for reprints should be sent to Alfred 
A. Baumeister, Department of Psychology, Univer- 
sity of Alabama, University, Alabama 35846. 

2'The authors wish to express their gratitude to 
the patients and staff at Partlow State Hospital and 
School, Tuscaloosa, Alabama, for their cooperation. 


ducted in which the RTs of mental retardates 
and normals have been compared in relation 
to certain manipulations of the PI. Hermelin 
(1964) has reported that subnormal Ss are 
differentially disadvantaged by long PIs. An 
interaction of this nature is predicted by 
Ellis (1963) who has postulated that re- 
tardates suffer short-term memory (STM) 
defects and, consequently, are particularly 
handicapped in situations that require them 
to respond on the basis of absent cues. A 
study by Terrell and Ellis (1964), conducted 
as a direct test of this STM deficit hypothesis, 
compared normal and retarded Ss under con- 
ditions where PI was either partially or com- 
pletely filled by the warning signal. Their 
data indicate an interaction in which the 
retardates, but not the normals were facili- 
tated by the “filled” intervals. Supposedly, 
dependency on STM was less critical when 
the warning signal was constantly present. 
Unfortunately, a number of more recent 
studies, reviewed by Baumeister and Kellas 
(1968), not only fail to confirm the find- 
ings of Hermelin (1964) and Terrell and 
Ellis (1964) but actually report results that 
lead to opposing conclusions (e.g. Hawkins 
& Baumeister, 1965; Kellas & Baumeister, 
1968). / 
Procedural differences among the various 
studies are of sufficient magnitude to make 
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VARIATIONS AND REACTION TIMES OF RETARDATES AND NORMALS 


3 

direct interexperimental comparisons virtually 
impossible. Any one of a number of pro- 
cedural factors conceivably could account for 
the discrepant findings. The two experiments 
reported here represent an effort to evaluate 
further the nature of possible interactions in- 
volving characteristics of PI and intelligence. 
The first study is a fairly close replication 
of the procedures employed by Terrell and 
Ellis, the second investigation was concerned 
with the contextual effects of irregular PIs 
differing markedly in duration. 


EXPERIMENT I 
Method 


Subjects. An effort was made to obtain Ss similar 
to those tested by Terrell and Ellis (1964). Thirty 
Ss between the ages of 17 and 28 were selected 
from the population of a state institution for the 
mentally retarded. The mean age of the group was 
22.2 yr, SD — 3. Their IQs, taken from institution 
records, had a mean of 55.5, SD = 72. Ss exhibit- 
ing gross sensory or motor defects were not included 
in the sample. A comparison group of 30 normals 
was selected from an introductory psychology class 
at the University of Alabama. Their mean age was 
204, SD — 5.1. 

. Apparatus. The apparatus consisted of a red warn- 
ing light, 1 in. in diameter, a response telegraph key, 
à door buzzer, a Hunter Klockounter, and a series 
0f decade timers for control of stimulus presentation. 
The light and response key were mounted on a 
small table. The retardates were tested in a small 
Toom on the institution grounds, the normals in a 
Similar room at the University. Both rooms were 
dark except for illumination provided by a single 
Tw, lamp. Recording and controlling equipment was 
located in adjacent rooms. 

Procedure. Normals and retardates were randomly 
divided into two subgroups of 15 Ss each. Tests of 
Significance revealed no chronological age (CA) bias 
in the assignment of Ss to subgroups. No significant 

N difference was found between average IQs of the 
retarded subgroups. 

One subgroup received a filled condition in which 
the warning light remained on throughout the PI 
Until the response was made to the reaction signal 
(door buzzer). The other group received an unfilled 
condition in which the warning light was presented 
only for the initial 1.5 sec. of the PI. All Ss were 
given 10 trial blocks consisting of 4 trials each. In 
every trial block each PI of 2, 4, 8, and 12 sec. 
Was Presented in random order. Each S was tested 
with a different random sequence of PIs. The inter- 
trial interval was 8 sec. 

A s S was instructed to depress the response key 

e onset of the warning light and to hold it down 

Until the buzzer sounded. He was told to lift his 
ger as quickly as possible upon hearing the buzzer. 
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The swiftness of the response was emphasized in the 
instructions. Three practice trials were given. If 
necessary, the instructions were repeated or clarified 
during these trials. Before entering the test room, 
the retardates were shown an assortment of prizes 
and were told that if they “did well” they could 
choose one to keep after the test. 


Results 


Medians were obtained for the 10 RTs for 
each S under the various warning conditions. 
Means of these median scores are plotted in 
Figure 1. Standard deviations in relation to 
these points ranged .03—08 sec. The results 
of analysis of variance applied to the median 
data are summarized in Table 1. 

Main effects of PI, intelligence, and warning 
signal duration (WSD) were all significant. 
With regard to the WSD effect, it should be 
emphasized that it was the unfilled condition 
that produced the fastest reactions for both 
normals and retardates (p < .01). 

The interaction Intelligence X PI was also 
significant. Simple analyses of this effect re- 
vealed that the retardates, relative to the 
normals, were differentially affected (p < .05) 
at the shortest (2-sec.) PI. An examination 
of the sequential pattern of the warning 
periods indicated that the retardates were 
particularly slow on trials in which the 
2-sec. PI had been preceded by the longest 
(12-sec.) interval. 


EXPERIMENT IT 


One point on which the present findings 
are in agreement with those of Terrell and 
Ellis concerns the interactive effects of PI and 
intelligence on RT. In both studies the re- 
tardates were relatively disadvantaged under 
the shortest PIs. This result directly contra- 
dicts expectations based on the stimulus- 
trace hypothesis and the findings of Hermelin 
(1964). 

The second study in this series was under- 
taken to assess further the effects of the PI 
on the RTs of normal and retarded Ss. In 


3 Differences in variance between intelligence 
groups, while not extreme, were nevertheless signifi- 
cant (Fmax = 9.84). Consequently, the critical values 
were obtained by referring to tabled values for one- 
half the alpha level actually reported. This is the 
procedure recommended by Lindquist (1953) to 
avoid understating the probability of a Type I error. 
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Fic, 1. RT as a function of intelligence group, preparatory interval, 
and warning condition (Study 1). 


this case, some of the intervals were extended 
to 32 sec. in order to insure a fair test of 
the predicted interaction Intelligence x Delay 
Interval. The retardates were subdivided into 
“fast” and “slow” groups on the basis of 
previous RT scores, This procedure was fol- 
lowed to permit an evaluation of the effects 
of PI in relation to general RT ability within 
the retarded population. 


Method 


Subjects. The normals were 15 volunteer college 
students from an introductory psychology class. 
Their mean age was 20.6 yr. The retardates were 
30 institutionalized males who had participated pre- 
viously in at least two RT experiments. On the 
basis of their earlier performance, the retarded Ss 
were assigned equally to fast and slow groups. The 
mean ages were 22.9 yr. and 23.2 yr.; the mean 
IQs were 60.31 and 51.3, respectively. The difference 


TABLE 1 


ANALYsIS OF VARIANCE ON MEDIAN REACTION TIMES 
For STUDY 1 


Source 


Intelligence (A) 
Filled-unfilled a) 
AXB 


Error (A, B, A X B) 
EDO interval (C) 


AXBXC 
Error (C,A XC,B XC,A XB XC) 168 | 


in mean IQ was found to be statistically significant. 
(p < 05). É 

Apparatus. The apparatus and test conditions were 
the same as in Experiment I. m 

Procedures. Procedures were essentially identical 
to those employed in the first study under the 
unfilled condition. A warning signal of 1.5 sec. was 
followed by a variable interval of 2, 4, 8, 16, or 9 
sec. These intervals were presented systematically 
over 50 trials so that each PI preceded every other 
PI an equal number of times. AIl Ss within each 
of the three subgroups received different sequences 
of PIs. 

A 


Results 


The 10 observations for each S under each 
of the preparatory conditions were average". 
Curves for the three groups are displayed in 
Figure 2. Standard deviations ranged .03-.10 
sec 

Analysis of variance based on these data re- 
vealed the interaction (p < .01) Groups X PI 
An inspection of Figure 2 discloses that this 
interaction arose from the relatively poor P2 
formance of the slow group of retardates ^ 
the 2-sec. PI. A more detailed examination v. 
the contextual effects of the various PIs in’ 
cated that although all three groups shota 
a tendency to respond more slowly 0? E 
trials that had been immediately preceded e 
trials involving a long PI, the slow retarda 


" ere 
4As in Experiment I, the intelligent groups ij K, 
not equally variable in performance. 


] 
5 i3 iHi dues. 
adjustment was made in obtaining the critical v4 
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Fic. 2. RT as a function of group and preparatory interval (Study 2). 


were particularly disadvantaged under this 
condition. 


Discussion 


To the extent that the variables manipu- 
lated in this series of studies reflect the oper- 
ation of STM processes, one may conclude 
that no evidence has been found to support 
Speculation that mental retardates suffer a 
STM deficit. Specifically, Ellis (1963) has 
hypothesized that retardates would benefit, 
telative to normals, in an RT situation where 
the warning signal was continuously available 
during the PI, Terrell and Ellis (1964) did 
Teport an interaction of this nature, a finding 
Which the results of the present investigation 
substantially contradict. 

, Because the procedures employed here were 
highly similar to those used by Terrell and 
Ellis, it is difficult to identify the sources of 
the discrepancy. The minor procedural differ- 
€nces which are apparent simply do not seem 
Sufficient to account for the completely 
Opposite findings concerning the effects of 
the WSD. 

It is obvious that the two warning signal 
Conditions, filled and unfilled, did not pro- 
Vide the same information to the S concern- 
mg the onset of the reaction signal. Uncer- 


tainty may have been greater when the PI 
was completely filled by the warning light. 
Under this condition, S was required to ready 
himself from the beginning of the interval. On 
the other hand, the unfilled warning condition 
afforded an additional cue with the cessation 
of the warning signal. In this case, S did not 
have to prepare for the reaction stimulus 
until the termination of the warning signal. 
The period during which S was required to 
maintain his state of readiness was thus 
shorter under the unfilled condition. If this 
interpretation is correct, one may expect that 
the difference between the two warning condi- 
tions would be most pronounced at the 
shortest interval, where the warning signal 
itself constituted a substantial portion of the 
preparatory period. An analysis based on 
mean RTs, in fact, does indicate (p < .025) 
that the difference in RTs between the two 
warning conditions was greatest at the 
short PIs. 

Although an interaction was found involv- 
ing intelligence groups and length of PI, the 
effect, again, was opposite to that predicted 
by the STM position of Ellis (1963). In this 
case, the short intervals produced the greatest 
differences between normals and retardates. 
It should be noted, also, that these data 
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contradict Hermelin (1964) who found that 
subnormals were more disadvantaged than 
normals by progressive lengthening of the 
PI. However, the results of the present studies 
are fairly consistent with the majority of in- 
vestigations concerning the effects of irregular 
preparatory periods on RT. 

The finding that the slow retarded group 
(Study 2), in particular, was adversely af- 
fected at the short intervals suggests that 
these Ss had difficulty in learning the statisti- 
cal characteristics of PI. A trial-by-trial ex- 
amination of the data indicated these Ss were 
especially slow on 2-sec. trials that were im- 
mediately preceded by the long intervals (16 
or 32 sec.). It is as if S is led to “expect” 
a long interval and is, therefore, caught un- 
prepared when the interval is very brief. 
While this context effect has been reported 
previously for both normals (e.g., Klemmer, 
1956) and retardates (e.g., Baumeister & 
Hawkins, 1966), it does appear to be more 
pronounced among less adaptive individuals. 

In view of the findings concerning the ef- 
fects of various temporal manipulations of 
the warning interval in the typical simple 
RT task, it seems more meaningful to invoke 
an explanatory concept such as “expectancy” 
or set rather than STM. For one thing, the 
operations employed to define STM in this 
context are unclear (e.g., exactly what is S 
supposed to remember and over what inter- 
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val?) and, for another, the effects of 
manipulations are inconsistent with p 
tions generated within a STM position. 
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SUBJECTIVE REPORT AND CREDIBILITY: 
AN INQUIRY INVOLVING HYPNOTIC HALLUCINATIONS * 
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University of Waterloo 


This paper argued that credibility of subjective reports is in part situationally 
determined. An experimental demonstration was considered showing that under 
conditions where report credibility was a reasonable assumption, hypnotic Ss 
reported previously suggested hallucinations, whereas simulator controls did 
not, The experimental evidence also suggests that hypnotic Ss are honest and 
their reports credible across a variety of situations. It was concluded that 
subjective reports of hypnotic Ss may well correspond to actual perceptual 
experience, and that motivated compliance does not satisfactorily explain 


hypnotic phenomena. 


Science has frequently been able to mini- 
mize its reliance on data which are said to be 
nonobjective, but no science has entirely cir- 
cumvented subjective experience. In psychol- 
ogy, it often seems that the richest sources 
for understanding human behavior are the 
most purely subjective ones. Hypnotic phe- 
nomena are a striking case in point. They 
appear to offer access to processes involved 
in perception, cognition, attitudes, fantasy, 
and defenses. Yet hypnotic phenomena, being 
highly subjective in nature, are often dis- 
missed or ignored by psychologists. The em- 
phatically subjective quality of the hypnotic 
experience makes it suspect, and even in- 
credible, to those of a behavioristic persuasion. 

Simple motivational and/or behavioristic 
explanations of hypnotic behavior have 
been attempted (Barber & Calverley, 1962; 
Edmonston, 1967; Hull, 1933). However, 
what hypnotic Ss say about their experiences 
often does not seem to jibe with such expla- 
nations. It is possible, of course, that what 
hypnotic Ss say about their experiences is 
false, Consequently, it is not unreasonable to 
ask whether and when persons tell the truth 

1 This study was supported by Research Grant No. 
100 from the Ontario Mental Health Foundation and 
by Research Grant No. MA-2328 from the Medical 
Research Council. 
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about their experiences. It is certainly the 
case that the validity of any report presuming 
to reflect subjective experience cannot be 
directly evaluated. However, in everyday life 
we do tend to value certain kinds of evi- 
dence—including verbal reports—as pertinent 
to drawing conclusions about another person’s 
subjective experience. For example, we ordi- 
narily accept as credible the report of a night- 
mare by a child who has awakened crying out 
in terror for his mother, even though the 
dream itself was inaccessible to direct observa- 
tion, There are other occasions, however, 
when we do not take a person’s remarks at 
face value. Flattery is often suspect, as are 
the nostrums of politicians seeking office. 
Thus, there are factors implicit in any given 
context that help determine what is credible 
and what is not. 

It is not the intention of this paper to in- 
vestigate the parameters of credibility per se. 
Bem (1965) and Jones and Davis (1965) 
have made constructive attempts to this end. 
Rather, the authors would like to state an 
intuitively plausible hypothesis about report 
credibility, and to use this assumption sys- 
tematically in studying a highly subjective 
phenomenon, namely, hypnotic hallucinations. 

The working hypothesis is that a person’s 
experiential report is credible in inverse pro- 
portion to the extent that this report is 
demanded and/or contingently reinforced by 
the environment. Given such an assumption 
it is the authors’ hope that the methods and 
reasoning employed in the present experiment 
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will have general significance for solving some 
current problems in experimental hypnosis, 
and indeed for all psychological investigations 
in which Ss’ verbal reports constitute prime 
or ancillary data. 

Consider, then, the case of hypnotically 
induced hallucinations. When does an S’s report 
of an hypnotically induced hallucination re- 
flect his actual experience, and when does it 
serve only to please the operator by comply- 
ing with his expectancies and the general 
situational demands? Put another way, under 
what conditions are subjective reports apt to 
reflect or correspond to an S's hallucinatory 
experience, and under what conditions are 
they likely to be a function of S’s perception 
of what he is supposed to say in the situation, 
irrespective of his experience? 

An answer to such questions, in terms of 
the assumption concerning report credibility, 
is fairly simple. Reports of hypnotic halluci- 
nations, given in a context free of demands 
to report them, are more credible than are 
similar reports given in a context where 
hallucinatory reports are implicitly demanded 
or reinforced. 

However, translating this principle into a 
concrete demonstration presents an immediate 
practical problem: Since suggestions to hal- 
lucinate constitute implicit demands to report 
hallucinations, how can one divest a report 
context of implicit demands to report the 
hallucinations suggested? It was to get at this 
issue that Bowers (1967) partially replicated 
an earlier study by Barber and Calverley 

(1964). The latter authors had shown that 
Ss receiving hypnotic suggestions to halluci- 
nate and Ss task motivated to hallucinate 
both rated their hallucinations as significantly 
more real than control Ss. Bowers (1967), 
however, pointed out that in both the hyp- 
notic and task-motivated groups, the experi- 
ential reports of hallucinations were given 
in a context implicitly demanding reports of 
hallucinations. That is, Ss were first given 
suggestions to hallucinate, and then immedi- 
ately asked whether they in fact hallucinated. 
Implicit in the initial suggestions was a de- 
mand to report hallucinations. Thus, although 
Ss’ reports may have reflected an actual hal- 
lucinatory experience, it is equally and per- 
haps even more likely that the reports were a 
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function of situational demands to report 
hallucinations. The existence of the latter pos- 
sibility seriously vitiates report credibility. 

To obviate this ambiguity, Bowers (1967) 
ran two groups of task-motivated Ss. The first 
group rated the reality of their hallucinatory 
experiences under the same set of demands 
that characterized the hallucinatory episode 
itself. The Ss in the second task-motivated 
group, however, before rating their experi- 
ence, were confronted with (a) a demand for 
honesty in all subsequent reports, and (b) an 
interviewer who had previously been un- 
connected with the experiment. As in the 
Barber and Calverley (1964) study, Ss in the 
first group significantly increased their post- 
test ratings of the hallucinations’ reality over 
a pretest level. The Ss in the second group 
showed no rating increase whatever. It was 
concluded from these findings that the context 
in which reports of hallucinations occur is 
very important. When the report context is 
purged of demands for high ratings (ie, 
hallucinations rated as very real), Ss’ ratings 
remained at relatively low pretest levels. 

Although the methods of the present experi- 
ment are quite different from those employ X 
in Bowers’ (1967) study, the strategy 15 
analogous. The question is the following: In 
a context free of situational demands to report 
hallucinations, what will hypnotic Ss report? 
If they report the hallucinations previously 
suggested and do so in a context which does 
not implicitly demand such reports, it can be 
argued that the reports are referable to their 
actual experience. 

The demonstration in this experiment that | 
a context is free of demands to report hal- 
lucinations hinges to some extent on the be- 
havior of quasicontrol Ss selected for their 
unsusceptibility. It is assumed that unsuscep- 
tible Ss are incapable of experiencing hypnoti- 
cally induced hallucinations. Consequently, 
reports of hallucinations by these Ss wou 
suggest the presence of situational demands 
to report them. Conversely, the absence 0 
hallucinatory reports by these Ss would imply 
the absence of demands to report them. 
Therefore, it is in a context in which these 
quasicontrol Ss do not report hallucinations 
that hallucinatory reports by hypnotic 
become credible. 
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METHOD 
Subjects 


A total of 23 highly susceptible Ss and 14 unsus- 
ceptible Ss were selected for this experiment. The 
susceptible Ss had all been in previous hypnotic ex- 
periments conducted at the University of Waterloo, 
and were initially selected as being some of the 
highest scoring Ss on the Harvard Group Scale of 
Hypnotic Susceptibility (HGSS, Shor & Orne, 1962) 
and the individually administered Form C of the 
Stanford Hypnotic Susceptibility Scale (SHSS, 
Weitzenhoffer & Hilgard, 1962).* Altogether, the 
susceptible Ss had been seen for at least six indi- 
vidual training and testing sessions. Of the highly 
susceptible Ss, 16 were assigned to the hypnotic 
condition, and 7 to a control condition in which 
they were asked to simulate hypnosis. The unsus- 
ceptible Ss were selected on the basis of a score of 
0-3 on the HGSS and were also assigned to a simu- 
lating condition (0-3 simulators). The 0-3 simulators 
were used to minimize the possibility of their 
becoming inadvertently hypnotized during the course 
of the experiment (Hilgard & Tart, 1966; Tart & 
Hilgard, 1966). The susceptible simulators were em- 
ployed to control for S effects, and for the effect of S 
familiarity with hypnotic techniques. 


Procedure * 


The Ss in the experiment were seen by a hypnotist 
who was unfamiliar with them and who therefore 
did not know in advance whether a particular S was 
susceptible or unsusceptible. Upon entering the lab- 
oratory, S was told to sit in a large, comfortable 
chair, The E then told S that earphones would be 
placed on his head, and taped instructions would 
be played. The S was told that E did not know 
Which of several instructions would be played, as the 
particular tape he was about to hear was randomly 
selected by a third person. This in fact was true. 
Furthermore, S was told not to reveal to E by word 
Or sign, the nature of the taped instructions. After 
placing the earphones on S and turning on the tape 
recorder, E left the room so as not to be present 
during the playback. In this way, any facial or 
Other cues made by S during the instructions were 
unavailable to E, thereby tending to preserve E's 
naiveté concerning S's condition. 

One kind of tape contained instructions to simulate 
hypnosis, and to fool the hypnotist into believing S 
was in fact hypnotized. The exact wording of the 
instructions was similar to those reported earlier 
(Bowers, 1966). The second kind of instructions 
merely told S that he would subsequently be hypno- 
tized. Much of this recording consisted of total 
HIS 


^ The scores of the susceptible Ss on the HGSS 
ranged 8-12 and on the C scale 7-12. 

č The present experiment is actually the second of 
two experiments which were run one after the other, 
always in the same order. The results of the first. 
experiment were completely negative and are not 
reported here, 
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silence, so as to make the tape equal in length to 
the simulator instructions. Both tapes contained 
explicit demands for honest reports in a post- 
experimental interview (PEI), after a signal from E 
(*Now the experiment is over") was given. The last 
instruction was to remove the earphones. 

After S had removed the earphones, E returned 
and delivered hypnotic suggestions adapted closely 
from those used on the SHSS Form C. When S ap- 
peared to be deeply hypnotized, instructions and sug- 
gestions for the first experiment were delivered (see 
Footnote 5). After completing the first experiment, 
E introduced S to the second experiment. The 
materials for this experiment consisted of a series 
of 5X 7 in. display cards, on all but two of which 
a color chip, approximately 1 in. on a side, appeared. 
Each chip consisted of colored, transparent cello- 
phane mounted on top of two intersecting black lines 
at right angles to each other. The lines extended well 
beyond the boundaries of the chip, and were easily 
visible underneath it. The chips differed indepen- 
dently along three dimensions: color (red or green), 
shape (square or triangular), and position (the chips 
appeared in either the upper or lower half of the 
display card). Each S in the experiment was first 
exposed to the same two sample display cards, 
clearly labeled A and B, and presented one after the 
other. The hypnotized (or simulating) S was in- 
structed to open his eyes and look at Display A. 
The E then described Display A for S, noting 
that a red square chip was in the upper section of 
the display card. After eliciting agreement from S 
about the color chip on Display A, E replaced 
Display A with Display B and told S to describe 
the color chip thereon. The Ss invariably described 
the chip accurately (green, triangular, lower section). 

Next, S was presented with a pair of conjoined 
display cards labeled Display I and Display II, re- 
spectively. Each S was confronted with one such 
pair randomly selected from a pool of eight card 
pairs constructed for the experiment. The color chips 
on these eight pairs of display cards differed system- 
atically and independently from each other. For ex- 
ample, Display I of one pair of cards might consist 
of a green triangle in the lower section and Dis- 
play II might be a red triangle in the upper section. 
This pair of display cards, taken as a unit, differed 
from all other seven paired displays. The display 
cards were presented in a way that made it impos- 
sible for E to know which one of the eight different 
card pairs he was presenting to a given S. Thus, 
it was impossible for E to communicate to S, how- 
ever unwittingly, any biases arising out of his knowl- 
edge of what color chips actually appeared on 
the displays. j 

Before actually opening his eyes to view Displays 
I and II, S was given further suggestions to deepen 
hypnosis. Then he was told: 


First Im going to show you Displays I and II. 
Each of these displays resembles Displays A and B 
which you have already seen, except for one thing. 
On neither Display I nor II will there be any 
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color chips. You will be able to see Displays I 
and II quite clearly, but on neither of them will 
you see any color chip whatsoever ; Display Cards 
land II will be absolutely empty. There will be 
no color chips in either of the two displays. Do 
you understand? O.K. Open your eyes slowly and 
look at the two blank display cards. What do you 
see on Display I? What do you see on Display 
1I? O.K. Close your eyes for a few moments while 
I change the two displays. [S’s answers were 
recorded.] 


Several things should be noted at this point before 
going on. First, and obviously, the authors are here 
suggesting negative hallucinations. Second, reports 
of negative hallucinations are being elicited in a 
context which implicitly demands of S a report of 
“I don't see anything.” If S wished only to 
comply verbally with the situational demands, he 
knows everything he needs to know about what to 
say. Moreover, if the suggestion for negatively 
hallucinating the color chips were successful in elicit- 
ing the hallucination, S’s verbal report to this effect, 
taken in this context, would not differentiate him 
from the merely compliant S. 

The next step in the experiment was to present S 
with two conjoined display cards labeled Displays 
III and IV which had no color chips on them what- 
ever. In every respect other than the differences in 
the labels (III and IV instead of I and II) and the 
absence of color chips, the cards were identical with 
those previously viewed. The S was then told: 


Now I am going to show you the last pair of 
display cards, Displays III and IV. When I ask 
you to open your eyes in a few moments, you will 
see a color chip on both Display III and Dis- 
play IV. You will see the two chips quite clearly 
just as you saw the color chips a while ago on 
Displays A and B. The chips will be either red 
or green, square or triangular, and in either the 
upper or lower section of each display. When you 
open your eyes, I want you to tell me exactly 
where you see the color chips, what colors they 
are, and what shape. Do you understand? 


The S was then told to open his eyes and was asked 
for the (hallucinated) color, shape, and position of 
the chips on each of the two displays. After yielding 
his response, S was commended and instructed to 
close his eyes. All of S's responses were recorded. 
Again notice that the demands inherent in the sug- 
gestion for positive hallucinations clearly imply the 
preferred kind of verbal report. 

Next, E said the following to those Ss who had 
223 peepoited negative hallucinations to Displays 

and II. 


Now, do you remember just a moment ago when 
you opened your eyes and saw nothing on Dis- 
plays I and II? In fact, there was a color chip 
on each display then, but because of my suggestion 
you thought you did not see them. Now I want 
you to recall what in fact was there that first 
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time, This sounds hard, but you really did see the 
color chips and now that I am removing my ear- 
lier suggestion, you will find that you cam remem- 
ber seeing the chips that were there. You will find 
that my reminding you that something really was 
on each display helps you remember now seeing 
what was really there. Now you can remember 
what was really on Displays I and II. 


These Ss were then asked to recall the chips the 
very presence of which were not acknowledged ear- 
lier. Those Ss who initially did identify the color 
chips on Displays I and II were simply asked to 
recall them again. 

By suggesting recall for previously unacknowledged 
color chips, the authors attempted to alter the de- 
mands of the situation sufficiently so as to produce 
memories of what was previously “not seen.” Orne 
(1962) has noted that Ss who have negatively hal- 
lucinated a chair “demonstrate that at some level 
they are aware of the chair” by tending to avoid it 
while walking around the room. One might expect 
therefore, that hypnotic Ss might be able, on com- 
mand, to retrieve previously registered but heretofore 
unavailable information. 

The S was then given posthypnotic suggestions to 
remember everything that had transpired during the 
experiment. Finally, S was awakened and immedi- 
ately told “Now the experiment is over.” It will be 
remembered that this phrase was introduced during 
the taped instructions as the signal for simulating 
Ss to stop faking, and for Ss in both conditions 
to answer all subsequent PEI questions as honestly 
as possible. All Ss were asked two initial questions: 
[Question 1] Were you asked on the tape recorder 
to fake hypnosis, and to fool the hypnotist into 
believing that you were hypnotized (yes or no)? 
[Question 2] Whether or not you were asked to fake 
hypnosis, did you become hypnotized (yes or no) P^ 

Then followed the structured PEI, the aim of 
which was to elicit S's experience of the test situa- 
tion, The most important and revealing of these 
questions follow. 


[Question 3] Would you say that the color chips 
you now remember seeing on Display Cards I an 
II were (a) less real, (b) equally as real, (c) more 
real than the color chips which appeared on the 
original Display Cards A and B? j 
[Question 5] For some people in this experiment, 
color chips appeared on Display Cards II and IV. 
For other people, there were no color chips 0D 
Displays III and IV. Were you in the condition 
in which (a) color chips were present on Displays 
III and IV, (b) color chips were not present on 
Displays III and IV? 


The Ss who correctly stated that color chips did not 
appear on Displays III and IV were asked no further 
questions. The Ss who affirmed the presence of ge 
chips on. Displays III and IV were asked severa 
more questions: 


[Question 6] Would you say that the color m 
which appeared on Display Cards III and IV we 
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(a) less real, (b) equally as real, or (c) more 
real than the color chips you now remember ap- 
pearing on Display Cards I and II? 

[Question 7] Would you say that the color chips 
which appeared on Display Cards III and IV were 
(a) less real, (b) equally as real, or (c) more real 
than the color chips which appeared on Cards 
A and B? 

[Question 10] Which would now seem most in- 
credible to you: that there really were no color 
chips on Displays Card I and II aíter all or 
that there were no color chips on III and IV. 


The significant aspect of the PEI reports of Ss 
is that they were given in a context which was 
hopefully free of demands to report hallucinatory 
experiences, In this respect, the PEI differs impor- 
tantly from the report context of the experiment 
per se. Precisely because giving "correct" responses 
is presumably not an issue in the PEI, the reports 
made in this context are potentially more credible 
insofar as they are more apt to be referable to S's 
subjective experience. 

The success of the manipulations to evoke honest 
reports can be assessed in part by the answers of 
the unsusceptible, 0-3 simulating Ss during the PEI. 
Because 0-3 simulators are essentially unhypnotiza- 
ble, any reports of hallucinatory experiences they 
give during the PEI would be suspect as reflecting 
their response to residual demands implicit in the 
report context, rather than being an accurate experi- 
ential account. If it is at all common for 0-3 simu- 
lators to report hallucinations in the PEI, then the 
significance of similar reports by the hypnotic Ss 
would not be credible. 


RESULTS 
Subject Reports during Experiment 


The importance of the design of this study 
is that it allows direct comparison of verbal 
teports of Ss in two conditions (hypnotic and 
simulating) in two different report contexts.° 
According to the argument, mere verbal com- 
pliance is most apt to characterize Ss’ reports 
during the experiment per se, where demands 
for hallucinations are implicit in the halluci- 
Natory suggestions, Table 1 shows the break- 
down of Ss? reports of negative hallucinations 
to Displays I and II during the experiment. 
Observation. of this table indicates that one- 
half of the 0-3 simulators complied with the 


* Actually, there were four conditions, if the sus- 
Ceptible simulator Ss are included. Since the number 
of these Ss was relatively small, and because, with 
the exception of one S, the data of this group were 
Very similar to that of 0-3 simulators, relatively 
little space is devoted to presenting the results of 
the susceptible Ss, 
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TABLE 1 


Reports oF NEGATIVE HALLUCINATIONS OF 
Hypnotic AND 0-3 SIMULATOR SUBJECTS 


0-3 simu- 


Condition Hopoate 3 sii 


Report negative hallucination 2 7 
Deny negative hallucination 14 7 


Note.—Fisher's exact probability, p < .05. 


demand to negatively hallucinate, whereas 
only one-eighth of the hypnotic Ss reported 
negative hallucinations, Assuming for the 
moment that compliance is the sole basis for 
these hallucinatory reports, the 0-3 simulators 
are significantly more compliant than the 
hypnotic Ss in a demand context in which 
report compliance is very probable. 

Table 2 indicates that much the same kind 
of results occurred for reports of positive 
hallucinations on Displays III and IV. Al- 
though the data here did not reach signifi- 
cance, it is clear that more simulator than 
hypnotic Ss tended to report positive halluci- 
nations in a context where such reports were 
implicitly demanded by the suggestions. 
Again, it seems that report compliance in the 
context of implicit demands to report positive 
hallucinations is more characteristic of simu- 
lating than of hypnotic Ss. 

Tables 1 and 2 do not include data from 
the seven highly susceptible simulator Ss. 
Four of these Ss reported negative hallucina- 
tions to Displays I and II. The same four Ss 
reported positive hallucinations on Displays 
III and IV. 

It was expected that the countersuggestion 
for hypnotic Ss to remember color chips that 
had previously been negatively hallucinated 
would reveal the extent to which registration 
of the color chips at some level had in fact 
occurred. The fact that only two hypnotic Ss 
initially verbalized negative hallucinations 


TABLE 2 


Reports or PosrrrvE HALLUCINATIONS OF HYPNOTIC 
AND 0-3 SIMULATOR SUBJECTS 


Hypnotic | 0-3 simu- 
"Ss i 


Condition ators 


Report positive hallucination 6 9 
Deny palve hallucination 10 5 


Note.—Fisher's exact probability, ns. 
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TABLE 3 


Reports or POSITIVE HALLUCINATIONS OF HYPNOTIC 
AND 0-3 SIMULATOR SUBJECTS DURING POSTEX- 
PERIMENTAL INQUIRY 


Condition Byneoue osama 
Report positive hallucination T 0 
Deny positive hallucination 9 14 


Note.—Fisher's exact probability, p < .01. 


makes it difficult to interpret the fact that 
half (ie. one) of them accurately remem- 
bered the negatively hallucinated chip. The 
countersuggestive manipulation promises to be 
of some interest, however, and a successful 
replication of this aspect of the experiment 
on a larger N would have theoretical signifi- 
cance concerning the mechanisms of halluci- 
natory activity. 


Reports of Subjects during PEI 


So far, the authors have been concerned 
with verbal reports by Ss evoked prior to the 
prearranged signal of “Now the experiment 
is over.” At this signal, the demand context 
changed. The S was to provide an honest 
report instead of a correct report which satis- 
fied the suggestion-demand to hallucinate. 
As mentioned earlier, Bowers (1967) showed 
that a similar alteration did have a significant 
effect on Ss’ reports of hallucinations. Al- 
though the PEI context is not literally de- 
mand free, the nature of the demands on S 
are, psychologically speaking, quite different. 
The constraints on Ss’ reports are no longer 
to comply with the perceived purpose of the 
experiment, but rather to report honestly their 
actual experience. Whether or not one suc- 
ceeds in altering the demand characteristics 
of the experimental situation should be ap- 
parent in part by the experiential report of 
the 0-3 simulators. There is no basis for 
believing that these Ss should experience 
hypnotically induced hallucinations, and hal- 
lucinatory reports made by these Ss would 
suggest that residual demands to comply were 
still operative. The use of unsusceptible Ss 
as controls thus helps to assess the success 
with which the demand context is truly 
altered. If in response to demands for honesty, 
0-3 simulators report no hallucinatory ex- 
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perience, then the demand context of the PEI 
can reasonably be assumed to have been 
altered in such a way as to produce honest 
instead of correct reports. If one can demon- 
strate the absence of situational demands to 
report in correct ways, then PEI hallucina- 
tory reports by hypnotic Ss take on high 
credibility. 

The data gathered in the PEI showed a 


dramatic change by the 0-3 simulators, in . 


the predicted direction. The most revealing 
data concerned Ss’ responses to Question 5 
which asked whether or not S was in the 
condition in which color chips were present 
on Displays III and IV. It will be recalled 
that confirming the presence of chips to Dis- 
plays III and IV constituted a positive hallu- 
cination. As Table 3 indicates, not one of the 
0-3 simulator Ss stated that he was in the 
condition in which color chips appeared on 
Displays III and IV; seven hypnotic Ss said 
they were. The fact that no simulator Ss 
reported color chips on Displays III and IV 
suggests that the situational demands for an 
hallucinatory report were successfully nullified. 
In light of this finding, there is correspondingly 
less chance that hypnotic Ss were responding 
to situational demands during the PEI to 
report hallucinations. Therefore, the halluci- 
natory reports, given by hypnotic Ss in this 
context, have some credibility claim. 
Although the seven hypnotic Ss in question 
were alike in stating that color chips were 
present on Displays III and IV, the actual 
nature and vividness of the hallucinated color 
chips were quite variable. In fact, as the 
reader may have noticed, one hypnotic S who 
did not report color chips on Displays IH 
and IV during the experiment reversed himself 
during the PEI. The remaining six Ss were 
the same ones who had previously reported 
positive hallucinations to Displays III and IV. 
One of these Ss reported a yellow triangle in 
the upper section of Display III and nothing 
on Display IV. Of course yellow was not 4 
color that was used in this experiment. The 
two above Ss and four of the remaining five 
Ss who reported positive hallucinations 1°- 
sponded to Question 6 that the (positively 
hallucinated) chips on Displays III and IV 
seemed less real than those remembered 0” 
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Displays I and II; the remaining S claimed 
the (hallucinated) chips on Displays III and 
IV were more real. To the seventh question, 
four Ss stated that the color chips on Dis- 
plays III and IV appeared less real than the 
chips which had appeared on the original Dis- 
plays A and B. One S stated that the chips 
on Displays III and IV were equally as real 
as those appearing on Displays A and B, 
and another S said the chips on Displays III 
and IV were more real than those on A and B. 
The last S (the one who had not positively 
hallucinated in the presence of the Displays 
III and IV) said that a red chip seen on 
Display III was less real, and a green chip 
on Display IV was equally as real as those 
chips appearing on Displays A and B. 

It is clear that the positive hallucinations 
were seldom as vivid as actual color chips. 
However, the hallucinations were all suffi- 
ciently convincing to warrant Ss' acknowledg- 
ment of their external reality as a subjectively 
more valid option than would be the denial 
of their experiential reality altogether. Indeed, 
three out of the seven hypnotic hallucinating 
Ss stated in response to Question 10 that it 
would *now seem more incredible" that there 
were no chips on Displays III and IV than 
that there were no color chips on Displays I 
and II after all. 

Although Ss' reports to Question 5 and 
Subsequent questions were probably the most 
interesting aspect of the present findings, Ss' 
reports to Question 3 were also quite con- 
Sistent with the data and arguments previ- 
ously presented. This question asked whether 
the color chips on Displays I and II now 
Seem less real, equally as real, or more real 
than those chips that had appeared on Dis- 


TABLE 4 


RESPONSES or HYPNOTIC AND 0-3 SIMULATOR SUBJECTS 
TO QUESTION 3 OF THE POSTEXPERIMENTAL 


INQUIRY 
Condition ee noce 
BH I&IIchips more or 
qually as real as Displays 
A&B gd 10 14 
Displays I & II chips less real 
than Displays A & B 6 0 


Note.—Fisher's exact probability, ? <.02. 


449 


TABLE 5 


CHANGE IN REPORTS DURING THE PosTEXPERIMENTAL 
Inquiry or Supyects WHO REPORTED POSITIVE 
HALLUCINATIONS DURING THE EXPERIMENT 


> Hypnotic | 0-3 simu- 
Condition Ss lators 
No change in report from one 
context to another 6 0 
Changein report from one 
context to another 0 9 


Note.—Fisher's exact probability, p < .001. 


plays A and B. All 0-3 simulators said they 
appeared equally as real, but hypnotic Ss were 
again more variable. Two hypnotic Ss said 
color chips on Displays I and II now seemed 
more real than the chips on Displays A and 
B; eight Ss said they were equally as real; 
and six Ss said they were less real. If the 
more real category is collapsed into the cate- 
gory equally as real, then the result, as seen 
in Table 4, can be tested by the usual Fisher's 
exact probability formula. These data seem to 
indicate that significantly more hypnotic than 
0-3 simulator Ss experienced some perceptual 
distortion on Displays I and II, despite the 
fact that significantly more 0-3 simulator Ss 
originally reported negative hallucinations to 
Displays I and II (see Table 1). 

A final indication of the extent to which 
the report context is vital for assessing the 
experiential reality of visual hallucinations 
can be observed in Table 5. Originally in the 
experiment, 6 hypnotic and 9 simulator Ss 
reported positive hallucinations. Of these 15 
Ss, 6 maintained their original report of hal- 
lucinations during the PEI. Table 5 shows 
that these 6 were the hypnotic Ss. Thus, one 
sees that 0-3 simulators who reported positive 
hallucinations, reported them only in a con- 
text where such reports were implicitly de- 
manded by the suggestions to do so. The 
hypnotic Ss who reported positive hallucina- 
tion, however, did so both in a context where 
they were implicitly demanded and in a con- 
text relatively free of such demands." 

1 The fact that highly susceptible Ss who are told 
to fake hypnosis might inadvertently, and unwit- 
tingly, become hypnotized was the reason for em- 
ploying unsusceptible Ss as controls in this experi- 
ment. Obviously, S confounding does occur in such 
a design, and this fact can vitiate the generality 
of the findings. It was in cognizance of this fact that 
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Discussion 


The two main findings of this experiment 
are: (a) hypnotic Ss who reported positive 
hallucinations continued to acknowledge them 
under the same PEI report conditions in 
which simulating Ss did not, and (5) signifi- 
cantly fewer hypnotic than simulator Ss ac- 
knowledged negative hallucinations under ex- 
perimental report conditions, conditions in 
which these hallucinations were implicitly 
demanded by specific hypnotic suggestions. 

If it were not for this second finding, the 
first finding would be a less convincing dem- 
onstration that some Ss actually experienced 
positive hallucinations. One might argue for 
instance that the PEI was perceived dif- 
ferently by simulating and by hypnotic Ss 
in ways that could account for differences in 
the hallucinatory reports. The simulators 
were, after all, explicitly directed to stop 
faking and to be honest during the PEI. The 
hypnotic Ss, on the other hand, were never 
told to simulate (i.e., lie) in the first place, so 
that any directive to be honest during the 
PEI might not have had the same significance 
that it had for simulating Ss. It is conceivable, 
therefore, that the hypnotic Ss were merely 
complying with some unwanted and crytic 
demands, inherent in the PEI, to report posi- 
tive hallucination. 

It must be recalled, however, that only 2 
out of 16 hypnotic Ss, as compared with 7 
out of 14 simulator Ss, reported negative 
hallucinations during the experiment when 
such reports were implicitly demanded by the 
suggestions to negatively hallucinate. Compli- 
ance to regnant situational demands certainly 


a small number of susceptible simulators were in- 
cluded in this experiment. With the exception of 
one S, the behavior of Ss in this group paralleled 
quite closely that of the 0-3 simulators. For example, 
three out of the four susceptible simulators who 
reported positive hallucinations during the experi- 
ment later denied them in the PEI. Interpretations 
based on the data of the 0-3 simulators do not 
therefore seem to be misleading. The fact that sis. 
ceptible simulators did tend to behave like 0-3 
simulators suggests (a) that susceptible Ss can resist 
hypnotic suggestions and (b) that the discrepancy in 
the number of hours that hypnotic and 0-3 simu- 
lators had been seen for training was not a factor 
of overwhelming importance in the comparison of 
the data, 
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does not explain the verbal behavior of hyp- 
notic Ss during the experiment. Indeed, it 
seems eminently reasonable to accept as 
credible, the verbal reports of hypnotic Ss in- 
dicating the absence of negative hallucinations 
when these reports occur in an experimental 
context implicitly demanding such hallucina- 
tions. It seems no less reasonable to assume 
the credibility of positive hallucinations ten- 
dered during the PEI; for the PEI constitutes 
a report context which is quite probably less 
demanding of verbal compliance than is the 
experimental situation itself. Inasmuch as the 
hypnotic Ss reported relatively few (negative) 
hallucinations in a context where such reports 
were implicitly demanded, and comparatively 
more (positive) hallucinations in a context 
relatively free of demands to do so, it would 
appear that hallucinatory reports of hypnotic 
Ss are more credible than they are compliant 
to situational demands. 

If, on the basis of the evidence and argu- 
ments here proffered, one can assume that 
hypnotic Ss tend to give credible reports, 
then one can also assume that their reports 
are very important data. In fact, Ss’ verbal 
reports about their hypnotic experiences can 
then be seen to constitute evidence by which 
to ascertain whether or not a person Was 
hypnotized. Having such data helps us to 
wiggle free of the tautology that Barber 
(1964) has pointed to wherein “a person is 
said to respond to suggestions because he is 
in hypnosis and he is said to be in hypnosis 
because he responds to suggestions [p. 839].” 
According to the view espoused here, an S’s 
testimony about how he experienced his be- 
havior is critical in “cracking” the tautology, 
because it constitutes independent evidence 
about S’s actual experiential state. 

The work of Hilgard and Tart (1966; Tart 
& Hilgard, 1966) is very consonant with the 
above line of reasoning. They elicited periodic 
state reports from Ss which served as criteria 
of hypnotic depth that were “relatively inde- 
pendent of the responses to various tests of 
suggestion [Hilgard & Tart, 1966, p. 203]. 
As it turns out, these state reports of hyp- 
notic depth correlated with suggestibility, and 
they did so even in Ss who had not receive 
formal hypnotic induction. The use of their 
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experimental technique thus allows a conclu- 
sion that some nonhypnotic control Ss in fact 
become hypnotized—rather a turnabout from 
a more familiar conclusion that hypnotic Ss 
are merely task motivated. 

In conclusion, the present study supports 
other recent reappraisals suggesting the inade- 
quacy of the motivated compliance view of 
hypnosis (Bowers, 1966; Evans, 1966; Evans 
& Orne, 1965). It has been argued, here and 
elsewhere (Orne, 1966), that the subjective 
characteristics of hypnotic behavior are of 
paramount importance in any attempt to un- 
derstand the phenomena. Consequently, the 
experiential reports of hypnotized Ss become 
very significant data, especially when there is 
new cause to regard them as credible. It 
seems to the authors that it has been an un- 
willingness of some investigators to take seri- 
ously the subjective reports of hypnotic Ss 
that has opened the gates of uncritical skepti- 
cism concerning hypnotic phenomena. Perhaps 
the data and arguments offered herein will 
persuade other workers that hypnotic Ss have 
something important to say. 
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CYCLIC VARIATION IN STUTTERING: 


COMMENT ON TAYLOR AND TAYLOR’S “TEST OF PREDICTIONS 
FROM THE CONFLICT HYPOTHESIS OF STUTTERING” 


JOSEPH SHEEHAN 
University of California, Los Angeles 


Although stuttering behavior may show some constancy over short time 
sequences as Taylor and Taylor have shown, larger time segments do show a 
cyclic variation. Brief readings of experimental passages do not always encom- 
pass the range of cyclic occurrence. Data from Quarrington and from the 
author are cited in support of the approach-avoidance conflict theory of 
stuttering, particularly the fear-reduction hypothesis: viz., that the occurrence 
of stuttering behavior reduces the fear that elicited it. 


An intriguing feature of the mystery of 
stuttering is that most stutterers are able to 
speak most of their words fluently, as John- 
son, Brown, Curtis, Edney, and Keaster 
(1967) have shown. A wavelike, periodic 
fluctuation in frequency of stuttering has 
added to the mystery. 

During the course of exposition of a con- 
flict theory of stuttering, Sheehan (1958) 
posed two questions related to stuttering as a 
momentary blocking of speech: (a) What 
makes the stutterer stop? (b) Once stopped, 
what enables him to continue? To answer 
these questions, Sheehan proposed two major 
hypotheses: (a) the conflict hypothesis—the 
stutterer stops whenever conflict approach 
and avoidance tendencies reach an equilib- 
rium; (5) the fear-reduction hypotheses—the 
occurrence of stuttering behavior reduces the 
fear that elicits it. Experimental evidence for 
both hypotheses was covered in connection 
with the theory presentation and need not be 
reviewed here. 

As a corollary or outgrowth of the fear- 
reduction hypothesis, the theory suggested 
that the wavelike variation noted in stuttering 
might be explained as follows: During dis- 
fluent periods, stuttering reduces fear suf- 
ficiently to permit fluency, so that there is a 
buildup of fluency due to actual occurrence 
of stuttering behavior. During fluent periods 
the fear is not dissipated and builds up, for 
the fluency itself sets up a role expectation 

1 Requests for reprints should be sent to the au- 
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that makes stuttering more probable. In this 
way stuttering leads to fluency and fluency 
leads to stuttering. Hence the wavelike, cyclic 
pattern of stuttering occurrence. 

Recently in this journal, Taylor and Taylor 
(1967) offered certain data as a test of “the 
Sheehan-Quarrington conflict hypothesis.” 
More precisely, they should have referred to 
the fear-reduction hypothesis, just summat- 
ized in the preceding paragraph. I. K. Taylor 
(1966) had had nine stutterers read a 500- 
word passage three times in succession. Fur- 
ther analysis of these data showed “no tend- 
ency either for periodicity or for clustering 
of the stuttering events [Taylor & Taylor, 
1967].” Apparently, from these data the 
Taylors felt that they had disproved the €x- 
istence of cyclic variation in stuttering. At 
least, on the basis of this finding they ques 
tioned the whole approach-avoidance conflict 
interpretation of stuttering (Sheehan, 1953). 

The ambiguity here concerns the duration 
of the phases of the cyclic variations of stut- 
tering, that is, the length of the waves. 

From clinical observations and from Quar- 
rington's? data it may be said that there arè 
large waves and ripples, and ripples within 
larger waves. The crests and troughs may 
vary widely from one individual to another, 
and within the same individual at different 
times. But the cyclic characteristic of stutter 
ing is not confined to a narrow time segment 
such as that studied by Taylor and Taylor 
The pattern of variation may be second 
second, minute to minute, hour to hour, day 
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to day, week to week, month to month, and 
even year to year. Stuttering cycles may also 
vary according to the life stage of the 
individual stutterer (Conway & Quarrington, 
1963; Quarrington, 1965). 

In support of this pattern of variation, 
Quarrington has recently gathered data on 39 
young stuttering children, mean age 4 yr., 
4 mo., observed over a 1-yr. period. In 32 of 
these stuttering persisted over the 1-yr. 
period, during which their mothers supplied 
daily ratings of the frequency of stuttering. 
When submitted to Fisher's harmonic analy- 
sis, the resulting data curves revealed the 
presence of significant periodicities. Fifty-nine 
percent of the persistent stutterers showed 
statistically significant periodic components in 
the cycle length range of 2-6 mo. There was 
no evidence of shorter periodicities, nor was 
there evidence of any weekly patterning of 
stuttering frequency. 

Since stuttering behavior is notoriously de- 
pendent on situational variables, Taylor and 
Taylor should hardly have expected to en- 
compass the range of wavelike variation in 
stuttering within the minutes it takes to read 
an experimental passage. Yet that is appar- 
ently what they assumed. 

The Taylor and Taylor data are interesting 
but not astonishing in showing that stuttering 
may occur with some constancy over short 
time sequences. But every study of the adap- 
tation effect, involving gradual reduction of 
stuttering frequency over successive readings 
of the same passage, has shown a similar 
constancy. The time span it takes to read 
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Passages during an experimental session is 
really quite short: 15 min., or 30, but seldom 
over an hour even for the most severe stut- 
terer. These are not the predominant orders 
of time referred to in describing stuttering 
behavior as cyclic. 

Neither the Sheehan conflict theory nor 
the case for cyclic variation in stuttering rest 
primarily upon the minute time segments that 
Taylor and Taylor apparently expected, but 
naturally enough did not find. The waves in 
stuttering are those of a big ocean, not of 
a small puddle. 1 
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VERBAL OPERANT CONDITIONING OF HOSPITALIZED 
PSYCHIATRIC PATIENTS 
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Two experiments were conducted. In Experiment I, 81 Ss were divided into 
nine subgroups according to age and length of hospitalization. Plural nouns 
were reinforced by positive verbal feedback. Results were (a) least chronic Ss 
increased their operant rates; (b) most chronic Ss did not; (c) there was sig- 
nificant interaction with respect to age and chronicity. In Experiment Il, 60 
schizophrenics Ss were divided into six subgroups with respect to length of 
hospitalization and mode of reinforcement (he, verbal, primary, combina- 
tion). Acute-type schizophrenics responded positively to all types of reinforce- 
ment. Chronics responded only to primary reinforcement and to the com- 
bination procedure. Mode of reinforcement was a significant variable affecting 


operant rate. 


Verbal operant conditioning with psychi- 
atric patients residing in a hospital or institu- 
tional setting has received increasing attention 
in the past few years. Particular attention has 
been devoted to research with patients with a 
diagnosis of schizophrenia even though the 
reliability of diagnosis is seriously open to 
question (Mehlman, 1952; Schmidt & Fonda, 
1956). The literature dealing with verbal con- 
ditioning of normals and psychiatric patients 
has demonstrated consistently that normals 
condition much more readily, at least when 
verbal reinforcement is used as feedback (Buss 
& Lang, 1965). 

Baron (1966), in attempting to explain 
these findings, suggested that schizophrenics 
have had a negative premorbid social rein- 
forcement history for a variety of behaviors. 
Baron suggested that schizophrenics evaluate 
positive verbal-social reinforcement as inap- 
propriate and thus fail to respond in a posi- 
tive manner. However, some patients diag- 
nosed as schizophrenic are apparently more 
responsive than others and condition more 
readily (Johannsen & Campbell, 1964). In 
addition, patients who leave the hospital 
sooner (ie., within 180 days) condition more 
readily, apparently because they are of the 
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acute rather than chronic type (Salzinger & 
Portnoy, 1964), In related research, True 
(1962) found that reactive schizophrenics 
conditioned more rapidly than those classified 
as process. It has further been demonstrated 
by a number of research studies that schizo- 
phrenics respond or learn more under the 
conditions of verbal censure than verbal re- 
ward. 

Schizophrenic patients seem to react to pri- 
mary or nonverbal reinforcement more Con- 
sistently than to verbal reinforcement. Kil- 
berg (1962) found that nonverbal rewards in 
the form of cigarettes did not lead to signifi- 
cant differences in verbal conditioning of nor- 
mals and schizophrenics. The more general 
operant conditioning procedures investigated 
by Ayllon and Haughton (1962, 1964) also 
supports this hypothesis. Further classifica- 
tion and exploration in this area of behavioral 
modification appears warranted, especially 
with regard to variable interaction, extinction, 
generalization effects, and combinational pro- 
cedures. This type of experimentation, in ad- 
dition to providing information with regat 
to principles of human learning, has practical 
implications related to more effective treat- 
ment of institutionalized psychiatric patients. 
It is not enough to say that chronic psychi- 
atric patients and/or schizophrenics are "dil- 
ferent" and “unresponsive,” but also to Cf- 
cumyent these difficulties by devising effec- 
tive procedures to increase adaptive behaviors 
and extinguish maladaptive behaviors. 
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TABLE 1 
SUBJECTS WITH RESPECT TO CHRONICITY: AGE RANGE, NUMBER OF HOSPITALIZATIONS, 
AND DIAGNOSES 
Age ranges (no. Ss) Diagnoses* (no, Ss) 
Chronicity Men ER 
26-42 35-43 44-52 1 2 3 4 
6 mo. 9 9 9 1.01 17 3 0 7 
1-2 yr. 9 9 9 1.85 20 1 1 5 
3-5 yr. 9 9 9 1.93 21 9-1 2 4 


*1 = schizophrenia; 2 = psychoneurosis; 3 = brain syndrome; 4 = personality disorder. 


The purposes of the two experiments con- 
ducted were to ascertain the changes in verbal 
operant response rate as a function of age, 
length of hospitalization (ie. chronicity 
level), interaction of these variables, and fi- 
nally the effects of mode of reinforcement 
(ie., verbal, primary, or a combination of the 
two of these). Patient awareness was also as- 
sessed as a variable affecting learning and was 
ascertained in the manner of Denike (1964). 


EXPERIMENT I 
Method 


7 Subjects. The Ss consisted of 81 patients residing 
in a state mental hospital. All Ss were male with at 
least a fifth-grade reading level as ascertained by use 
of the Wide Range Achievement Test (Jastak & 
Bijou, 1946). The Ss were divided into three major 
age and chronicity groups as determined by total 
length of time spent in the hospital without regard 
to number of hospitalizations. Table 1 presents a 
summary of S group selection. 

Procedure. The verbal operant conditioning pro- 
cedure was similar to that used by DeNike (1964). 
Human nouns constituted the critical response class 
reinforced verbally by E saying “good” or “fine.” 
Each S was required to say a total of 200 words 
singly with E facing S. The S was instructed to not 
tepeat words. The conditioning task of 200 words 
Was divided into eight trial blocks of 25 words each 
for purposes of statistical analysis. No reinforce- 
ment was given during the first trial block in order 
to establish operant levels of the Ss. To ascertain 
Awareness, after each trial block Ss were given 1 
min, to express verbally any thoughts they might 
have about the experiment. After the entire pro- 
cedure, Ss were asked a series of questions devised to 
assess awareness of reinforcement contingency of the 
human noun concept or of any related concept (e.g., 
Did you notice that I said anything? What kind of 
Words did you say most? Why?) To be classified 
aware an S had to say either that E said “good” or 
fine” when he gave human nouns, words about 
People, or illustrated this concept by example. 


Results 


Individual scores were obtained by sub- 
tracting the number of human noun responses 
in the operant base-line block from the num- 
ber emitted in the last trial block. Because 
the within-group variance was heterogeneous, 
analyses of the data were carried out both 
with raw and transformed scores. (Square-root 
transformations were necessary to obtain ade- 
quate within-group homogeneity of variance.) 
A factorial analysis revealed that between- 
group differences were statistically significant 
with respect to the chronicity of Ss (p < .01), 
but not with respect to age. The interaction 
effect Age X Chronicity Level was highly sig- 
nificant (p «.01). Inspection of the data 
suggested that the interaction effect Age X 
Chronicity occurred most markedly in the 
middle-chronicity-level group where older Ss 
responded or learned more efficiently than 
younger Ss. Group learning differences gen- 
erally were, as expected, in favor of the acute- 
type patients. That is, Ss with shorter hos- 
pitalization learned relatively more in terms 
of increased operant rate. 

Only 12 of the 81 Ss expressed any form of 
aawreness of reinforcement contingency. Six 
of these Ss were of the acute type, 2 were 
moderately chronic, and 4 were chronic-type 
patients. In the acute group of patients be- 
tween-group differences were not significant 
(p> .10) with respect to S's awareness. In 
the other groups (i.e. moderately chronic and 
chronic) there were not enough aware Ss to 
test for significance of between-group differ- 
ences. 

EXPERIMENT II 


Method 
Subjects. Sixty male patients between the age of 
20 and 35 yr. who met the following qualifications 
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TABLE 2 


REINFORCEMENT PROCEDURES FOR ACUTE AND 
CHRONIC GROUPS 


0 70 
Subjects | trials tels tras | trials 
Acute 

Ay Operant | PRVR PRVR VR 

Ay Operant PR PR VR 

As Operant VR VR VR 
Chronic 

Cı Operant | PRVR PRVR | VR 

C: Operant PR PR VR 

C; Operant VR VR VR 


Note.—A and C refer to acute or chronic, respectively. 
VR, PR, and VRPR refer to method of presentation of rein- 
forcement with VR. being verbal reinforcement, PR primary 
reinforcement, and VRPR a combination of the two. N = 10 
in each subgroup. 


were selected: (a) at least a seventh-grade educa- 
tion; (b) at least a fifth-grade word pronunciation 
level (Wide Range Achievement Test, Jastak & Bi- 
jou, 1946); (c) either delusional, hallucinating, or 
both; and (d) independently diagnosed as schizo- 
phrenic by a hospital staff psychiatrist. 

In addition, 30 of the Ss selected had been in the 
hospital less than 6 mo.; 30 had been hospitalized 
for more than 3 yr. These two major groups were 
classified as either acute (less than 6-mo. hospitali- 
zation) or chronic (more than 3-yr. hospitalization). 
Each of the major groups was subdivided into three 
treatment subgroups of 10 Ss each. Assignment to 
the treatment subgroups was on a random basis. 

Table 2 summarizes the experimental paradigm. 
The verbal operant conditioning procedure was 
again similar to that used by DeNike (1964). All Ss 
were instructed to say words singly, without repeat- 
ing words, Human nouns were reinforced after each 
S had said the first 20 words without interruption. 
Each S said a total of 230 words. 

As indicated in Table 2, acute and chronic Ss in 
each of the three treatment subgroups received a 
different mode of reinforcement. The type of pri- 
mary reinforcement used was that of candy and 
cigarettes, Verbal reinforcement consisted of the E 
saying “good” or “fine.” The combination procedure 
involved the simultaneous use of both of these types 
of feedback. It should be noted that in the last block 
of 70 trials, all Ss received only verbal reinforce- 
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ment. The Ss were given 1 min. to verbalize 
thoughts about the experiment at trial-block inter- 
vals. At the end of all trials, Ss were asked a series 
of questions as in Experiment I to elicit any evidence 
of awareness of reinforcement contingency. 


Results 


Subgroup means are presented in Table 3. 
Initial operant rates did not differ significantly 
(p> .10). Scores used in further statistical 
comparisons were determined by subtracting 
each of the initial S's operant base-line rates 
from the operant in the trial block analyzed, 
Square-root transformations were found to be 
necessary in order to gain homogeneity of 
within-group variance. 

A 2 X 3 factorial analysis was performed to 
test for between-group differences and inter- 
action effects on the data of the second block 
of 70 trials. Mode of reinforcement resulted in 
significant between-group differences ($< 
.05). Chronicity level and interaction effects 
were not significant. The same type analysis 
was made with respect to the last block of 70 
trials with similar results. In fact, as illus- 
trated in Table 3, very little change occurred 
in the last block of 70 trials in which all 5$ 
experienced the same mode of reinforcement, 
Inspection of the data suggests that, as might 
be expected from previous research, the 
groups responded very much the same with 
the exception of the chronic subgroup whi 
received only verbal reinforcement on 
trials. A series of ¢ tests comparing each sub- 
group with each of the others in the last 70 
trial block revealed that all groups were 
superior to the chronic group receiving only 
verbal reinforcement. However, the signifi- 
cance level of the acute verbal reinforcement 
chronic verbal reinforcement comparison was 
not high enough to discount chance differ- 


TABLE 3 
Suscroup MEANS IN TRIAL BLOCKS 

[9 Block 2 Block 3 

Gren 20 trials 70 tals 70 trials 70 trials 
Acute verbal reinforcement 26 6 6.6 7.0 
Acute primary reinforcement 1.9 45 9.9 8.2 
Acute primary reinforcement-verbal reinforcement 1.8 8.1 10.1 11.6 
Chronic verbal reinforcement 23 32 3.6 31 
Chronic primary reinforcement 0.6 53 7.0 1 
Chronic primary reinforcement-verbal reinforcement 35 8.3 79 61 
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ences. All other subgroups were clearly su- 
perior to the chronic-verbal-reinforcement 
group (? < .05). No other subgroup compari- 
sons (i.e., excluding the chronic-verbal-rein- 
forcement group) approached a level of sta- 
tistical significance (p > .10). As indicated in 
Table 3, the means of the subgroups receiving 
primary reinforcement or the combination 
procedure were highly similar. 

Only 10 of the 60 Ss participating in the 
study verbalized any type of reinforcement 
contingency, 5 in the acute group and 5 in 
the chronic group. The aware- and unaware- 
acute Ss exhibited very similar operant levels 
on the last block of 70 trials. However, 
chronic-aware Ss receiving primary reinforce- 
ment or a combination of primary and verbal 
reinforcement clearly were superior to their 
unaware peers (p < .01). 


Discussion 


The results of Experiment I clearly suggest 
that chronic psychiatric patients, ignoring 
diagnosis, react very little in a positive way 
to verbal reinforcement as compared with 
acute patients. The chronic patient maintains 
the status quo in terms of his verbal operant 
behavior or may even decrease the frequency 
of the particular operant involved, Age itself 
does not lead necessarily to differences in 
positive responsiveness but does interact with 
level of chronicity, especially in the middle 
level of chronicity (i.e., 1-2 yr.). 

The results of Experiment II clearly dem- 
onstrate that primary reinforcement or a 
combination of primary and verbal reinforce- 
ment was more effective in producing an 
increased operant in chronic schizophrenics 
than verbal reinforcement alone, However, 
there did not seem to be any clear differential 
effects of the mode of treatment with respect 
to acute schizophrenics. The combination of 
two modes of reinforcement as compared with 
primary reinforcement alone did not lead to 
differential effects with either the acute or 
chronic schizophrenics. 

An important question raised by the results 
was whether the schizophrenic patient who 
increased his operant rate of primary rein- 
forcement would then regress when the mode 
of reinforcement was changed to positive 
verbal feedback. In this regard, there was no 
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marked change in operant level when modes 
of reinforcement changed. However, more 
trials after the transition would be considered 
necessary to adequately test out this pos- 
sibility. The effects of combination procedures 
should be more thoroughly investigated in 
further research in view of the important 
treatment implications as well as basic 
research in learning. 

Only about 21% of the patients verbalized 
any type of reinforcement contingency. From 
both experiments there were 11 aware-acute 
versus 9 aware-chronic patients out of a total 
of 94 so classified. This is somewhat sur- 
prising in that one would expect the acute 
patients to be more “alert” as to what was 
taking place. Motivation on the part of the 
aware Ss was evaluated by interview. There 
was no clear relationship between what Ss 
said and what they did. Their answers were 
somewhat vague or contradictory. These dif- 
ficulties of assessing motivation were appar- 
ently not encountered by DeNike (1964). 
However, in a recent unpublished study by 
Wagoner (1967) using college-type Ss, dif- 
ficulties of assessing motivation to receive a 
verbal reward were encountered. When some 
Ss solved the concept, they tested out other 
hypotheses, or at times decided not to respond 
in the same way. Others did the opposite of 
what they said they were motivated to do. 
It is possible also that Ss may reach a plateau 
of positive response, as has been suggested 
by Baron (1966). For the time being at least, 
one is led to conclude that an S's expressed 
awareness may be neither necessary nor suf- 
ficient as a factor influencing an increased 
operant rate. What a person says and does 
is not always consistent. This does not argue 
against S's assessed awareness, however, as 
being an important variable involved in any 
type of learning situation. 
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PERCEPTUAL STYLES AND THEIR CORRELATES 
AMONG SCHIZOPHRENIC PATIENTS 


CARMI SCHOOLER anp JULIAN SILVERMAN? 
National Institute of Mental Health, Bethesda, Maryland 


The experimental procedures on which an earlier tridimensional model of 
schizophrenic perceptual functioning was based were replicated, other poten- 
tially relevant procedures included, and the resultant relationships delineated 
and tested through multivariate statistical techniques. Factor analysis supported 
the existence of the three dimensions of the original model—stimulus intensity 
control, scanning control, and field articulation. Through cluster analysis there 
emerged groups of Ss whose differing perceptual styles paralleled the previ- 
ously reported pattern of differences between paranoid and nonparanoid 
schizophrenics. Other results, however, made several major qualifications of 
the original theory necessary: (a) perceptual inefficiency and anchoring emerge 
as additional meaningful dimensions of perceptual functioning; (b) behavioral 
guardedness replaces coherent paranoid delusions as a significant correlate of 
scanning; and (c) stimulus augmentation is related to strong internal affect 
only among acute patients; among chronic patients the relationship is reversed. 


Through the examination of the relation- 
ships between styles of cognitive functioning 
and so-called “psychophysiological defense 
mechanisms," one of the authors has outlined 
a tridimensional model of perception that can 
be applied to the study of both psychopatho- 
logical and normal groups. A full description 
of the model can be found in Silverman 
(1964b, 1967c, 1968). In brief, the dimen- 
sions involved are stimulus intensity control, 
scanning control, and field articulation. 

Stimulus intensity control refers to the 
intensity with which perceptions are experi- 
enced; it might also be considered the in- 
tensity dimension of attention, In sensory re- 
sponse studies conducted with low or ordinary 
ranges of stimulation, hypersensitive indi- 
viduals experience stimuli intensely; they 
react strongly to such input. Under conditions 
of high-intensity stimulation, or of sudden or 
otherwise novel stimulation, their responses 


1The authors are extremely indebted to Frances 
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staff of St, Elizabeths Hospital, Washington, D. C. 
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and do not necessarily reflect the point of view of 
the Hospital. * 

Requests for reprints should be sent to Carmi 
Schooler, Laboratory of Socioenvironmental Studies, 
National Institute of Mental Health, Bethesda, 
Maryland 20014. 


are attenuated. Such individuals are labeled 
reducers. Apparently because of their auto- 
matic tendency to attenuate high-intensity 
sensory input they are relatively tolerant of 
painful stimulation (Petrie, 1967; Silverman, 
1964a). On kinesthetic figural aftereffect pro- 
cedures (Petrie, 1967; Silverman, 1967c), 
reducers underestimate the size of a “test bar” 
after having rubbed an interpolated stimula- 
tion bar for a period of time. Those on the 
other end of this dimension, who are termed 
augmenters, show a notable intolerance for 
pain but a relative tolerance for reduced 
sensory input. They respond on the kines- 
thetic figural aftereffect procedures by over- 
estimating the test bar after having rubbed 
the interpolated stimulation bar. Performance 
on these procedures can thus be used as indi- 
cators of position on the stimulus control 
dimension. 

The second underlying dimension, scanning 
control, is concerned with the extensiveness 
with which individuals sample visual stimuli. 
Extensiveness of scanning is determined either 
directly from eye-movement recordings or 
inferred from procedures that are known to 
correlate with such recordings. The indirect 
method used most frequently is a particular 
form of size-estimation procedure (Gardner & 
Long, 1962). On this procedure minimal scan- 
ning of the visual field is associated with large 
overestimations of the size of visual stimuli. 
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Field articulation, the third underlying per- 
ceptual dimension, is inferred from tasks that 
necessitate attention to certain segments of a 
stimulus field and simultaneous inhibition of 
attention to other segments of the field. This 
dimension has been conceptualized as encom- 
passing Witkin’s concept of field dependence- 
independence (Witkin, Lewis, Hertzman, 
Machover, Meissner, & Wapner, 1954). 

Studies of these response dimensions in 
heterogeneous schizophrenic populations have 
indicated that acute schizophrenics with good 
premorbid histories and coherently expressed 
delusional systems tend to show extensive 
scanning and augmentation of the intensity 
of stimulation. At the opposite pole are the 
nonparanoid patients with poor premorbid his- 
tories who tend to evidence relatively minimal 
scanning and a reduction of the experienced 
intensity of stimulation (Silverman, 1967c). 
At the time of the formulation of the tri- 
dimensional model clear-cut differences be- 
tween schizophrenic subtypes had not been 
found on the field articulation dimension. 
Recent articles by Sugerman and Cancro 

(1968) and Silverman (1968) indicate the 
unusual cómplexity of this dimension when Ss 
studied are schizophrenics. 

In the present study perceptual variables 
and psychological and personal history vari- 
ables were examined with an aim toward 
evaluating the above conceptual framework in 
greater detail. The same experimental pro- 
cedures on which the formulation was based 
were employed, and the resultant relationships 
were delineated and tested by means of multi- 
variate statistical techniques. Other potenti- 
ally relevant variables that had not been 
previously evaluated in this context were also 
included, both to extend the scope of the 
model and to evaluate possible alternatives 
to it. 

The following perceptual procedures from 
which the dependent variables were derived 
were selected because they were central in 
developing the original model: visual size- 
estimation procedure (Silverman, 1964b); a 
test of kinesthetic figural aftereffects (Silver- 
man, 19643); a measure of “field indepen- 
dence," the rod-and-frame test (Witkin, 
1965); and a measurement of category width 
(Pettigrew, 1958; Silverman, 1964c). In- 
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cluded because of their possible relevance 
were the Miiller-Lyer Illusion Test (Gardner, 
1961), the two-flash threshold procedure 
(Venables, 1963), and the Petrie small and 
large block stimulation procedures—a some- 
what different and more elaborate measure of 
kinesthetic figural aftereffects than that used 
by Silverman (Petrie, 1967). 

The independent variables were measures 
of characteristics of intellectual and psycho- 
social functioning and personal history that, 
on the basis of previous research, were ex- 
pected to distinguish schizophrenic patients 
with apparently opposite perceptual styles: 
(a) acute paranoid schizophrenics with 
good premorbid histories and coherently ex- 
pressed delusions, and (5) chronic nonpara- 
noid schizophrenics with poor premorbid 
histories (Silverman, 1967c). A separate mea- 
sure of each of these psychosocial and person- 
ality variables was included in order to assess 
the relative importance of each in predicting 
perceptual style. Selected on this basis were: 
length of hospitalization, paranoid versus non- 
paranoid subtype, Phillips Premorbid History 
Ratings (Phillips, 1953), and the separate 
items of the Wing Scale of Paranoid Behavior 
(Wing, 1961). 

Other independent variables, effective in 
predicting the performance of schizophrenics 
in other studies, were included in an attempt 
to evaluate other aspects of abnormal behav- 
ior in relation to perceptual style. Central 
among these were measures of intellectual 
functioning: the Digit Span, Similarities, and 
Vocabulary subtests of the Wechsler Adult 
Intelligence Scale (WATS; Wechsler, 1955), 
and a clinical interview-based rating of ad- 
justment (Rackow, Napoli, Klebanoff, & 
Schillinger, 1953). All of these measures ale 
concerned with the degree of efficiency with 
which the schizophrenic patient deals with his 
environment. Personal history variables that 
were examined because of their possible ef- 
fects on perceptual performance included age; 
age at first hospitalization, education, and 
social class of origin. Finally, two instru- 
ments—the Lorr Inpatient Multidimensional 
Psychiatric Scale (IMPS; Lorr & Klett, 
1965) and the Spohn Ward Behavior Scale 
(Schooler & Scarr, 1962)—were included in 
order to provide a fuller picture of each pa- 
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tient’s present symptomatology and social 
behavior. 


METHOD 
Subjects 


All Ss were white male hospitalized schizophrenics 
between 17 and 57 yr. of age who had no record of 
organic impairment. Acute and chronic patients of 
both paranoid and nonparanoid diagnostic subtypes 
were tested. All of the chronic patients had a history 
of more than 4 yr. of hospitalization and had been 
hospitalized more than 18 mo. since their most recent 
admission. Acute patients were those who were newly 
admitted to St. Elizabeths Hospital, Washington, 
D. C., who had a total hospitalization time of less 
than 2 yr, and who had not been hospitalized in 
any hospital more than 6 mo. during the present 
year. 

Somewhat different criteria were used to select the 
different subgroups. Among the chronics an attempt 
was made to test all paranoid schizophrenic patients 
who had either (a) attended college or (b) not gone 
to high school. This criterion was included originally 
in an effort to evaluate the effects of education. Non- 
paranoid chronics were selected by choosing, for each 
paranoid used, the alphabetically next nonparanoid 
patient on the roster of the same hospital division 
who met the appropriate criteria and who was of 
the same educational level as the selected paranoid S. 
This educational criterion proved, however, too re- 
strictive in terms of the number of patients available. 
Because of this, an equal number (4) of paranoid 
and nonparanoid patients who had attended high 
school were then randomly selected for inclusion in 
the sample. Of the 23 chronic paranoid patients so 
chosen, 2 refused and 1 was mute; of the chronic 
nonparanoid patients, 2 refused and 6 were mute. 
The final sample of chronic schizophrenics consisted 
of 20 paranoid and 15 nonparanoid Ss. The acute 
sample consisted of all patients admitted during a 
3-mo. period who were diagnosed as schizophrenic 
and who met the appropriate criteria of hospitaliza- 
tion, age, race, and lack of organic impairment. 
Nineteen patients met this criteria. Two nonparanoid 
patients were unable to perform the tasks and 1 
paranoid and 1 nonparanoid patient refused. The 
final sample of acute schizophrenics consisted of 9 
honparanoid and 6 paranoid Ss. 


Procedure 


Each S was seen on three occasions. During the 
first session he was interviewed by the senior author. 
On the basis of the interview he was evaluated on 
the IMPS (Lorr & Klett, 1965), the Wing Scale 
(Wing, 1961), and the Montrose Rating Scale 
(Rackow et al, 1953)? 


? In a pilot study of reliability of a group of three 
trained raters, which included the interviewer 
(Schooler), the average interobserver correlation was 
greater than .90 for all but one of the IMPS factors 
(Motor Retardation = .74); the average intercorre- 
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A battery of perceptual procedures and WAIS sub- 
tests were administered in the subsequent two testing 
sessions; each session lasted about 90 min. The intel- 
ligence tests were administered according to the de- 
scription in the WAIS manual; the format for 
administering the perceptual procedures was essen- 
tially the same as that used in the previously cited 
perceptual studies. Procedures included in the second 
session were: (a) size-estimation procedure, (b) 
WAIS Vocabulary subtest, (c) two-flash threshold 
procedure, (d) Petrie large block stimulation pro- 
cedure, and (e) Titchner Circles Illusion (Wapner & 
Werner, 1957). Included in the third session were: 
(a) WAIS Object Assembly Task, (b) Silverman 
kinesthetic figural aftereffect procedure, (c) Witkin 
rod-and-frame test, (d)  Müller-Lyer illusion, 
(e) WAIS Digit Span, (f) Petrie small block stimu- 
lation procedure, and (g) Pettigrew category width 
test. 


Perceptual Procedures 


Titchner Circles Illusion. This illusion consisted of 
two inner circles, each of which was surrounded by 
a concentric ring of circles, The standard, always 
on the left, was composed of an inner circle, 16.5 
mm. in diameter, and five surrounding circles, each 
with a diameter of 25 mm. The inner circle of the 
variable, always on the right, varied from 9.00 to 
21.00 mm. in steps of .75 mm, The S was presented 
with a chart on which were mounted seventeen 
4.5 in.X 7 in. cards on which a standard and a 
variable were drawn. The inner circle of the vari- 
able configuration was smallest at the top of the 
chart and increased in size with each succeeding card. 
The S was asked to judge for each of the 17 pairs 
whether the center circle on the right (variable) was 
smaller, equal, or larger than the center circle on the 
left (standard). On the first of two trials, S made his 
judgments beginning at the top of the chart and 
working down (D). On the second trial he began 
with the bottom pair and worked up (A). Perform- 
ance scores were calculated in terms of the number 
of the card on which the S first responded to the 
illusion, if he responded at all. Card 11 was the one 
on which standard and variable were equal. Two 
scores were calculated for each S: (a) the average 
of the two responses (Titchner Mean) and (b) the 
difference between the score on the descending series 
and the score on the ascending series (Titchner D-A). 

Müller-Lyer Illusion. The Müller-Lyer Illusion is 
an illusion of length: two objectively equal lines ap- 
pear unequal when at the end of one line small 
lines are drawn at acute angles to it, and at the end 
of the other line small lines are drawn obtusely to it 


lation among observers across all of the factors being 
.91. A similarly high correlation with the ratings of 
other trained raters had previously been obtained 
with the Montrose Rating Scale, Although no direct 
interobserver reliability tests were made on the Wing 
Scale, the published evidence indicates that there is 
a high degree of agreement among professionally 
trained users. 
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(Gardner, 1961). The experimenter (Z) gradually 
changes the length of the variable line until S says 
that it is the same length as the standard line. On 
half of the trials the variable line was started at a 
point distinctly smaller than the standard (ascending 
series), while in the other half of the trials it was 
started a point distinctly larger than the standard 
(descending series), The starting positions of the 
trials were varied in the order DAAD. Eight trials 
were presented. An S's score on a given trial was the 
number of millimeters he erred from a correct judg- 
ment. Two scores were obtained: (a) the mean error 
based on all trials (Müller-Lyer mean) and (b) the 
mean error on descending trials minus the mean 
error on ascending trials (Müller-Lyer D-A). 

Size-estimation procedure. Following Silverman's 
(1964c) procedure, S was asked to adjust a circular 
patch of light so that it appeared to be the same 
size as a hand-held disk. The light patch appeared 
on a double-ground glass screen 5 in. square that 
was mounted on a 9-in. square section of the end of 
a box. The 10-w. concentrated arc lamp (Sylvania 
C10 DC) was focused on the screen from inside the 
box to produce a smooth-edged circle of uniform 
brightness, The size of the circular patch projected 
on the glass screen was varied by having a circle 
cut in a piece of metal move back and forth on a 
spiral track as S turned a crank that was connected 
to the track. 

Three disks were used: (a) a 10-gm. "neutral" 
gray disk 4.92 cm. in diameter, (b) a 65-gm. “neu- 
tral" gray disk 4.77 cm. in diameter, and (c) a 
10-gm. disk covered with soft black wool 5.1 cm. in 
diameter. The S held each disk in his left hand, 15.5 
in. away from his eyes while he adjusted the variable 
circle of light that was also 15.5 in. away from his 
eyes. Twelve judgments were made in four series 
of three judgments each, with a random order of 
disk size in each series. The order of ascending and 
descending judgments was DDAA. The S was in- 
structed to adjust the circle of light so that it looked 
exactly like the size of the disk he was holding in 
his hand. At the end of each trial, S’s estimation was 
recorded, the light patch was turned off, and the 
mechanism reset at either ascending or descending 
starting position. No time limit was specified. Two 
scores were calculated: (a) the mean response on 
the 12 trials and (b) the difference between the mean 
score of the ascending trials and the mean score of 
the descending trials, 

Kinesthetic figural aftereffect procedures. In the 
first procedure (Silverman, 1964a), four stimulus bars 
were mounted on saw horses at a convenient height 
for a standing S. The two test bars, between which 
the S stood as the procedure began, were a 1.5-in.- 
wide standard bar on his left and a comparison bar 
with twenty-six js-in. gradations on his right. The 
wide end of the comparison bar, which was initially 
closest to S, measured 2.5 in.; the narrow end of the 
bar was 1 in. wide. The ninth gradation from the 
far end (narrow end) of the comparison bar was 
equal to the standard bar (1.5 in. wide). Directly 
behind the two test bars were two stimulation bars— 
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a .5 in.-wide bar on the left and a 3-in. bar on the 
right. All four bars were approximately 50 in. long, 
The blindfolded S first made four base-line judg- 
ments of where the graduated comparison bar felt 
just as wide as the standard bar. These four judg- 
ments were made from descending and ascending 
starting positions in DAAD order. The S then moved 
to the two stimulation bars and rubbed them simul- 
taneously, three times per second for 30 sec., in time 
to a metronome. Following this interpolated stimula- 
tion he again judged the width of the 1.5-in. stan- 
dard bar. This sequence was repeated until four 
judgments of the standard bar had been made fol- 
lowing interpolation. The starting positions for these 
poststimulation judgments were the same as for the 
base-line series. Two separate scores were computed. 
The first score was obtained by calculating the dif- 
ference in the chosen width of the comparison bar 
before and after each of the four interpolated trials 
(KFA total). An S’s score was the mean value for 
the four trials. The mean difference was also obtained 
for the ascending trials separately (KFA ascending). 

The second figural aftereffects procedure employed 
the apparatus and technique used by Petrie (1967). 
The blindfold S made a series of six base-line judg- 
ments of the size of a standard block. Judgments 
were made by moving the left hand along a tapered 
block until a point was reached that felt equal in 
width to the standard block held in the right hand. 
Following this the S rubbed a stimulation block for 
a period of 90 sec., after which the initial procedure 
of judging the size of the standard block was re- 
peated. The judgment procedure was again repeated 
following a 15-min. rest period. During the 45 min. 
preceding the base-line test and during the 15-min. 
rest period, S was required to avoid using his fingers 
in any way. This procedure was presented in both 
the second and third sessions, a 48-hr. interval sepa- 
rating them. In Session 2, in which the "large block 
stimulation test" was given, a 13-in.-wide block was 
used as the standard and a 24-in.-wide block for 
stimulation. In Session 3, during which the "small 
block stimulation test" was presented, a 2-in.-wide 
block was used as the standard and a 1-in.-wide 
block was used for stimulation. Two scores were 
computed from this experiment: (a) a difference 
score between the mean of the four judgments im- 
mediately preceding large block stimulation and the 
mean of the four judgments immediately following 
large block stimulation (Petrie large block stimula- 
tion), and (b) a difference score between the means 
of the four judgments immediately preceding and 
immediately following small block stimulation (Petrie 
small block stimulation). 

Tuwo-fash threshold procedure. The two-flash 
threshold was measured using a Grass Photo Stimula- 
tion Model PS2 with intensity set at 4. The lamP 
was located approximately 10 ft. from S. The S was 
asked to report whether he saw one or two flashes 
of light. Two trials were presented under ordinary 
room-illumination conditions. Each trial consisted 0 
a different series of 10 two-flash intervals ranging 
in duration from 20 to 110 msec. The first series 
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started with very brief intervals between flashes, 
and the duration between pairs of flashes was gradu- 
ally increased. The second series started with long 
intervals between flashes, and the duration between 
pairs of flashes was gradually decreased. The S’s 
two-flash threshold is the duration of the shortest 
interval at which two flashes are discriminated. Two 
scores were computed: (a) the average of the ascend- 
ing and descending thresholds (two-flash mean) and 
(b) the difference between the descending and 
ascending two-flash thresholds (two-flash D-A). 

Measure of category width procedure. The measure 
of category width consists of 20 multiple-choice items 
(Pettigrew, 1958). In each item an average value 
of a category is stated (e.g. the average speed of 
birds in flight) and S is asked to choose one of four 
alternatives for (a) the upper value of the category 
(the speed of the fastest bird) and for (b) the lower 
value of the category (the speed of the slowest bird). 
The range between these two values defines the 
limits of the category. The S was given the test 
booklet containing the items to be judged. Each 
item was also read to him before he made his choice. 
The scores used from these data were the Category 
Width II scores. This scoring used the 10 items that, 
according to Pettigrew's factor analysis of the 20- 
item questionnaire, were less influenced by quantita- 
tive reasoning ability (Silverman, 1964c). 

Rod-and-frame test. The apparatus used for the 
Rod and Frame Test consists of a luminous square 
frame, pivoted at its center so that it can be tilted 
to left or right. Also pivoted at the center, but 
moving independently of the frame, is a luminous 
rod, During testing the room is darkened completely 
so that all the S can see is the frame and rod that 
are presented in various tilted positions. The S is 
required to adjust the rod to an absolute vertical 
position by manipulating a switch mounted in a 
handheld control box. The Ss were first given two 
learning trials, after which they were given eight 
trials,? using the standard in which the rod and 
frame were preset at standard positions used by 
Witkin et al. (1954). The score computed was the 
average number of degrees of deviation from the 
Vertical for the eight trial series (Rod and Frame 
mean). 


METHOD or ANALYSIS 


Of the 14 dependent variables that were derived 
from the perceptual procedures, no more than two 
Scores were derived from any single test; there were 
no linearly interdependent subsets. Two methods 
Were used to analyze these variables: (a) factor 
analysis was employed to determine whether the 
Postulated perceptual dimensions, stimulus intensity, 
Scanning, and field articulation, would emerge as 


8The rod-and-frame settings for the eight trials 
Were: (1) Frame 242°, Rod 242°; (2) Frame 242°, 
Rod 298°; (3) Frame 298°, Rod 298°; (4) Frame 
298°, Rod 242°; (5) Frame 242°, Rod 242°; (6) 
Frame 242°, Rod 298°; (7) Frame 298°, Rod 298°5 
(8) Frame 298°, Rod 242°. 
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empirical factors, and (b) cluster analysis was em- 
ployed to group Ss on the basis of similarity of 
perceptual performance in order to determine whether 
the emergent groups would have the predicted 
characteristics. 

The factor analysis was of the principal component 
type, with unities placed on the diagonal of the 
correlation matrix. Principal components were com- 
puted for each eigenvalue greater than .8, and all 
factors with eigenvalues of more than 1.0 were 
rotated to simple structure through the Varimax 
procedure. Individuals were assigned factor scores on 
each factor using the procedure described by Ryder 
(1965) and correlations computed between the factor 
scores and the independent variables, 


RESULTS 
Factor Analysis 


Five factors emerged. The first we have 
interpreted as perceptual inefficiency, The 
perceptual tests with reliable loadings on this 
factor (greater than .40) include the rod-and- 
frame test; the two-flash threshold proce- 
dure; two measures of high anchoring effect 
on tasks involving discontinuous stimulus 
presentation—the Titchner Circles Illusion 
(D-A) and two-flash discrimination (D-A)— 
and the Petrie large block stimulation score. 
The Ss with high scores on this factor 
evidenced greater error in rod-and-frame per- 
formance, higher two-flash thresholds, greater 
anchoring, and greater errors on the Petrie 
procedure. On the nonperceptual measures, 
they showed poor performance on the WAIS 
subtests and were rated high on the disorien- 
tation scale of the IMPS and poor in overall 
adjustment as measured by Montrose Rating 
Scales. The Ss with high Factor 1 scores were 
older, were institutionalized longer, were more 
likely to have fathers in low-status occupa- 
tions, and were more poorly educated. Thus 
this first factor appears to reflect a general 
dimension of effciency-inefficiency, involving 
not only perceptual but also intellectual and 
social competence (Schooler & Parkel, 1966). 

The second factor that emerged was inter- 
preted as stimulus augmentation. It had 
loadings above .40 on both of the kinesthetic 
figural aftereffect measures (A) (B), on the 
visual size-estimation measure, and on the 
category width measure (see Table 1). Indi- 
viduals scoring high on Factor IT tended to 
enlarge (or at least not to attenuate) the ex- 
perienced intensity of sensory stimuli and to 
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TABLE 1 
PERCEPTUAL FACTORS: LOADINGS 


T I Il V 
High score i Hr IV i 
Measure represents Perceptual | Sumus | Scanning | Anchoring | gepenteney 
Visual size-estimation 
mean Overestimation .022 438 —.629 -110 069 
Visual size-estimation 
D-A Anchoring —.029 — 315 —.660 —.146 —.141 
Titchner mean Minimal illusion —.162 —.076 650 058 257 
Titchner D-A Anchoring 431 —.057 .510 207 —.184 
Miiller-Lyer mean Error —.072 —.106 417 —.224 A6 
Müller-Lyer D-A Anchoring —.016 —.001 .087 .830 —,042 
Rod-and-frame mean | Error .594 —.021 227 —.258 497 
Petrie large block 
stimulation Overestimation 677 —.022 053 —.315 — 326 
Petrie small block 
stimulation Overestimation 127 .065 —.029 —.147 —.674 
Two-flash mean High threshold .780 014 —.130 .224 219 
Two-flash D-A Anchoring .612 —.232 —.322 468 .080 
Kinesthetic figural 
aftereffect ascending| Overestimation 077 .882 .080 017 —419 
Kinesthetic figural 
aftereffect total Overestimation —.228 159 —.169 042 —.085 
Category width II Broad categories .066 A7 231 .565 —.064 
% of variance 
accounted 14.944 13.561 12.845 11.410 11.108 
TABLE 2 
PERCEPTUAL FACTORS: SIGNIFICANT CORRELATES 
Measure High score P. y 1 Sti a Til Iv stimulus 
represents incffic CRM ubi Aae Ye on | Scanning Anchoring dependency 
WAIS Digit Span Better —.433 E 
WAIS Object Assembly — | Correct — 365 —.253 — 217 
WAIS Similarities Correct —91 —,242 
WAIS Vocabulary Correct — 484 — 63h 
Chronicity More 383 334 
IMPS apathy) and 
Retardation More 233 
IMPS Paranoid Projection | More —281 
IMPS Perceptual s 
Distortion More —.242 
IMPS Hostile Belligerence | More | 333 251 
IMPS Disorientation | More En 332 : 
es Conceptual j 
isorganization More M 
Staff Social More s 
Peer Social More — 308 
Wing Poverty of Speech | More 246 1 240 
Wing Incoherent Speech | More p — 383 
Wing Flat Affect More 262 » 345 
Montrose Rating Scale Good ad- A17 $ 
justment 
Age (first admission) older s — 262 —.262 
Age (year born) Younger —.304 300 $ 
Education Higher —.305 o —.301 
Father’s occupational 
Qum. Lower 367 
Total time hospitalized | Longer “441 231 
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use broader categories in classifying perceptual 
stimuli. The Ss scoring high on this factor 
also performed poorly on the WAIS Object 
Assembly subtest. Apparently, classifying per- 
ceptual input in broad categories is associ- 
ated with difficulty in correctly organizing the 
pieces of the Object Assembly task. In the 
interview situation, Ss with high Factor II 
scores were rated as evidencing flat affect 
and poverty of speech (Wing Scale). They 
were also younger at first admission and older 
at the time of testing.* 

Factor III was interpreted as a scanning 
factor. Overestimation on the visual size- 
estimation task, which is significantly cor- 
related with minimal scanning (Silverman & 
Gardner, 1967), has a very high negative 
loading on this factor. The Ss with high posi- 
tive scores on this factor were regarded as 
relatively extensive scanners. They tended to 
underestimate and to anchor less on the visual 
size-estimation task, but anchored more on the 
Titchner Circles Illusion, despite being less 
prone to it (Table 1). 

The Ss with high Factor III scores were 
rated on the IMPS as more likely to show 
apathy, retardation, and disorientation, and 
less likely to show conceptual disorganization, 
perceptual distortion (hallucinations), and 
paranoid projection. They were less likely to 
evidence incoherent speech (Wing Scale) and 
were also rated by ward attendants as socializ- 
ing less with both staff and peers (Table 2). 
Apparently, individuals who extensively scan 
their surroundings are guarded both in their 
verbal communications and in the extent to 
which they interact with others. 

Factor IV with its high loadings on Miiller- 
Lyer (D-A), two-flash (D-A), and category 
width estimation was interpreted as an an- 
choring factor (Table 1). Individuals scoring 
high on this factor apparently employed broad 
categories in making perceptual judgments 
along specified dimensions and thus more 
readily perceived as identical phenomena that 
were similar but not identical. The Ss scoring 
high on this factor were younger on first ad- 


*The apathy ratings and diagnosis correlates of 
Factor II did not quite reach statistical significance, 
although high Factor II scoring Ss tended to be 
Tated as more apathetic and nonparanoid ($ < .06). 
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mission and were rated on the IMPS as hostile 
(Table 2) and belligerent. 

Factor V was interpreted as a “stimulus 
dependence” factor. The Ss scoring high on 
this factor evidenced high error scores on the 
rod-and-frame test and the Müller-Lyer 
Illusion. They also evidenced marked kines- 
thetic size underestimations on the Petrie 
procedure following stimulation with a small 
block. The loading of —.326 on the Petrie 
large block stimulation condition suggests that 
high-scoring Ss may also have a tendency 
toward stimulus-intensity reduction (Table 1). 
The negative, though relatively insignificant, 
loading of figural aftereffects (A) on this 
factor is consistent with the latter possibility. 
High-stimulus-dependent Ss evidenced poor 
performance on the WAIS subtests, although 
the correlations of the subtests with Factor V 
were somewhat lower than with Factor I. 
These Ss also were rated as hostile and bel- 
ligerent (IMPS) and as evidencing flat affect 
and poverty of speech (Wing). Their educa- 
tional level was lower and they had greater 
total hospitalization time (Table 2). 

In summary, the factor analysis of the 
battery of perceptual tests indicates the exis- 
tence of (a) the scanning, stimulus augmen- 
tation, and stimulus dependence factors spe- 
cifically postulated in the tridimensional 
model (Silverman, 1964b, 1964c, 1967c) ; (b) 
an anchoring factor * congruent with the find- 
ings of Boardman, Goldstone, Reiner, and 
Fathauer (1962); and (c) a perceptual in- 
efficiency factor congruent with the findings 
of Johannsen, Friedman, and Liccione (1964) 
and Schooler and Scarr (1962). 


Cluster Analysis 


Cluster analyses of Ss using the hierarchical 
grouping method (Ward, 1963) were carried 
out on the performance scores for the 14 (non- 
redundant) perceptual variables. As noted pre- 
viously, this form of analysis was carried out 
in order to examine the characteristics of the 
groups that emerge when Ss are classified 


5 This factor is also congruent with the works of 
M. Buchsbaum, J. Silverman, and R. Henkin, “Con- 
trast Effects on the Auditory Evoked Response and 
Its Relation to Psychophysical Judgment,” paper 
submitted for publication. 
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TABLE 3 
HIERARCHICAL CLUSTERING OF SUBJECTS BASED ON PERCEPTUAL VARIABLES: 
SIGNIFICANT DIFFERENCES BETWEEN CLUSTERS CONTAINING 
THREE OR More SUBJECTS 
7-cluster analysis 4-cluster analysis 
Variable 
1s 2b 5e 64 1'e 2t 
Perceptual 
Size estimation (Mean) 5.16 5.26 5.83 5.04 5.34 
Size estimation (D-A) 1.63 1.01 5.25 
Titchner (D-A) —.33 —.50 0.0 
Miiller-Lyer (Mean) 2.74 1.81 2.39 | 241 1.89 
Miiller-Lyer (D-A) —609 | —254 | —2.23 
Rod-and-frame (Mean) 10.53 7.62 17.08 15.49 8.85 
Petrie large block stimulation —.59 | —140 1.77 
Two-flash threshold (Mean) 53.66 83.25 118.33 
Two-flash threshold (D-A) —18.67 5.00 50.00 
Kinesthetic figural aftereffects (A,A) 1.42 2.36 .50 .62 2.06 
Kinesthetic figural aftereffects (Augmenting) 1.00 2.19 147 .66 242 
Category width II 29.20 38.90 28.67 28.38 37.57 
Nonperceptual | 

IMPS—Perceptual Distortion 3.13 4.50 24.00 5.00 
IMPS— Conceptual Disorganization 1.60 3.20 12.33 1.38 
WAIS—Similarities 12.87 14.80 7.67 7.50 
Wing—Incoherent Speech 1.27 1.60 3.67 1.25 1.25 1.87 
Montrose Rating Scale 22.87 21.50 14.33 18.00 
Spohn Peer Social 2.60 5.00 
Age (year born) 22.27 28.30 12.33 14.25 | 1929 | 2622 
Age (first admission) 30.46 26.48 
Factor I: Perceptual inefficiency —.08 —.05 20 AS 
Factor II: Stimulus augmentation —.02 06 —.05 —.13 —.07 05 
Factor III: Scanning 01 —.03 =31 09 04 —.07 
Factor IV: Anchoring —41 07 01 .00 —.07 .06 
Factor V: Stimulus dependency 02 —.04 —.03 —.01 


a Cluster 1, N = 
b Cluster 2, N 
e Cluster 5, N 
à Cluster 6, N 
* Cluster 1’, N 
{Cluster 2^, N 


4. 
3. 


according to their perceptual performances. 
The highest degree of within-group similarity 
and between-group difference occurred when 
the sample was broken down into seven 
clusters. Four of the clusters contained more 
than two individuals. These four form a very 
interesting pattern in terms of the differences 
among them in their factor scores. It can be 
seen from Table 3 that the clusters can be 
divided into two classes in terms of their 
perceptual efficiency scores, Clusters 1 and 2 
being significantly more perceptually efficient 
than Clusters 5 and 6. Since Cluster 2 re- 
sembles Cluster 5 in that Ss in these clusters 
evidenced relatively minimal scanning, it is 
not surprising that when hierarchical cluster- 
ing is continued from seven to four, two new 
clusters emerge containing more than two 
individuals. These two new clusters are the 


result of collapsing the original four major 
clusters on the basis of similarities on the 
scanning dimension (Factor III) and by dis- 
regarding differences on the perceptual et 
ficiency dimension (Factor I). Cluster 1 
represents the combination of Clusters 1 and 
6 with the addition of the one individual who 
comprised Cluster 4 (n = 24). Cluster 2' rep- 
resents the combination of Clusters 2 an 
(n = 23).* Analysis of the differences between 
Ss in Clusters 1’ and 2' indicates that Cluster 
1’ Ss evidenced significantly greater scanning 
(Factor III), narrower perceptual categories 
(Factor II), and less anchoring (Factor I 
(Table 3). On the individual perceptual tests) 
Cluster 1’ Ss showed more error on 
Miiller-Lyer and rod-and-frame tests, 
augmentation on the kinesthetic figural after- 


$See Footnote 4. 


SCHIZOPHRENIC PERCEPTUAL STYLES 


effects procedure (B), and narrower category 
widths on the Pettigrew procedure. They were 
also rated as evidencing less social interaction 
and less incoherent speech and were older 
both at time of testing and at first admission 
(Table 3). The differences in the perceptual 
performance of the individuals in these two 
clusters parallel the differences between the 
perceptual styles of paranoid and nonparanoid 
types described by Silverman (1964b, 1967c). 
The perceptual style seen as distinguishing 
the paranoid schizophrenics is similar to that 
manifested by Cluster 1’ Ss. Comparison of 
the chart diagnoses of individuals in the two 
clusters reveals a greater, though not quite 
significantly so, proportion of paranoids in 
Cluster 1’ (14 paranoids, 10 nonparanoids in 
Cluster 1’, 11 paranoids and 12 nonparanoids 
in Cluster 2’). The picture is complicated, 
however, by the fact that the two clusters 
do not differ on any of the symptom variables 
that are conventionally used to distinguish 
paranoids from other schizophrenics (ratings 
of coherent delusions on the Wing Scale, and 
of paranoid projection, hostile belligerence, 
and grandiose expansiveness on the IMPS). 
The only significant differences in symptoma- 
tology between the clusters are the smaller 
amounts of incoherent speech and social inter- 
action with peers shown by Cluster 1’ Ss. 


Discussion 


Before elaborating on the main findings of 
this investigation, some comments are in order 
regarding the different factor loadings found 
among a group of ostensibly similar variables 
—the Petrie large block and small block stim- 
ulation conditions and the kinesthetic figural 
aftereffect procedure employed by Silverman 
(19642). On Factor I, errors of overestima- 
tion in the large block stimulation condition 
are associated significantly with other kinds 
of perceptual errors and with intellectual in- 
efficiency. Performance scores on the small 
block stimulation condition do not load sig- 
nificantly on Factor I (Table 1). They do 
load, however, on Factor V where errors of 
underestimation in the small block stimula- 
tion condition are associated significantly with 
marked stimulus dependence. Note that on 
both factors, Ss evidencing a high degree of 
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error on the Petrie test are greatly influenced 
by the size of the interpolated stimulation bar 
employed. The fact that on Factor V under- 
estimation errors on the large block stimula- 
tion condition also are associated with stimu- 
lus dependence (although to a lesser degree) 
suggests that consideration of the Petrie 
formulation of reduction-augmentation is 
necessary for an adequate conceptualization 
of the stimulus dependence factor. Indeed, 
this relationship between augmenting-reduc- 
ing behavior and stimulus dependence has 
been specifically hypothesized elsewhere (Sil- 
verman, 1967a). Individuals who evidence 
marked reduction of the experienced intensity 
of stimulation as inferred from the Petrie 
procedure are expected to evidence a disposi- 
tion to be less analytic or differentiated in 
their perceptual judgment, that is, to be 
more stimulus dependent. The fact that the 
kinesthetic figural aftereffect procedure em- 
ployed by Silverman has its major loading on 
a different factor than the Petrie procedure is 
not comprehensible at this time. It may be 
that reduction-augmentation, inferred from 
figural aftereffect procedures, is affected by 
two different processes, one being related to 
analytic-differentiated responsiveness (Factor 
V), the other having to do with breadth of 
categorization behavior (Factor II). Such 
speculations are testable with a variety of 
techniques (Buchsbaum & Silverman, 1968), 
and an investigation of this specific problem 
is now being carried out in our laboratory. 

In the main, the results of this investigation 
provide support for the existence of the three 
dimensions postulated in the original model 
(Silverman, 1967c). Stimulus intensity con- 
trol, scanning control, and field articulation 
appear here in essentially the same form as 
stimulus intensity augmentation (Factor ID, 
scanning (Factor III), and stimulus depen- 
dence (Factor V). In addition, two other fac- 
tors emerged that we termed perceptual effi- 
ciency (Factor I) and anchoring (Factor IV). 
The perceptual efficiency factor, which ac- 
counted for the highest proportion of variance 
in this schizophrenic population, has not been 
found in comparable factor-analytic studies 
with normal Ss (Gardner, 1961; Gardner, 


Holzman, Klein, Linton, & Spence, 1959). It 
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emerged as the largest factor in this study 
quite probably because it taps the disruption 
of perceptual functioning, which is a promi- 
nent characteristic of many schizophrenic Ss. 
The emergence of anchoring, the second di- 
mension not found in previous factor-analytic 
studies, is due quite likely to the inclusion of 
specific anchoring scores (D-A), which also 
were not included in previous relevant stud- 
ies. The importance of this factor as a basic 
dimension of perceptual behavior is suggested 
by the high loading on the factor (.565) of 
the Pettigrew category width test. This posi- 
tive correlation between anchoring scores and 
broad category width scores suggests the ex- 
istence of a response dimension on which 
people differ in the extent to which they per- 
ceive stimuli as being equivalent, that is, as 
belonging to the same category. 

Further support for the empirical founda- 
tion of the original tridimensional model is 
derived from the results of the cluster analy- 
sis. The pattern of differences between per- 
ceptual styles of Cluster 1’ Ss and Cluster 2’ 
Ss parallels the pattern of previously reported 
differences between paranoid schizophrenics 
and nonparanoid schizophrenics. As noted, 
however, the two clusters did not differ in 
prevalence of delusional symptoms that are 
conventionally accepted as the distinguishing 
characteristic of paranoid schizophrenia. 
Rather, the clusters differed in a group of 
symptoms that can be interpreted as inter- 
personal guardedness: Cluster 1’ Ss (para- 
noidlike) exhibiting both more coherent speech 
in the clinical interview and less social inter- 
actions with staff and fellow patients. Thus, it 
could be that previously found relationships 


TABLE 4 


CORRELATES OF STIMULUS AUGMENTATION (Factor II) 
AMONG CHRONIC AND ACUTE PATIENTS 


Scale Acute* 


Chronich 


——— ——— 
IMPS Anxious Intropunitiveness 


IMPS Incoherent Speech p 
IMPS Disorientation —.554 
Wing Flat Affect -051 
Wing Impoverished Speech .055 


é ia = 15. A correlation of .514 is needed for p <.05, two- 
ailed, 
5 im 35. A correlation of .335 is needed for p <.05, two- 
ailed. 
* The difference between the correlation fi 
is p «.005, two-tailed. Bio ng evo 
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between diagnosis of paranoid schizophri 
and the perceptual style evidenced by 
reaction type were the result of a tendency 
diagnose patients showing relatively 
behavior and coherent speech as paranoid." 
existence of such a relationship is suppo 
by examination of some of the correlates 0 
the scanning factor (Factor IIT) among th 
entire sample (Table 2). The Ss who sc 
high on this factor showed less inco| 
speech and less social interaction with peer 
and staff. They also showed less concep’ 
disorganization, less perceptual distortion, 
in fact, significantly less paranoid projec 
on the interview-based IMPS. Whether 
latter finding means that high scanners 
actually less prone to paranoid delusions tha 
low scanners or whether it merely reflects 
fact that they are more guarded and 
likely to expose their delusions in an im 
view situation remains an unsolved ques 
However, it is apparent that a highly 
cant relationship exists between extel 
visual scanning of the environment and 
group of traits that we have interpre 
behavioral guardedness. 

Another noteworthy finding is the positi 
correlation obtained between Factor II, # 
stimulus augmentation factor, and flat afi 
on the Wing Scale (see Table 2). This cot 
lation was, at first glance, somewhat co 
ing, for it was expected, on the basis of p! 
ous research (Silverman, 1967b), that s 
phrenic Ss who augment the experienced it 
tensity of stimulation would not tend to 
dence flat affective responsiveness. The 
work, however, dealt primarily with 
patients. Further analyses, in which acute 
chronic patients were considered separ 
yielded fascinating results. Acute pat 
with high stimulus augmentation scoi 
showed significantly more anxious intropU 
tiveness and incoherent speech and si 
cantly less disorientation (Table 4). In 
trast, chronic patients with high S 
augmentation scores showed significan 4 
anxious intropunitiveness and signite: 
more flat affect, poverty of speech, and 
orientation, Thus, among acute schizop? 
patients, stimulus augmentation is rela 
the presence of inner turmoil, incohei 
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speech, and minimal environmental disorien- 
tation. On the other hand, chronic schizo- 
phrenic Ss showing stimulus augmentation 
appear to have inner lives that are flat and 
undifferentiated. The distinct salience of ex- 
ternal stimuli against such a flat background 
may make such stimuli seem stronger and 
differences between them irrelevant. In the 
two populations the same perceptual charac- 
teristic—stimulus augmentation—is thus as- 
sociated with quite different affective response 
characteristics, 

In summary, once certain fairly major 
qualifications are made, the results of the 
present study can be seen as supporting the 
original tridimensional model of perceptual 
functioning as it applies to hospitalized schizo- 
phrenics. One major qualification is an in- 
crease in the number of underlying dimen- 
sions; perceptual inefficiency and anchoring 
emerge as meaningful factors in addition to 
the three previously delineated ones—stimu- 
lus augmentation (stimulus intensity control), 
scanning (scanning control), and stimulus de- 
pendence (field articulation). Other modifi- 
cations of the earlier theory made necessary 
by our findings are in the description of the 
interrelationships among certain of the person- 
ality and perceptual style variables. Behav- 
ioral guardedness replaces coherent paranoid 
delusions as a significant correlate of scan- 
ning. Stimulus augmentation and broad cate- 
gory width estimations are positively related 
to a high degree of anxious intropunitiveness 
only among acute patients. Among chronic 
patients the relationship is in the opposite 
direction. 

Although these qualifications are substan- 
tial, and further research will probably result 
in even further modifications, the multidi- 
mensional description of perceptual function- 
ing presented here has partially withstood 
the test of rigorous replication and certainly 
Provides as promising a way of meaningfully 
accounting for the variance among schizo- 
Phrenics as is now available. 
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RELATIONSHIP BETWEEN PSYCHOSOCIAL ADJUSTMENT 
AND PERCEPTION OF MATERNAL ATTITUDES ` 


LENORE BEHAR ? anp ROGER F. SPENCER 
University of North Carolina 


A sample of 26 adult male hemophiliacs was divided into two groups on the 
basis of their psychosocial adjustment. The poorly adjusted group rated their 
mothers as significantly higher in tendencies toward dominating, overprotect- 


ing, and ignoring their children. 


In studying the psychosocial adjustment of 
hemophiliacs, it was observed that these Ss 
frequently were exposed to similar patterns of 
child rearing. Support was found in earlier 
studies (Agle, 1964; Browne, Mally, & Kane, 
1960) that mothers of hemophiliacs tend to 
overprotect and infantilize their sons. In fact, 
studies of juvenile diabetics (Swift, Seidman, 
& Stein, 1967) and of other children consid- 
ered to be physically vulnerable (Green & 
Solnit, 1964) also revealed the mothers’ ten- 
dencies to protect, infantilize, and dominate 
their children. These attitudes seemed initially 
based on the realistic possibility of physical 
danger to the child; however, even when the 
children in these studies were old enough to 
fend for themselves, the mothers continued 
their protective roles. 

The mothers’ attitudes in these studies 
seemed to reflect feelings of inadequacy as a 
childbearer and guilty feelings of having 
burdened a child with a chronic illness. Al- 
though feelings of guilt and responsibility 
were also apparent in mothers of diabetics 
and other chronically ill children, they may 
be more salient in mothers of hemophiliacs, 
for in this disease the defects in blood clotting 
are transmitted as sex-linked, recessive ge- 
netic traits by the carrier mother to the son. 
That is, the mother passes on a genetically 
determined condition which she herself does 
not have. 

Hemophilia is a life-long, serious condition, 
found almost exclusively in males and usually 


1The study was supported by Medical School 
Grant 324 UNI-1-218 from the University of North 
Carolina, 

2 Requests for reprints should be sent to Lenore 
Behar, Department of Psychiatry, University of 
North Carolina Medical School, Chapel Hill, North 
Carolina 27514, 


discovered in infancy or early childhood. It is 
characterized by the persistence of bleeding 
following an injury or by “spontaneous” 
bleeding into joints and soft tissues (Ratnoff, 
1963). Although modern medical methods of 
handling bleeding episodes have decreased 
the mortality considerably, serious problems 
still occur—most frequently crippling of 
joints. In order to avoid the danger of bleed- 
ing, it may be necessary for a mother to re- 
strict her son’s physical activity, and in many 
situations to inhibit aggressive behavior. Hav- 
ing experienced these restrictions of childhood 
activity, coupled with frequent periods of be- 
ing confined to bed and waited on, many adult 
hemophiliacs tend to have strong passive de- 
pendent tendencies (Agle, 1964; Browne, 
Mally, & Kane, 1960). Mattsson and Gross 
(1966) have found that the levels of adjust- 
ment achieved are closely related to child- 
rearing attitudes and anxiety in the mother. 

Further investigation of the mother-child 
relationship should be fruitful in understand- 
ing the level of social adjustment achieved by 
adult hemophiliacs. Dealing with the adult 
patient offers the advantage of more crystal- 
lized personality characteristics. Understand- 
ing how he perceives his mother now and how 
he perceives her child-rearing techniques in 
retrospect should yield clues pertaining to 
social adjustment. 


METHOD 


In the present study, 26 male Ss were divided into 
two groups on the basis of their psychosocial ad- 
justment, which was measured by the Social Matur- 
ity Index of the California Personality Inventory 
(Gough, 1966). The scores were divided at the 
median, yielding 14 Ss rated as having low social 
maturity and 12 as having high social maturity. In 
this sample the range of scores was 40-58, with a 
median of 47. The mean of the low group was 45.57 


471 


472 
TABLE 1 
DESCRIPTION OF THE SUBJECT GROUPS 
Group 
Item 
High social maturity | Low social maturity 
(n = 12) (n = 14) 
Age M = 29.0 M = 32.9 
(range 16-75) (range 17-67) 
Social class* | M = 3.5 M = 40 
Education M = 12th grade | M = 10th grade 
Severity of 4 mild 5 mild 
illness 8 severe 9 severe 


a Measured on a 5-point scale, with 1 representing the highest 
class (Hollingshead & Redlich, 1958). 


and that of the high group, 53.16, the difference 
being statistically significant beyond the .001 level 
(£78.56). In other respects, the two groups were 
rather equally matched, as seen in Table 1. 

The Ss in both groups were administered the Mark 
Scale (Harris, 1955), a scale designed to measure S’s 
perception of his mother's child-rearing attitudes. 

‘The scale consists of 52 items, items which were 
rated by a panel of experts to reflect dominating, 

. Overprotecting, and ignoring on the part of the 
person being rated—in this case the mother and the 
ideal mother. On this test, S is asked whether his 
mother would agree or disagree with the following 
types of statements: 


A mother should never be separated from her 
child. 

If a child is quiet for a few minutes, a mother 
should immediately find out what he's thinking. 


RESULTS AND DISCUSSION 
Table 2 shows the differences between the 


two groups on this questionnaire. The high- 
social-maturity group consistently rate the 


TABLE 2 


PERCEPTION OF MATERNAL ATTITUDES: MEAN 
NuxBERS Or ITEMS INDICATING NEGATIVE 
CHILD-REARING PRACTICES 


Group 
Rating of High Low t 
social social 
maturity | maturity 
(n = 12) | (n = 14) 
Real mother 
Dominance (15 items) 7.08 9.64 | 3.01* 
Ignoring (13 items) 3.83 6.36 | 3.18* 
Overprotecting (24 items)| 5.91 9.26 | 2.66* 
Ideal mother 
Dominance (15 items) 5.58 9.71 | 3.97** 
Ignoring (13 items) 3.67 6.07 | 2.97* 
Overprotecting (24items)| 5.83 10.86 | 2.57* 
I 
*p «.05 
5$ i0 
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mother as less dominating, ignoring, and over- 
protecting than the low-social-maturity Ss 
did. The more mature Ss rated their mothers 
as better mothers, or at least as mothers 
lower in these three negative characteristics, 
Interesting, too, is the direction of change in 
rating the ideal mother, who was described to 
Ss in the instructions as “the best mother 
possible." The more socially mature Ss rated 
the ideal mother even lower than the real 
mother. The less socially mature Ss rate the 
ideal mother even higher on these negative 
characteristics, especially on the dimension of 
overprotection. One might infer the statement, 
*My mother was quite overprotecting but an 
ideal mother should be even more so" by the 
less mature S. We feel that the ideal mother 
represents fantasied wish fulfillment and that 
these individuals would have wanted even 
more overprotection and domination than 
their mothers provided. Support for this in- 
terpretation is gained from previous work 
with this same S group by Spencer (1968), in 
which he found that many of the less well- 
adjusted patients have had serious marital 
problems or have been unable to cut depen- 
dent ties with their mothers, some have devel- 
oped hostile-dependent relationships with 
their wives, and some have had continued 
problems with alcoholism—all reflecting in 
one way or another a passive dependent ori- 
entation with a wish to be dominated and pro- 
tected. 

The present study does not indicate 
whether a person is less well adjusted because 
he has been raised by a dominating and ovet- 
protective mother or whether he just sees his 
relationship with his mother in this way be- 
cause he is poorly adjusted. However, the 
previously mentioned studies of hemophiliacs 
lend considerable support to the former in- 
terpretation, that a dominating and overpro- 
tective mother tends to produce the type of 
adult described above—passive and depen- 
dent. If so, preventive work is necessary. Cer- 
tainly, the mother-child relationship in the 
case of genetically determined conditions 
chronic childhood disorders has more of 4 
chance of being problematic than when the | 
child is normal. At least in these trouble-prone 
cases, early counseling with the mothers or 


with both parents could be offered in an at- 
tempt to alleviate feelings of guilt and to 
thwart the development of an overprotective 
relationship. 


REFERENCES 


Actz, D. P. Psychiatric studies of patients with 
hemophilia and related states. Archives of Internal 
Medicine, 1964, 114, 76-82. 

Brownz, J. W., Marry, M. A, & KANE, R. P. Psy- 
chosocial aspecis of aone children and their 
families. American Journal of Orthopsychiatry, 
1960, 30, 730-740. 

GREEN, M., & SoLȚIrT, A. J. Reactions to the threat- 
ened loss of a child: A vulnerable child syndrome. 
Pediatrics, 1964, 34, 58-66. 

Goucu, H. G. Appraisal of social maturity by means 
71, 189-195. 


PSYCHOSOCIAL ADJUSTMENT AND PERCEPTION OF MATERNAL ATTITUDES 


of the CPI. Journal of Abnormal Psychology, 1966,. 


473 


Harris, J. G. A study of the mother-son relationship 
in schizophrenia. Unpublished doctoral disserta- 
tion, Duke University, 1955. 

Horumocsnurap, A. B., & Repricu, F. C. Social class 
and mental illness. New York: Wiley, 1958. 

Mattsson, A. & Gross, S. Social and behavioral 
studies on hemophilic children and their families, 
Journal of Pediatrics, 1966, 68, 952-964. 

Rarworr, O. D. The therapy of hereditary disorders 
of blood coagulation. Archives of Internal Medi- 
cine, 1963, 112, 92-111. 

Spencer, R. F. Incidence of social and psychiatric 
problems in a group of hemophiliac patients. 
North Carolina Medical Journal, 1968, 29, 332- 
335. 

Swrt, C. R., Seman, F., & Srem, H. Adjustment 
problems in juvenile diabetes. Psychosomatic Medi- 
cine, 1967, 29, 555-571. 


(Received May 4, 1968) 


Journal of Abnormal Psychology 
1969, Vol. 74, No. 4, 474-476 


COGNITIVE INTERFERENCE BY NONVERBAL 
SYMBOLS IN SCHIZOPHRENICS 


JONAS LANGER,! KENNETH B. STEIN 
University of California, Berkeley 
B. G. ROSENBERG 
Bowling Green State University 


The degree of cognitive interference evidenced in schizophrenics, as compared 
with normal medical controls, on the Color-Phonetic Symbol Test (a non- 
conventional version of the Stroop Word-Color Interference Test) was as- 
sessed. As expected, the performance of schizophrenics showed significantly 
greater cognitive interference. Two reasons for these results were noted, one in 
terms of general personality concepts and the other in terms of a cognitive 


theory of symbolic processes, 


Phonetic symbols may influence the per- 
formance of a different but related cognitive 
task in ways that parallel the effect of verbal 
symbols (Langer & Rosenberg, 1966). In a 
study modeled after the conventional Word- 
Color Interference Test or WCIT (eg. 
Stroop, 1935), phonetic symbols were printed 
in different colors to create a congruent (C) 
and noncongruent (N) card. On Card C the 
phonetic symbols were always printed in the 
preferred choices, that is, the colors with 
which most Ss matched them, for example, 
zah in red ink and tur in green ink, as 
determined by Langer and Rosenberg (1964). 
On Card N the phonetic symbols were always 
printed in a nonpreferred color, for example, 
zah in green ink and żur in red ink. As in the 
conventional test, the task on the Color-Pho- 
netic Symbol Test (CPST) was to name the 
ink colors as rapidly and accurately as possi- 
ble. It was found that the noncongruent pho- 
netic symbols interfered with the speed and 
accuracy of the performance on Card N as 
compared with Card C. 

The present investigation further tested 
(a) the general assertion that the influence of 
nonconventional symbolic activity may paral- 
lel that of conventional symbolization upon 
cognitive activity, and (6) the particular as- 
sertion that nonconventional phonetic symbols 
may interfere with the maintenance of di- 
rected attention toward the performance of 


1 Requests for reprints should be sent to Jonas 
Langer, Department of Psychology, University of 
California, Berkeley, California 94720. 
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another cognitive task (Langer & Ri 
1966). It was expected that the perfo 
of schizophrenics on the CPST would p: 
their performance on the WCIT. Since 
phrenics evidence greater interference 
normals on the WCIT (Wapner & 
1960), it was hypothesized that they also 
dence greater interference on the CPST. 


METHOD 
Subjects 


Two groups of Ss were tested on the CPSI 
experimental and a control group. The experim 
group consisted of 20 recently hospitalized 
schizophrenic patients (10 males and 10 
The control group consisted of 20 newly 
medical patients (10 males and 10 females) ht 
ized for nonpsychiatric reasons. a 

The groups were very closely matched for” 
(schizophrenics mean age was 35 yr, SD -—19* 
medical! mean was 38 yr, SD —13.6) and ed 
cation (schizophrenics’ mean was 12.6 yis | 
1.66; medicals’ mean was 12.3 yr, S 
Neither the difference in age nor in education 
statistically significant. 


Procedure 


All Ss were tested on the CPST, which 
scribed fully in Langer and Rosenberg (1966): 
Card C, 10 phonetic symbols are printed 
colors with which they were most cons 
matched by a normative group, that is, 
klak, skaf; blue—mumle, oom, sool, mu; green 
ish, nerd. On Card N the same symbols are P! 
in colors with which they were seldom 2 
Each card consists of 10 rows of 10 symbols 
for a total of 100, The measure used was 
of time to name the colors per card. In 0 
balance possible order effects, half of each g! 
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tested on Card C first and then Card N, while the 
other half of each group was tested in reverse order. 


RESULTS 


Both groups of Ss evidenced the expected 
interference effect on the CPST. In all, 39 of 
the 40 Ss tested took a longer time to com- 
plete Card N than C; no S took longer on 
Card C; and 1 medical S took an equal 
time on both cards. This finding serves as a 
further replication of previous findings 
(Langer & Rosenberg, 1966; Stein & Langer, 
1966) that the interference effect of the CPST 
is a general phenomenon. 

Table 1 presents the mean duration by 
groups to complete Cards N and C. As can 
be seen, the mean difference between duration 
to complete the two cards by the schizophrenic 
group was more than twice that of the medical 
control group. 

Table 1 also reveals that the schizophrenics 
were slower than the medical controls on both 
cards. In order to adjust for this initial mean 
duration difference between groups, each S's 
congruent score was used as his base line so 
that his percentage change from C to N could 
be calculated as follows: noncongruent — 
congruent/congruent X 100. The percentage 
mean difference between duration on Card N 
and Card C was then calculated for each 
group, and the percentages are presented in 
Table 2. Multiple ¢ tests were performed. 
Only the predicted difference between schizo- 
phrenics and normals was statistically signifi- 
cant, 


DISCUSSION 


As expected, the noncongruity between the 
phonetic symbols and the colors clearly inter- 
fered with the color-naming performance to 
a significantly greater extent among schizo- 


TABLE 1 
Overatt Mean Durations ON THE CPST 
M (sec.) 
Group 
Noncongruent | Congruent | Difference 

Be rior tun Oe 
Schizophreni 142.2 112.4 29.8 
Medicas | 91.1 77.5 13.6 
Both 116.6 94.9 21.7 


ee) 
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TABLE 2 
PERCENTAGE MEAN DIFFERENCES 
Item M SD t 

Schizophrenics 28.2 18.8 
Medicals 16.7 6.9 den 
Males 243 152 
Females 20.6 151 953 

toC 25.7 16.3 
CtoN 192 134 VAR 


Note.—Since time scores tend to be Positively skewed, the 
percentage time differences were transformed into logarithmic 
scores. It was found that the probabilities for significance 
remained the same. 

* p < .025, one-tailed test. 


phrenics than among normals. Two different 
(but hardly opposing, and possibly comple- 
mentary) reasons may be given to begin to 
explain this comparative phenomenon. 

The first reason is that cognitive interfer- 
ence is directly related to intra- and inter- 
personal effectiveness. Previous studies on 
overt interference on the conventional WCIT 
(e.g, Wapner & Krus, 1960) and covert in- 
terference on the nonconventional CPST 
(Stein & Langer, 1966) have demonstrated 
that high interference is associated with poor 
personal adjustment. Consequently, it is to be 
expected that the low personal effectiveness of 
schizophrenics will be reflected in greater in- 
terference on the CPST, just as it is on the 
WCIT. 

The second reason is more directly cogni- 
tive in nature. It has been previously demon- 
strated that (a) low consensual agreement in 
matching phonetic symbols to their color 
referents leads to increased interference on 
the CPST (Langer & Rosenberg, 1966), and 
(b) the consensual agreement of schizophren- 
ics in matching phonetic symbols with their 
color referents is about half that of normals 
(Rosenberg & Langer, 1965). Low consensu- 
ality has been interpreted to reflect the per- 
son’s relative lack of cognitive competence in 
differentiating and keeping separate symbols 
from their referents, and that should conse- 
quently lead to greater interference of per- 
formance (see Langer & Rosenberg, 1966, for 
a fuller exposition of the argument). Since 
schizophrenics are low consensualizers, it is to 
be expected, therefore, that they will evidence 
relatively more interference than normals on 
th CPST. 
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CORRELATES OF EMPATHY IN PSYCHOTHERAPY: 
A REPLICATION 


ALLEN E. BERGIN 1 ann LAWRENCE G. JASPER 


Teachers College, Columbia University 


This study replicates previous findings regarding the relationship between vari- 
ous therapist personal characteristics and therapist empathy levels during psy- 
chotherapy. It was concluded that (a) there is a moderate inverse relationship 
between therapist MMPI disturbance levels and degree of therapist empathy in 
interviews; (b) there is no correlation between indexes of therapists’ intellectual 
ability or academic achievement and their empathy scores; and (c) there is 
some reason to doubt that Truax Empathy scores are related to patient outcome 


in nonclient-centered therapy. 


An elaborate series of studies has been pub- 
lished in recent years which gives support to 
Rogers’ (1957) theory that therapist empathy 
is a correlate and cause of positive personality 
change. Some of this evidence also suggests 
that low empathy is related to client deterio- 
ration (Bergin, 1963, 1966; Strupp & Bergin, 
1969; Truax & Carkhuff, 1967). 

If these conclusions are correct, an obvious 
question to pose is: “What produces varia- 
tion in therapist empathy? Is it therapist 
personal characteristics, therapist training and 
experience, or client characteristics?" Evi- 
dence to date indicates that all three factors 
are related to level of therapist empathy 
(Bergin & Solomon, 1968; Truax & Carkhuff, 
1967), though no comparison of their relative 
contributions to empathy has been made. 

The present study replicates and extends an 
earlier study in which therapists! personality 
test scores were found to be significantly cor- 
related with empathy levels, as rated from 
tape recordings of treatment sessions (Bergin 
& Solomon, 1968). Both clinical and research 
evidence suggest that personal characteristics 
of therapists affect treatment outcomes and, 
particularly, that personality disturbance in 
the therapist interferes with therapeutic ef- 
ficacy (for reviews see Bergin, 1967; Strupp 
& Bergin, 1969). 


Stupy I 


In Study I (Bergin & Solomon, 1968), 18 
Postinternship students in clinical and coun- 


1Requests for reprints should be sent to Allen E. 
Bergin, Department of Psychology, Teachers College, 
Columbia University, New York, New York 10027. 


seling psychology and 26 of their clients were 
studied. The main data consisted of correla- 
tions between therapists’ MMPI and Edwards 
Personal Preference Schedule (EPPS) scores 
and their empathy levels during therapy, as 
measured by the Truax Accurate Empathy 
Scale. 

It was originally predicted that the MMPI 
D, Pt, Hy, and Pd scales would correlate 
negatively with empathy, and that EPPS 
Nurturance and Intraception would correlate 
positively. Study I (see Tables 2, 3, and 4) 
yielded significant negative correlations be- 
tween therapists’ empathy and their depres- 
sion (D) and anxiety (Pt) scores on the 
MMPI. Scores from the EPPS Consistency 
and Intraception scales correlated negatively 
with empathy while Dominance and Change 
correlated positively. Chronological age and 
supervisor ratings of therapist competence 
also correlated positively, whereas grades and 
Graduate Record Examination (GRE) scores 
were unrelated to empathy. 


Stupy II 
Method 


Study II replicated Study I exactly except for 
the addition of the several measures noted. 


Clients and Therapists 


The clients (2 = 48) were late adolescent and adult 
neurotics who were seen in an intensive psycho- 
therapy practicum during each of 2 academic years 
in the Teachers College Psychological Consultation 

T. 
AE therapists (n= 36) were fourth-year graduate 
students in clinical and counseling psychology who 
had seen a few cases for diagnosis and brief therapy 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS FOR THERAPIST 
VARIABLES: Stupy II 


Variable M SD |Variable| M | SD 
Empathy ratings 3.16 | 2.03 | EPPS 
ACE rating 5.31| 1.30| Con 53.11| 9.95 
rade Average 
Practice H 6.79 | 1.03| Af 48.25| 9.89 
Academic 6.62 .52| Def 44, 8.27 
GRE 
Psychology 598.53 | 66.11 | Ach 61.17 | 6.59 
Quantitative 563.53 | 90.80 | Ord 42.80 | 9.67 
Verbal 653.24 | 75.55| Exh 49,36 | 9.10 
Age 
"3-50 29,08 | 6.19] Aut 54.81 | 9.56 
23-29 25.48 | 1.62| Int 53.42 | 8.01 
MMPI Suc 53.53 | 10.10 
K 55.70 | 6.74| Dom |47.17 | 10.40 
Hs 48.42 | 6.63| Aba 40.77 | 9.15 
57.17 | 9.11| Nur 49.17 | 8.55 
Hy 57.92 | 6.41| Ch, 49.17 | 10.00 
Pd 59,17 | 9.60| En 46,08 | 10.74 
ie, 
ale 33.00 | 4.16| Het 57.44 | 10.34 
Female 43.05 | 2.58| Agg 51.44| 9.33 
Pa 58.42 | 6.44 
Pt $6.08 | 7.63 
Sc 56.86 | 8.86 
Ma 56.56 | 7.94 
i 51.61 | 7.87 
Es 65.17 | 7.13 


* Raw scores. 


for 1 previous year in the same clinic and who also 
had completed a 1-yr. full-time internship. 

Personality. MMPI and EPPS scores were again 
used to assess level of disturbance and general per- 
sonality characteristics. The therapists were not 
highly familiar with the scoring and use of these 
inventories. Special coding and other precautions 
were taken to insure anonymity of respondents, and 
checks on validity of profiles revealed no evidence 
of faking. 

Empathy, Therapists regularly recorded their ses- 
sions and provided 1 or 2 hr. on each of two 
patients? after the request for data was made. Most 
of the tapes thus collected were obtained from 
therapy during the week preceding or succeeding the 
announcement and, therefore, there was little chance 
for a biased selection of tapes being sampled. 

Empathy was rated on the middle one-third of 
each hour sampled on the Bergmin-Solomon revision 
of the Truax Accurate Empathy Scale (see Truax & 
Carkhuff, 1967, p. 59). 

Other. Ratings were also obtained from supervisors 
of the general therapeutic competence of each stu- 
dent, as well as GRE scores, grade-point averages, 
and five therapist self-ratings on their personal intent 
to be empathic, interpretive, supportive, advising, 
and warm. 


Client Measures 


No measures of client behavior or characteristics 
were obtained except for a global outcome rating 


2 Several therapists were unable to provide tapes 
on two patients; therefore, their empathy ratings 
are based on one case only. This is accepted in light 
of the fact that therapist empathy scores across 
patients in the Bergin-Solomon study correlated .98. 
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on part of the sample made by the supervisor after 
discussion with the therapist and the three other 
students involved in that supervisory group. These 
ratings were made on a 7-point scale ranging from - 
“considerably improved" to “considerably worse.” 


RESULTS 


Interrater reliability on the empathy ratings 
was based on 32 therapy hours which were = 
rated by two judges independently. The 
Pearson correlation between ratings was .88. 

The prime object of the replication was to 
cross-validate the MMPI and EPPS findings 
from the Bergin-Solomon study. Table 1 in- 
cludes means and standard deviations for all 
variables, Table 2 shows the correlations be- 
tween empathy and the therapist personality” 
scores which were the focus of the replication, 
and Table 3 presents similar data for all other 
personality measures. 

Table 2 reveals a rather clear replication 
of the original MMPI data. The D and Ph 
scales correlate negatively and significantly 
with empathy. The EPPS correlations do not" 
replicate in any respect, although several new 
correlations appear (Table 3). It may be safe - 
to assume that in both instances the EPPS | 
correlations were simply random fluctuations. 
It is worth noting that in both Study I and 
Study II, practically all MMPI pathology, 
scales correlated negatively with empathy 
even though they were not significant. 

Table 4 includes correlations of other. 
variables with empathy. 


TABLE 2 


SIGNIFICANT PERSONALITY CORRELATES OF Empatay 
FROM STUDY I AND REPLICATED CORRELATIONS 
FROM Stupy II 


Test Study Ir Study II f 
MMPI* 

D —.41* —.31" 

Pt — 540 —ague 
EPPS» 

Con =.54" .03 

Int — 538% 01 

Dom oret 46 

Chg 550 44 
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TABLE 3 


PERSONALITY MEASURES UNCORRELATED WITH EM- 
PATHY IN STUDY I, AND Srupv II CORRELATIONS 
FOR THE SAME VARIABLES 


Test Study Ir Study II r 
MMPI* 

K 15 04 
Hs = 31 —.24 
Hy —.30 —.20 
Pd 10 —.23 
Mf 37 

Male (n= 16) 13 

Female (x =20) 34 
Pa —.05 .03 
Sc —.05 —.11 
Ma .32 —.07 
Si —.18 ESTA 
Es .08 .32 

EPPS? 

Aff —.18 — .38* 
Def =123 —.22 
Ach —45 .34* 
Ord —.41 .07 
Exh —.2 .06 
Aut :31 01 
Suc 00 .02 
Aba —.12 —.37* 
Nur —.9 127 
End .08 A7 
Het —.05 .08 
Agg 04 —.02 


^n = 12 in Study I, m = 36 in, Study II. 
n = 13in Study I, 1 = 36 in Study II. 
05, two-tailed. 
** p <,02, two-tailed. 


Outcome Measures 


For a limited number of therapists (» = 20) 
and patients (m= 24) measures of patient 
improvement were available, as described in 
the Method section, and were found related 
neither to empathy (r = .05) nor to therapist 
Personality characteristics (D, = —.05; 
Pt, r = .08). 


Discussion 
Personality Characteristics 


„The most important finding is that thera- 
Pists’ Depression (D) and Anxiety (Pt) cor- 
relate negatively with their empathy in the 
live therapeutic process, This is clearly in 
keeping with the common viewpoint that 
therapist personal disturbances interfere with 
therapeutic effectiveness. While we cannot yet 
Set up cutoff scores for selecting out or identi- 
fying therapists whose personal disturbances 
interfere with clinical performance, we have 
established that there is a moderate inverse 
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relationship between level of disturbance and 
level of competence in one important dimen- 
Sion of therapeutic behavior. This confirms 
the importance of focusing upon therapist 
characteristics in the study of treatment proc- 
esses and outcomes. Provided that further 
studies substantiate the assumed relationship 
between empathy and outcome, such inquiry 
is likely to clarify the causes of the large 
variation in observed therapeutic effects, 
which includes both positive changes and sig- 
nificant deterioration among therapy patients. 

The failure of EPPS scores to correlate 
with in-therapy empathy ratings may be due 
to a general limited validity of this instrument 
or to its lesser relevance to therapeutic phe- 
nomena. We cannot interpret the fluctuating 
correlations with this inventory as being other 
than due to chance. 


Intellectual Characteristics 


The lack of relationship between empathy 
and indexes of academic and intellectual 
competence are not surprising. They simply 
reaffirm the almost universally held view that 
scholastic abilities are not very relevant to 
effectiveness in establishing therapeutic rela- 
tionships with clients. The nonsignificant re- 
sults therefore have considerable psychological 
significance, 

Intellectual capacities may, of course, be 
of value in developing expertise in other 


TABLE 4 
OTHER CORRELATES OF EMPATHY 
Study I Study II 
Variable 
r N r N 
41* | 18 | —.40 18 


Supervisor rating 
Grade average | 
Diagnostic practice 


Academic —46 | 18 07 | 35 
E 

arme —48 |17]| .335| 34 

Quantitative 21 | 18 | —.02 | 34 

Verbal —30 | 18 AL 34 
Age 

45 |17]| —.04 23 

2529 .50* | 11 | —.15 36 


Hours of diagnostic and 
therapeutic experience 


* Not available. 
*p < .05, one-tailed, 
** p <.05, two-tailed. 
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forms of therapy, and their lack may be 
critical below a certain low level rarely found 
among graduate students. It may also be 
that the range of grades and intelligence 
for these individuals was restricted enough to 
reduce the possibility of obtaining significant 
correlations. 


Outcome Data 


The failure to find any correlation between 
empathy scores and outcome ratings runs con- 
trary to the large number of studies published 
by the Truax group. At first, this might be 
interpreted as being due to the crudity of the 
present outcome ratings; however, the same 
ratings yielded significant findings in a co- 
ordinate study by Feitel (1968). Using the 
same sample of therapists and patients, she 
found that patient self-ratings of feeling 
understood by the therapist correlated .59 
with outcome and that two patient ratings of 
therapist behavior from the Barrett-Lennard 
Relationship Inventory also related to out- 
come (Empathic Understanding, .34; Regard 
plus Unconditionality, .77). None of these 
ratings was correlated with Truax Empathy 
scores. 

These findings raise at least tentative ques- 
tions about the generalizability of Truax’s 
findings to nonclient-centered therapy, and 
they suggest that his procedures should be 
replicated in full by independent investigators 
on new samples. Truax did attempt a replica- 
tion in cooperation with the Jerome Frank 
group at Johns Hopkins Hospital in Balti- 
more (Truax, Wargo, Frank, Imber, Battle, 
Hoehn-Saric, Nash, & Stone, 1966). However, 
the results of that study actually were quite 
equivocal in spite of contrary assertions. We 
are presently investigating this issue in our 
own center. 


Additional Issues 


It is evident that in both the original sample 
and in the replication we have obtained a 
few significant results from a large number 
of correlations computed on small ws. Ordi- 
narily the meaning of such results would be 
ambiguous and their value small; however, 
there are good reasons for believing that 
the data have psychological significance: (a) 
Many of the correlations were computed 
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simply out of curiosity and were uness 
to the main task, for example, of the 56 
relations with the MMPI and EPPS onl 
were technically required by the design, 
test the original hypotheses and 6 mo 
cross-validate the original correlations. Ti 
thought useful to present a full correl 
table, but it is worth noting that 2 ol 
original 6 relationships held up under re 
tion. (b) Those findings which did hold 
very good sense theoretically and they ay 
on the MMPI, which has consistently: 
related with various aspects of the 
peutic process (Strupp & Bergin, 1969). | 
In addition, many of the zero correlatiot 
of obvious importance. They are in 
with the views of many professionals (ej 
relationship between GRE and Empa 
and should not be considered as neci 
detracting from the significance of the re 
Several additional variables concerni 
experience level, therapist and superviso 
entation, and supervisor evaluation 0} 
therapist were studied. While a numb 
suggestive leads occurred in these resul 
unambiguous findings occurred and thus th 
are not reported. Possible causes of the 
sults and of the only modest correlations! 
curring in the main data are (a) thal 
Truax Empathy scale is currently only 4 
crude way of approximating how therap 
actually behave and that it may not adeq 
Measure types of empathic communi 
which occur outside the client-centered fi n 
work, (5) that the therapist sample was 
tively homogeneous, and (c) that some va 
ables were thus restricted in range. l 


is 
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To test the assumption that attitudes toward deviant behavior could be 
influenced by the degree of disturbance as well as understanding of the be- 
havior, 30 Ss evaluated individuals manifesting various degrees of deviant 
behavior. Another group of 30 Ss evaluated the same individuals, but were also 
provided with developmentally relevant material to help them understand the 
deviant behavior. The results indicated that Ss in both groups tended to react 
negatively to individuals manifesting more deviant behavior patterns. Although 
the availability of developmentally relevant material increased the understanding 
of the deviant behavior, the negative evaluations remained unchanged. Some 
minimal evidence is provided for the possibility that an understanding of 
moderately deviant behavior discourages the use of the “mental illness" label. 


In recent years workers in the mental health 
professions have shown a growing interest in 
the public's perception of *mental illness." Of 
particular concern is the consistent negative 
attitude permeating the public opinion of de- 
viant behavior (Nunnally, 1961; Sarbin, 
1967; Szasz, 1961), along with the view that 
mental illness represents a “disorder without 
a cause" (Cumming & Cumming, 1957; 
Starr, 1955). To the extent that deviant be- 
havior is viewed as being strange, unmoti- 
vated, and incapable of being understood, it 
may only serve to add to the deviant indi- 
vidual’s self-evaluation, as well as a feeling 
that because he has some sort of “mental 
illness,” there is little that he can do to 
change his own behavior. 

The generally assumed deleterious effect of 
the negative public attitude on the stability 
of the discharged mental patients’ readjust- 
ment to society has led to the study of factors 
which might be relevant to the induction of 
a more favorable attitude toward deviant and 

1 Requests for reprints should be sent to Marvin 
R. Goldfried, Department of Psychology, State Uni- 


versity of New York, Stony Brook, Long Island, New 
York 11790, 


? The authors would like to thank Barry Sherman 
for many of his comments during the early phases 
of this study. 


maladaptive behavior. For example, Rothaus, 
Hanson, Cleveland, and Johnson (1963) 
found that a “problems in living” set (cf. 
Szasz, 1961) in employment interviews led 
to a higher likelihood of employment for 
patients than did a mental illness set. 

While studies such as those by Rothaus et 
al. demonstrate a practical consequence of a 
non-mentalillness set, a more prevalent as- 
sumption has been that an increase in infor- 
mation concerning deviant behavior could be 
effective in fostering a more favorable attitude 
toward the mental patient. A phenomenon 
often observed in clinical settings is the tend- 
ency to become more sympathetic and less 
negative in one's evaluation of deviant an 
maladaptive behavior once the causes of these 
behaviors are known. Similarly, Yarrow, 
Schwartz, Murphy, and Deasy (1955) have 
noted that the wife of a mental patient WES 
cally attempts to “normalize” her husband’s 
deviant behavior by looking for reasons under- 
lying such behavior, and Starr (1955) has 
further suggested that as long as behavior 8 
made intelligible, there is a tendency to regar! 
deviant behavior as something other t 
mental illness. E 

It seems, however, that while increased in 


formation may induce a change in public 
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reaction to the mental patient, the nature of 
this attitudinal change has not yet been 
determined. For example, Altrocchi and 
Eisdorfer (1961) have shown that although 
completion of a course in abnormal psychol- 
ogy leads to more understanding of the mental 
patient, it does not induce a more favorable 
attitude toward him. This lack of attitudinal 
change is attributed to the fact that Ss in 
their study were all college and nursing stu- 
dents, and consequently were fairly well- 
informed prior to the course. 

In an attempt to improve attitudes toward 
people displaying deviant behavior, Nunnally 
(1961) varied the label that was used to 
introduce the description of these behaviors. 
Interestingly enough, he found that semantic 
differential ratings of a description of the 
same person would vary, depending on 
whether he was labeled as “an insane man,” 
“an emotionally disturbed man,” or simply 
“a man,” 

In contemporary discussions of the concept 
of mental illness, a growing dissatisfaction 
with the medical model has become most evi- 
dent (Maher, 1966; Szasz, 1961; Ullmann & 
Krasner, 1965). The conceptualization of de- 
viant behavior as the manifestation of an 
underlying disease, in addition to being ques- 
tionable from a scientific, explanatory view- 
point, may have negative effects on the 
public's attitudes toward this deviant behav- 
ior by making it more incomprehensible. By 
way of contrast, we would expect that the 
psychological model of maladaptive behavior, 
Which emphasizes the role of social-learning 
history in the explanation of concurrent be- 
havior patterns—adaptive as well as mal- 
adaptive—could very well add to the public's 
understanding as well as a more sympathetic 
evaluation of deviant behavior. 

The purpose of the present study was two- 
fold: (a) to determine whether or not the 
knowledge of developmentally or historically 
relevant information about a person manifest- 
ing deviant behavior would tend to change 
one’s perception of him, and (5) to extend 
Nunnally's findings by studying the attitudes 
toward individuals who are not labeled in any 
Way, but who manifest varying degrees of 
behavioral disturbance. 


METHOD 
Subjects 


The Ss consisted of 60 adult residents of Rochester, 
New York, who responded affirmatively to a tele- 
phone contact, in which it was explained that a 
survey was being conducted to study people’s opin- 
ions about emotional disturbance. The Ss were told 
that they would be required to read some brief 
descriptions of people and answer some questions 
about them, If they agreed to participate, the ques- 
tionnaire and a self-addressed, stamped return en- 
velope were mailed to them. The names were ran- 
domly selected from males listed in the telephone 
book. This was done to avoid the potential non- 
random bias (e.g., unmarried, divorced, etc.) of 
females who are likely to have telephone listings. 
On the average, Ss used in the study were in their 
midforties and had completed slightly more than 13 
yr. of education. Most’ of them were married and 
employed in jobs which were primarily skilled or 
professional in nature. 

The Ss completed and returned one of two forms 
of the questionnaire. Thirty Ss were sent a question- 
naire containing only a description of individuals, 
while the remaining 30 completed a questionnaire 
in which historically relevant material was added to 
the description of the behavior. Both questionnaires 
are described in detail later. Five Ss in each of the 
two conditions failed to complete the semantic dif- 
ferential ratings, although they did respond to other 
portions of the questionnaire. Consequently, some of 
the analyses were based on Ns of 30 in each condi- 
tion, while in others, the N was 25. 


Measures 


The questionnaires consisted of an explanatory 
introduction—asking S to give his reactions to 
“descriptions of various people”—followed by six 
case vignettes, each describing the behavior of a 
different person. The vignettes were shortened ver- 
sions of case histories found in Zax and Stricker's 
(1963) collection and were chosen from a pool of 17 
histories which had been rated by eight clinical psy- 
chologists on a 7-point scale of degree of disturbance. 
The six vignettes used were the most representative 
of the two descriptions at each of the high, medium, 
and low degrees of disturbance. 

In order to study the effect of historically relevant 
material on one’s reaction to deviant behavior, two 
forms of the questionnaire were used. One form 
contained vignettes having only the description of the 
deviant behavior of particular individuals; this is 
referred to as Form D. The other form, in addition 
to providing the description of the behavior, also 
included material which was historically relevant to 
the behavior described; this is called Form D-H. 

Following each vignette, the questionnaire con- 
tained a series of nine semantic differential scales, 
selected to represent an evaluative factor (Osgood, 
Suci, & Tannenbaum, 1957) and an understandability 
factor (Nunnally, 1961). The scales used for the 
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evaluative factor included effective-ineffective, good- 
bad, pleasant-unpleasant, sincere-insincere; for the 
understandability factor, the scales were familiar- 
strange, predictable-unpredictable, simple-complicated, 
understandable-mysterious, and straight-twisted. In 
addition, two previously used (Nunnally, 1961) and 
relevant scales (safe-dangerous and  healthy-sick) 
were included, even though their factor loadings were 
indeterminate. The Ss were also asked to rate each 
individual described as to personal liking and degree 
of disturbance, as well as to judge whether the 
person had “some sort of mental illness." 9 


RESULTS 


The semantic differential ratings, as well as 
the ratings of likeability and disturbance for 
the two vignettes at each level of disturbance, 
were averaged. Hence, the data for each S 
consisted of a series of ratings at each of 
the low, medium, and high levels of disturb- 
ance. The study employed a 2 X 3 Type I 
“mixed” design (Lindquist, 1953) where the 
effect of historically relevant material (D ver- 
sus D-H) as well as level of disturbance (low 
versus medium versus high) on Ss’ evaluation 
of deviant behavior were studied. 

The results of the analysis of variance for 
the semantic differential ratings revealed that 
the historically relevant information provided 
for each case significantly improved Ss under- 
standing of the deviant individuals de- 
scribed in the vignettes (F — 7.08, df — 1/48, 
p < .05). The background information, on 
the other hand, did not affect Ss’ positive 
evaluation of the individuals described, nor 
did it produce any changes on the safe- 
dangerous or healthy-sick scales. 

The level of disturbance reflected in each 
vignette had a striking effect on the semantic 
differential ratings. Regardless of whether or 
not background information was provided, the 
findings clearly point to the fact that with 
increasing levels of disturbance, the indi- 


3 A description of the vignettes appearing in Forms 
D and D-H, the rating scales of personal liking and 
degree of disturbance, and the analysis of variance 
summary tables may be obtained from the senior 
author or from the National Auxiliary Publication 
Service. Order Document No. 00550 from National 
Auxiliary Publications Service of the American Soci- 
ety for Information Science, c/o CCM Information 
Sciences, Inc., 22 West 34th Street, New Vork, New 
York 10001. Remit in advance $3.00 for photocopies 
or $1.00 for microfilm and make checks payable to: 
Research and Microfilm Publications, Inc. 
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viduals described in the vignettes were sig- 
nificantly less understandable (F — 138.35, 
df = 2/96, p< .001), evaluated less posi- 
tively (F = 86.9, df= 2/96, p< 001), 
and considered more dangerous than safe 
(F = 137.22, df = 2/29, p < .001), and more 
sick than healthy (F = 97.49, df= 2/29, 
p < 001). 

The analysis of Ss' ratings of disturbance 
and likeability indicated that although the 
historically relevant background information 
did not affect these ratings, level of disturb- 
ance did. Hence, the more deviant indi- 
viduals described in vignettes were seen by 
Ss as being both more severely disturbed 
(F = 271.29, df = 2/116, p « .001) and per- 
sonally less likeable (F = 89.24, df = 2/116, 
$ < 001). 

Following each vignette, S was asked 
whether or not the individual described had 
“some sort of mental illness.” The D and 
D-H groups were compared with respect to 
their response to this question by means of 
six chi-square analyses, one for each of the 
vignettes. Five of these comparisons failed 
to reach statistical significance. A significant 
chi-square was obtained with a vignette rep- 
resenting a medium level of disturbance, indi- 
cating that the availability of historically rele- 
vant information made it less likely that the 
individual described would be judge as being 
“mentally ill” (y? = 5.93, df = 1, p < 02). 


Discussion 


The results of this study indicate that the 
availability of certain information, in this 
case, developmentally relevant material, in- 
creases the public’s understanding or compre- 
hension of displayed deviant behavior, while 
leaving unmodified an essentially negative 
evaluation of that behavior. In a sense, the 
present study succeeded in rendering more 
comprehensible what continues to be view 
as bad, unpleasant, ineffective behavior. i 

When presented with the descriptive b 
gnette, each of which represented one of thr ds 
predetermined (by clinician's ratings) wi 
of disturbance, Ss were able to differentia! à 
accurately the behavioral descriptions accor f 
ing to their own estimates of level of Ad j 
ance. Regardless of whether or not develoP 
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mentally relevant material had been provided, 
however, Ss were less understanding of and 
reacted with a uniformly negative reaction to 
those individuals manifesting higher levels of 
disturbance. 

The investigators had originally expected 
that inasmuch as mental illness is typically 
viewed as a disorder whose cause is unknown 
or vague, the existence of mental illness 
might be ruled out in those instances in which 
there was some understanding of the reasons 
for the devant behavior. However, only the 
case of one vignette—a behavioral description 
representing a medium level of disturbance— 
was the developmentally relevant informa- 
tion able to minimize the tendency to apply 
the mental illness label. An intriguing, al- 
though admittedly preliminary explanation, 
may reside in a possible interaction between 
the level of disturbance and information on 
developmental antecedents, such that the at- 
tribution of mental illness to an individual 
displaying deviant behavior may be modified 
more readily in those instances where the 
behavior is only moderately deviant. 

It might be emphasized that this study has 
attempted to alter the relationship between 
the deviance of certain behaviors, various 
judgments of that behavior, and attitudes 
toward the person who is engaging in these 
behaviors. Although the experimental manipu- 
lation was successful in providing a better 
understanding of the deviant behavior, the 
negative evaluations of the individuals were 
unchanged. It is possible that a more thorough 
understanding of the behavior is needed to 
bring about a more sympathetic and positive 
evaluation. Thus, a stronger treatment effect— 
involving more detailed information about 
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developmental history and/or information 
about concurrent situations which may have 
elicited the deviant behavior—would appear 
to be worth pursuing in future research. 


REFERENCES 


Avrroccut, J., & E1sporrer, C. Changes in attitudes 
toward mental illness, Mental Hygiene, 1961, 45, 
563-570. 

Cumme, E, & Cummine, J. Closed ranks. Cam- 
bridge, Mass.: Harvard University Press, 1957. 
LiwpQuisr, E. F. Design and analysis of experiments 
in psychology and education, Boston: Houghton 

Mifflin, 1953. 

Mauer, B. A. Principles of psychopathology, New 
York: McGraw-Hill, 1966. 

Nunnatty, J. C., Jn. Popular conceptions of mental 
health. New York: Holt, Rinehart & Winston, 
1961. 

Oscoop, C. E, Suci, G. J., & Tannensaum, P. H. 
The measurement of meaning. Urbana: University 
of Illinois Press, 1957. 

Rormaus, P, Hanson, P. G., Crevrrawp, S. E, & 
Jounson, D. L. Describing psychiatric hospitaliza- 
tion: A dilemma. American Psychologist, 1963, 18, 
85-89. 

Sarsiy, T. R, On the futility of the proposition that 
some people be labeled “mentally ill.” Journal of 
Consulting Psychology, 1967, 31, 447-453. 

Srarr, S. A. The public's ideas about mental illness. 
Paper presented at the meeting of the National 
Association for Mental Health, Indianapolis, No- 
vember 1955. 

Szasz, T. S. The myth of mental illness. New York: 
Hoeber, 1961. A 

Urrmann, L. P. & Krasner, L. Case studies in be- 
havior modification. New York: Holt, Rinehart & 
Winston, 1965. 

Yarrow, M. R., SCHWARTZ, C. G., MunPEY, H. S5 
& Deasey, L. C. The psychological meaning of 
mental illness in the family. Jourmal of Social 
Issues, 1955, 11, 12-24. 

Zax, M., & Stricker, G. Patterns of psychopathol- 
ogy. New York: Macmillan, 1963. 


(Received May 31, 1968) 


Journal of Abnormal Psychology 
1969, Vol. 74, No. 4, 486-489 


ATTITUDES TO BODY 


NORMAL SUBJECTS* 


MICHAEL 


Kent State University 
W. DONALD ROSS, RICHARD KURTZ,’ anb GOLDINE C. GLESER 


University of Cincinnati 


Attitudes of normal Ss toward body 


semantic differential. Ss of various ages, from different educational and 
socioeconomic backgrounds, were studied. Sex and race differences were also 


investigated. The results indicated that 


products significantly, and that the scaling of these products was consistent, 
and did not vary among the groups more than could have been expected 
by chance. The ordering of these body products and the absolute scale values 


assigned them were consistent, It was 


Kubie’s theoretical paper which anticipated the universality of such attitudes 


and their hierarchial ordering. 


The importance of attitudes toward the 
body has been studied in psychosexual devel- 
opment (Federn, 1952; Freud, 1931, 1933) 
among psychotic patients (Fisher, 1958; 
Witkin, 1962) and with healthy Ss (Traub & 
Orbach, 1964; Wapner & Werner, 1957). 
There are no empirical data, however, report- 
ing the attitudes of healthy Ss toward specific 
body products such as excreta, tears, saliva, 
blood, etc. Kubie’s (1937) “Fantasy of Dirt” 
seems to be the only paper which discusses 
such attitudes and is based upon clinical 
psychoanalytic study. 

The purpose of this paper is to examine 
empirically the attitudes of normal Ss toward 
body products. 


METHOD 


Attitudes were measured with the semantic dif- 
ferential (Osgood, Suci, & Tannenbaum, 1957). The 
dimensions of evaluation, potency, and activity were 
sampled by three bipolar scales. Each S responded to 
21 body products which, together with the form of 
the semantic differential used, are listed in Figure 1. 


lThis work was supported in part by United 
States Public Health Service Grants 5T1 MH-8018 
and 5-SOI-FR-5408-5. The authors wish to thank 
E. F. Buyniski, John Terlesky, Jan Robinson, and 
Mary Ann Hertz for their assistance in collecting 
these data. 
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PRODUCTS AMONG 
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products were measured with the 


each group differentiated these body 


concluded that the data supported 


The results from a pilot study suggested that @ 
single factor* accounted for most of the communal 
variance (Harmon, 1960). Additionally, the scaling” 
of these body products was found to be highly reproz- 
ducible with generalizability coefficients in the 905) 
(Cronbach, Rajaratnam, & Gleser, 1963). It wasi 
therefore decided to explore the attitudes toward 
body products of normal Ss representing wider age, 
educational, and socioeconomic levels. The Ss were. 
unselected job applicants at a large factory. The age 
range of the 205 Ss was 20-62 yr.; their educational 
level ranged 8-16 yr.; each of the five social classes 
described by Hollingshead and Redlich (1958) wasi 
represented. 

This sample was divided into the following sub- 
groups: six groups of relatively equal size of Caucar; 
sian males, below or above the age of 30, belong d 
to either a “high,” “low,” or “middle” social class), 
39 Caucasian female Ss who were compared with a 
group of male Ss individually matched for age, 
education, and social class; and a group of 11 Neg a 
Ss who were compared with a group of individually 
matched Caucasian Ss, The data were analyzed 
bear upon the following two questions: 


1. Is there, within any given subgroup, 4 b 
sistent scale value assigned to these body products? 
In other words, are Ss consistent in the orderi 
of these body products and in the absolute scor 
ascribed to them? J 

2. Is the ordering and the absolute scores a 
these body products consistent among different SUD. 
groups, so that a single set of score values can 
taken as representative of the attitudes of these bo 
products for the population in general? 


jus the 


4The score for the evaluative dimension pl re for 


score for the activity dimension minus the sc 
the potency dimension. 
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Concept 
Presh® PR See Py el iHe c ie Ta a ea Stale 
Thin** Laaa a eheu de c4 IB Thick 
Pastres E ae S ra E E a Slow 
Sour* ADO S Ra EA A ao is Sweet 
Hard** Li eS 0 DBWST 5080 PONE SU ONES A Soft 
DuTINE* noun. dee Pide ETit Cause USSE AERE Sharp 
Clean* EE URSUS b WEES AN ENR NU E00 TI ur oun Dirty 
Light** [TE Oe hs usb ee a Heavy 
Hot*** : ; E 5 1 H : Cold 


il Evaluative dimension 


fti Potency dimension 


*** Activity dimension 

Ear wax Dandruff Tears 
Desquamated cells Face Sweat Pus 

between the toes 

Nose pickings Saliva Phlegm 

Hair Blood Nail clippings 
Feces Vomit Pubic hair 
Semen Menstrual discharge Body sweat 
Human's milk Blackheads Urine 


Fic. 1. Form and order of the semantic differential scales and list of body product concepts. 


RESULTS 


The differentiation of these body products 
was very high for each subgroup. The data 
for each subgroup were analyzed by means 
of analysis of variance, and the values were, 
Without exception, less than .001. An addi- 
tional indication of the consistent scaling be- 


havior of each subgroup was given by the 
estimated generalizability of the scores given 
by a single member of a given subgroup to 
the scores of the population that the sub- 
group represented. These reliability coeffi- 
cients ranged from .37-.53 for a single indi- 
vidual. The equivalent index of the extent to 
which the obtained scaling could be repro- 
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TABLE 1 
AvERAGE Bopy Propucr SCORES BY DIFFERENT GROUPS 
Groups 
Caucasian males 
Age 20-30 Above age 30 
Prdy products = Females | Negroes 
Social class Social class W = 34) a 
I-II III IV +V. 
} Gets N Yn N y 33) | (N as 24) | (N = 20) | (N = 28) 

5.16 4.95 5.04 5.43 5.08 5.01 5.31 4,92 
Eo milk 5.24 5.08 5.02 5.31 4.95 4.97 547 d 
Blood 4.74 4.90 4.78 4.88 4.87 4.87 4.80 A 
Hair 441 4.76 4.70 4.62 4.77 4.50 4.97 oy 
Saliva 4.19 4.38 445 4.65 445 4.12 4.49 Adi 
Semen 4.65 4.30 4.28 4.54 4.26 4.02 442 js 
Face sweat 4.23 3.88 4.25 4.40 4.05 4.08 4.24 Hi 
Urine 446 4.05 4.02 4.29 4.12 4.06 4.55 SU 
Pubic hair 3.92 3.96 3.91 3.97 3.85 3.78 4.11 10 
Body sweat 4.04 3.76 3.72 4.03 3.80 3.88 3.99 $5 
Menstrual discharge 3.64 3.76 3.47 3.82 3.60 3.60 3.95 5 
D ted cells 
Between the is 3.29 3.26 3.68 3.62 3.51 3.54 3.32 2 
Feces 343 3.57 3.42 3.29 3.28 3.86 3.50 376 
Phlegm 3.37 3.48 3.50 3.35 3.23 3.35 3.39 358 
Nail clippings 3.34 3.30 3.34 3.44 3.46 3.35 3.43 322 
Dandruf 3.21 3.31 3.42 3.59 3.28 3.51 3.30 19 
Ear wax 3.30 3.21 3.36 3.39 3.38 3.28 3.34 $09 
Vomit 3.04 3.19 3.00 3.45 3.28 3.36 3.30 331 
Nose pickings 3.40 2.95 3.24 3.33 3.08 3.32 3,24 325 
Pus 2.98 3.18 3.14 341 3.08 2.97 3.23 290 
Blackheads 2.91 2.72 2.90 3.10 2.85 3.09 3.06 . 


Note,—I-V = the five social classes described by Hollingshead and Redlich (1958). 


duced by other comparable subgroups ranged 
.90—.97. 

The comparability of the scale values as- 
signed these body products by each of the 
subgroups was determined by comparing the 
variability of assigned scale values between 
the subgroups with the variability within each 
of the subgroups. In no case was the vari- 
ability between subgroups greater than could 
have been expected from sampling fluctua- 
tions. The implication of this result is that 
the intraclass correlations among these body 
product concept scores, at least for the popu- 
lation(s) sampled, would be perfect (1.00). 
Since no differences were found between these 
subgroups, they may be assumed to come 
from the same population, providing at least 
partial support for the “universality” of 
attitudes toward these body products. 

In Table 1, the body products have been 
ordered (from high to low) on the basis of 


their score from all Ss. The score assigned 
each body product by each subgroup 1s given 
in original semantic differential units. It 3 
readily apparent that both the ordering Ei 
the absolute scores for each body pubs 
remained markedly consistent among the a 
ferent groups. Tears and human’s milk a 
ways received the highest score, while be 
and blackheads were always among the pe 
ranked body products. The other bo s. 
products also retain their ranks, with ben 
sional movement up (or down) one posito 
in the order. A 

An additional observation from Table 1 5 
the consistent use by all Ss of the lower p 
of the semantic differential scale. In i 5 
words, although the semantic differentia E 
mits a score from one to seven, most Ss e 
only the lower end of the scale, thus em " 
ing negative attitudes toward most 
products. 
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Discussion 


The major finding of this study is that 
there does indeed exist a consistent hierarchy 
of attitudes toward body products. Kubie’s 
theoretical paper anticipated correctly the 
universality of such attitudes and their hier- 
archical ordering. The high correlation which 
has been reported between evaluative factor 
loadings and social desirability ratings of 
semantic differential scales (Ford & Meisels, 
1965) lends further support to Kubie’s 
formulation. 

The meaning of this hierarchy of body 
products is still relatively obscure. One pos- 
sibility is that these body products were 
ordered along a body-alien to body-syntonic 
continuum. In other words, body products 
may be perceived as “normal” components 
of on-going body functions (e.g., blood, tears, 
saliva) or may be discarded or somehow 
sloughed off (e.g., nail clippings). The identi- 
fication of the more specific propreties of body 
products (e.g., their odor, color, consistency, 
etc.) which are of major significance remains 
as an area for further investigation. 
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CERTAIN INITIAL VARIABLES AS PREDICTORS OF 
CHANGE WITH CLASSICAL PSYCHOANALYSIS ' 


H. ARONSON ? 


Department of Psychiatry, 
Temple University Health Sciences Center 


The full case loads of 28 analysts were studied to determine predictors of 
change with psychoanalysis. Information was obtained by questionnaire from 
the treating analysts. It was found that, while initial complaints are of little 
value in predicting success in analysis, they do relate to dropout from treat- 
ment. Motivational factors such as breadth and degree of discomfort, external 
stress, and chronicity of disorder relate to both success in analysis and dropout 
from treatment. It was concluded that, as long as symptoms are not extremely 
pathological, improvement in and commitment to analysis is determined by the 
pressure the disorder places on the patient. 


Classical psychoanalysis is a form of ther- 
apy that has had, despite a long period of 
widespread interest and attention, little ex- 
perimental scrutiny. As a result, many beliefs 
and impressions have become established over 
years of application; yet, few controlled stud- 
ies exist to provide even the simplest requisite 
data upon which to base experimental valida- 
tion. 

Since analysis demands a relationship with 
considerable commitment on the part of both 
patient and therapist, it is difficult to manipu- 
late variables externally. Those interested in 
research on this form of psychotherapy must, 
therefore, often make a difficult choice. They 
must either rely upon lengthy interpolation 
of laboratory results or use epidemiological 
methods and work without the safeguards of 
random procedures or controls. We have 
chosen the second alternative. 

People screened as being suitable for analy- 
sis are far from a random group of applicants 
for psychotherapy. Should it be possible, 
within the rather narrow range of patients 
already acceptable, to specify variables that 
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predict greater or lesser improvement, 
predictive variables would have both 
retical and practical value. j 

Tn other reports we detailed our meth 
data collection and tabulated the desc 
(Weintraub & Aronson, 1968) and 
(Aronson & Weintraub, 1968b) backg 
of patients who were the Ss of our inv 
tion, Then we began to examine the cl 
that could be measured as occurring 
course of analytic treatment (Aroni 
Weintraub, 1968a). The present paper 
focus upon the predictive value of i 
variables. Our goal is to determine, usii 
formation available early in analysis, 
patients who could be expected to improv 
well as to distinguish those who are likel 
remain from those who are apt to leave 
sis short of completion. 


METHOD 
Procedure 


It is important to note that our work has 
based upon the assumption that groups of ý 
each studied at a different longitudinal point 
analysis, would approximate the changes that m 
occur in a hypothetical individual as he pr 
through his analytic treatment. Thus, W 
measure discrete intervals of analysis, we Y 
cuss our findings in terms of curves that 
the effects of time. , 

Our data indicate a highly significant corii 
between improvement on each of the scales hos 
measure change and duration of analysis. AS 
seen in Figure 1, a hypothetical curve of 
ment would be almost identical for both th 
tients who remained in analysis and those ¥ 
analysis short of completion. Figure 2 
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LES gy PATIENT CONTINUING 
i) THROUGH COMPLETION 


SRS PATIENT ENDING ANALYSIS 
PRIOR TO COMPLETION 


MEAN — 80 
CHANGE N 
FROM 60 NI 
INITIAL 
STATUS: 49 
20 
0 
«I l- <3 3-5 >5 


YEARS IN PSYCHOANALYSIS 


Fic. 1. Mean achievement of potential improve- 
ment in patients continuing through completion of 
psychoanalysis and in patients leaving analysis short 
of completion, 


however, that those patients who left prior to com- 
pletion included a majority of those who became 
worse. Our hypothetical improvement curve seems to 
take a steady (learning) rather than a sudden (in- 
sight) incline with the greatest increment in the 
period between the second and fourth years. Our 
focus here will center upon those variables that may 
relate to greater learning with analysis. 

In examining the predictive value of initial varia- 
bles, we found the most obvious, that is, diagnosis, 
to have little relation to the rate of improvement 
(Aronson & Weintraub, 1968a). However, because 
diagnoses are rough estimates, difficult to establish 
with reliability, it appeared reasonable to use more 
refined means to designate initial status. This be- 
came the basis of our continuing investigation. The 
variables that will be examined here are: the kind 
and number of presenting symptoms and/or com- 
plaints, the chronicity of such symptoms, the degree 
of subjective discomfort, and the presence of external 
Stress, 

While no specific hypotheses were chosen prior to 
obtaining the data, a number of questions can be 
posed: 

1. What kinds of symptoms or complaints do 
analytic patients present? 

2. Is there a relationship between the kind or 
variety of initial symptoms presented by a patient 
and the results of his analysis? 

3. Does the degree of the patient's discomfort 
relate to improvement? 

4. Does it matter whether the impetus for treat- 
ment is internal or external (environmental pres- 
sure) ? 

5. Can anything be stated as to why patients 
leave analysis? 

6. What determines the point at which dropout 
from analysis occurs? 


Subjects 


The data reported here were collected oan ie 
course of a prevalence survey of patients underg 
analysis during the then current “psychoanalytic 
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year” (September 1962 through August 1963), Thirty 
analysts, approached informally, agreed to supply 
information on all patients they had seen in classical 
psychoanalysis during the studied year, Our sample 
covers the total analytic case loads of this non- 
randomly selected group of analysts with the follow- 
ing exceptions: (a) Data concerning several pa- 
tients, known by their analysts to be personally 
acquainted with the authors, were excluded. (b) 
Two analysts, with a combined total of 17 patients, 
responded to a shorter set of selected questions, The 
variables studied here were not covered in the infor- 
mation provided by these two analysts, 

The sample for this report thus includes a total of 
127 cases reported upon by 28 analysts. The infor- 
mation collected relates to both the initial condition 
of the 127 patients and to their condition during the 
surveyed period. As a prevalence study, the infor- 
mation obtained represents patients seen at all points 
in their analyses from initiation to either mutually 
determined completion or a unilateral decision to 
discontinue treatment. 


Measurement of Change 


Due to the nature of our survey, all measures were 
limited to judgments of the reporting analysts, To 
increase reliability, fixed-point scales were used as 
our method of choice. Unfortunately, it was not 
possible to construct an acceptable fixed-point scale 
to measure symptomatic change, and a scale that 
covered four degrees of change was used instead, 
These were: (1) complete remission or greatly 
improved; (2) moderately improved; (3) minimal 
improvement; (4) exacerbation of symptoms, 
Fixed-point scales were constructed to measure: vo- 
cational adjustment; object relations (relationships 
with others); and capacity to experience pleasure. 
Vocational adjustment, shown as an example below, 
covered three steps; the remaining scales. covered 
four steps: 

Mi re 
12 


NO PATIENT ENDING ANALYSIS 
PRIOR TO COMPLETION 


MEAN 
CHANGE 
FROM 
INITIAL 
STATUS 


«I 1-<3 


3-5 >5 
YEARS IN PSYCHOANALYSIS 


Fro, 2, Mean occurrence of potential regression in 
patients continuing through completion of psycho- 
analysis and in patients leaving analysis prior to 
completion. 
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1. Patient unable to find work or keep a job (for 
housewife: unable to perform responsibilities of 
wife and mother; for student: unable to make pas- 
sing grades). 

2. Patient able to keep position but not at a level 
appropriate to his intelligence, training, and experi- 
ence (for housewife: able to perform responsibilities 
of wife and mother, but at a low level of efficiency; 
for student: able to make passing grades, but not 
performing up to potential). 

3. Patient able to work at level appropriate to 
intelligence, training, and experience (for housewife: 
carries out responsibilities of wife and mother effi- 
ciently; for student: performance in school is up to 
potential). 

The four scales of symptomatic status and level of 
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functioning, described more fully in Aronson and 
Weintraub (19684), were our criteria of change 
with analysis. 

It is important to note that a patient initially 
rated at the extreme of a fixed-point scale cannot be 
measured as moving farther in the direction of his 
extreme rating. Our results, therefore, relate changes 
only for those patients who were changeable in the 
direction being studied. Our Ns vary accordingly for 
each of the four measures. 


Adjustment for Duration of Analysis 


By chance or due to differential drop-out rates, 
patients with a particular complaint or condition 
might easily be overrepresented in the earlier years 
of analysis. Their mean improvement rate, as com- 


TABLE 1 
INITIAL SYMPTOMS OR COMPLAINTS, ARRANGED INTO Major CATEGORIES, OF 126 PSYCHOANALYSANDS 
Category Y No. patients Category No. patients 
Neurotic Psychosomatic (continued) 
Obsessions and/or compulsions 50 Menstrual disturbances 4 
Phobias 36 Migraine 3 
Boredom 27 Avia 3 
Conversion symptom (s) 22 Peptic ulcer 1 
Social inhibitions 4 Paroxysmal tachycardia 1 
Dissociative episode(s) 2 Overweight 1 
Work or study inhibitions 1 Dermatitis 1 
Anxiety over sexual identity Insomnia 1 
At least one symptom in category 944(75%) At least one symptom in category 37 (29%) 
No other category listing 5 No other category listing 1 
Habits Character traits 
Nail-biting 7 Gross immaturity 19 
Tic(s) | 2 Hypochondriasis 12 
Enuresis _ 2 Withdrawn or asocial 9 
Encopresis 1 Narcissism 1 
Bite cheek 1 Masochism 1 
At least one symptom in category 10 (8%) At least one symptom in category 35 (28%) 
No other category listing 0 No other category listing 1 
Alloplastic Sexual inhibition or inadequacy 
«Keting-out! * behavior 45 Impotence or frigidity y 48 
Extremely low frustration tolerance 29 Premature ejaculation 2 
Temper tantrums 21 Sexual inhibitions 1 
pode Us 5 At least one symptom in category Yi (40%) 
vior isti. 
Infidelity 1 No other category listing 
PA i 1 Sexual deviation 
t least one symptom incategory 64 (51%) Homosexuality 16 
No other category listing 4 Voyeurism E 
b ? Pedophilia ; 
eid Fetishism 8 
‘otal 67 (53 ERGO. 
No other category listing 0 63%) gaubien 2 
D 4 Transvestism 1 
epression Kleptomania i 0%) 
M M 59 (47%) At least one symptom in category 25 (2070 
o other category listing 1 No other category listing 1 
Psychosomatic $ 
Chronic fatigue 16 ooking 5 
Headaches 10 Depersonalization HIC) 
Baa 6 t least one symptom in category i : 
olitis 5 No other category listing 
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pared with patients who continue for further treat- 
ment, would then be deceptively low. In order to 
adjust for variability due to unequal time in analy- 
sis, the data reported upon here were, wherever 
appropriate, divided between those from patients in 
analysis for less than 3 yr. and those in analysis for 
3 yr. or more. The differences attributable to each 
variable being studied were then combined from 
both durational levels by means of Cochran's cri- 
terion for combining results from groups of unequal 
size (Maxwell, 1961). This method essentially ap- 
proaches the data as if we had done two studies, one 
of a group of patients seen early in treatment and 
another of patients seen late in analysis. The results 
of the two are combined to use the larger N while 
still controlling for durational effects. 

Since no directional hypotheses were made, two- 
tailed tests of significance were used in determining 
significance levels. 


RESULTS 
Category of Initial Complaints 


A list of 48 prominent symptoms with blank 
spaces for additional unlisted symptoms or 
complaints provided each analyst a framework 
upon which to report the initial problems of 
each patient. Table 1 shows those symptoms 
or complaints present at initiation of analy- 
sis, each listed under 1 of 10 major categories. 
Symptoms were placed into categories with- 
out knowledge of other results and, wherever 
placement was in doubt, categorization fol- 
lowed analytic theory. One write-in symptom, 
“problems with children,” could not be cate- 
gorized, and that patient, who had no other 
symptom recorded, was dropped from those 
parts of the study requiring categorization of 
symptoms. 

Categories were chosen as being more spe- 
cific than diagnoses yet more encompassing 
than symptoms. A patient with two symptoms 
in a single category is treated as having only 
one symptom category. His pathology is con- 
sidered as being narrower than that of a pa- 
tient whose symptoms cross category lines. 
It is important to note that a patient can be 
listed under any or all of the categories simul- 
taneously and, in fact, few patients had diffi- 
culties limited to only one category. 

Data concerning the number of patients, by 
years in analysis, found under each category 
as well as under other variables to be dis- 
cussed below are presented in Table 2. Table 
3 lists the improvement rates by category of 
Patients who had at least one symptom in the 
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TABLE 2 


NUMBER OF PATIENTS STUDIED IN PREVALENCE SUR- 
VEY OF ANALYTIC TREATMENT, CATEGORIZED BY 
STATUS AT INITIATION OF ANALYSIS 


Years in analysis 
Initial variable 
<3 | 23 
Symptom category 
Neurotic 60 33 
Habits 5 5 
Alloplastic 38 25 
Severe anxiety 44 23 
Depression 37 22 
Psychosomatic 25 11 
Character traits 21 13 
Sexual inhibition or inadequacy 32 19 
Sexual deviation 15 10 
Loss of reality 2 2 
Number of symptom categories 
26 Ta 2 10 6 
3-5 45 28 
«3 22 15 
Degree of subjective suffering at 
initiation 
Extreme 22 15 
Moderate 45 26 
Mild 9 8 
Treatment initiated after onset or 
exacerbation of acute symptom- 
atology 
Yes 36 42 
No 41 7 
Treatment initiated after a major ex- 
ternal situational stress 
Yes 15 16 
No 62 33 


category. The differences between rates for 
patients found within a symptom category and 
those obtained for patients not included under 
that category were compared. The resulting 
significances did not exceed the number ex- 
pected by chance. It would appear that 
knowledge of initial symptoms or complaints 
is as little useful as was crude diagnosis in 
predicting improvement rates within this al- 
ready screened patient group. 


Motivational Effects 

The findings on four variables, motivational 
in nature, are shown in Table 4. They were: 
breadth and depth of discomfort, chronicity 
of disorder, and the presence of external pres- 
sure, Breadth of discomfort was measured by 
the total number of different symptom cate- 
gories under which a patient’s initial com- 
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TABLE 3 
ACHIEVEMENT OF POTENTIAL IMPROVEMENT OF ANALYTIC PATIENTS 
CATEGORIZED BY INITIAL STATUS OF SYMPTOMS j 
Measure of improvement (by years in analysis) 
Obj: ii 
Symptom category Symptomatic ME relationships for peat 
<3 | 23 p <3 | 23 p «3|23| > 
96 | 96 % | % % | % 
Neurotic 28 | 88 | <.10 | 23 | 59 ns 37 81| ns 
Habits 0 | 80 ns 0 | 60 ns 40 | 80| ms l 
Alloplastic 15 | 73 | <.10 | 15 | 32 ns 36 | 70] ns 
Severe anxiety 25 | 87 ns 20 | 68 ns 34 83| ms 
Depression 32 | 77 ns 21 | 55 ns 35 | 70| ms 
Psychosomatic 35 | 71 ns 35 | 73 | <.05 | 36 | 64| ms 
Character traits 44 | 89 | <.10 | 30 | 50 ns 29 T| ns 
Sexual inhibition or in- 
adequacy 23 | 88 ns 17 | 47 ns 28 84| ns 
Sexual deviation 33 | 67 ns 23 | 33 ns 47 70 | ns 
Loss of reality 50 | — | <.10 | 50 | 50 ns 50 | 100| ms 
Total group 25 | 76 24 | 57 34 | 78 


«Each 


{eee test measures the improvement of those patients within the symptom category 
patients. Differences were analyzed statistically by means of Cochran's criterion for combining results fro: 


Probabilities are stated for two-tailed conditions. By chance, 2 of these 40 comparisons should reach p < .05. 


as compared to all other 
m groups of unequal size, 


plaints were listed. Here a significant differ- with greater breadth had greater early im- 
ence was found for three of our four measures provement, less regularity was found in the 
of improvement, However, although those later therapeutic years. In general, while the 
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TABLE 4 


BY INITIAL STATUS OF MOTIVATIONAL FACTORS 


Measure of improvement (by years in analysis) 
Initial status Symptomatic Vocational functioning | Object relationships Capacity for pleasure 
dull» € ]s | s <3 | 23 cae 
No. symptom | 
categorie, 
60 |83 43 |100 25 |33 50 |67 
pm 44 |89 ns |27 71 <.03 | 23 70 <.03 | 27 78 «0 
< 45 | 80 14 | 78 15 |38 41 | 85 
Subjective discom- 
1 55 | 93 33 | 92 30 | 64 41. | SE 
Medium 44 |85 | ns 27 |69 | «0 19 |56 | «o 33 |76 | € 
Ww 44 | 75 14 | 50 12 | 43 2 | 67 l 
Aute onset 03 
es 59 |83 | «10|21 |93 | «10 25 |65 4 | 83 50$ 
No 38 |87 28 | 60 19 | 52 " oz | 7 
Eo Stress 
es 47 |9% | ms |10 |100 | <03| 8 |60 or "19M 
No 45 |8 29 | 65 24 |55 "o[ss | 78 
Total group | 47% | 86% 25% | 76% 21% | 57% 34% | 78% 


2 Differences were compared by Cochran's criterion for combining results from groups of unequal size. Probabilities 


for two-tailed conditions, 


are state 
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breadth of difficulty appeared to be important 
in terms of statistical significance, its effect 
was more consistent early in analysis. 

Such irregularity was not the case for the 
degree of subjective discomfort. With subjec- 
tive discomfort there was a regular progres- 
sion such that, both for those in the first 3 yr. 
and those later in analysis, the greater the 
discomfort the more likely the improvement. 
This progression reached statistical signifi- 
cance for two measures and approached sig- 
nificance (p < .10) for another. It was still 
present but insignificant for the fourth mea- 
sure of improvement. 

An examination of chronicity also produced 
statistically significant differences. The pa- 
tients who, on initiating analysis, were in 
either acute or exacerbated states generally 
showed greater improvement. This was sig- 
nificant for one measure and showed a ten- 
dency toward significance ( < .10) for two 
more. The presence of external or situational 
stress had a significant effect on only one 
measure, vocational functioning. Such stress 
appeared to retard vocational improvement in 
the first 3 yr. and increase improvement dur- 
ing the later years of analysis, as compared 
with change found in patients lacking external 
stress. 

Of the 16 statistical tests done on motiva- 
tional effects, we should expect less than 1 to 
be significant by chance at the .05 level. 
The data showed 7 comparisons significant at 
the .05 level with an additional 3 at the .10 
level. Unlike our findings with symptom cate- 
gories, motivational variables appear to have 
effects that cannot be explained by chance 
alone. 


Unilaterally Determined Termination 


Table 5 records the percentages of patients 
who left analysis for reasons other than com- 
pletion of treatment. Completion may be de- 
fined as that point at which analyst and pa- 
tient agree that maximum benefit has been 
obtained from treatment. While there is not a 
significant difference in overall drop-out rates 
between durational periods, we have retained 
the division of up-to-3 and 3-or-more years in 
analysis, This division allows for greater in- 
terpretation in terms of reasons for premature 
termination. 
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TABLE 5 


Drop-Out RATE DURING SURVEY YEAR OF ANALYTIC 
PATIENTS CATEGORIZED BY SYMPTOMS AND OTHER 
INITIAL VARIABLES 


% dropout 
Prob- 
Symptom or variable SEES dup 
of anal- | of anal- | ferences 
ysis ysis 
Symptom category 
Neurotic 1 12 18 ns 
Habits 20 20 | ms 
Alloplastic 5 20 ns 
Anxiety 9 13 ns 
Depression 5 14 ns 
Psychosomatic 4 0 | <.05 
Character traits 5 36 | <.03 
Sexual inhibition or inad- 
equacy 12 16 | ms 
Sexual deviation 7 5 ns 
Loss of reality 50 100 | <,001 
No. symptom categories 
ie i 10 50 
3-5 7 7 | <.01 
<3 18 20 
Degree subjective suffering 
at initiation 
Extreme 9 27 
Moderate 9 12 | <.10 
Mild 22 12 
Exacerbation or acute symp- 
toms prior to initiation 
Yes 6 17 ns 
No 13 16 
Major external situational 
stress prior to initiation 
Yes 13 18 ns 
No 10 15 
Total group 10 16 ns 


a Total group differences due to years in analysis were com- 
pared by the x? test. The remaining; statistical analyses for 
Hifferences between groups give probabilities obtained with 
Cochran’s criterion for combining results from ups of 
unequal size. All probabilities are stated for two-tailed condi- 


tions. 


Here, the data do show differences attribu- 
table to symptom category. Patients whose 
symptoms included signs of distortion of real- 
ity consistently dropped out of treatment 
more often than did patients who did not 
show such initial complaints ( < .001). Pa- 
tients with psychosomatic symptoms had ex- 
ceptionally low drop-out rates (p < .05). For 
patients who had character traits, there was 
less dropout early in treatment and consid- 
erably more attrition later in treatment than 
for patients not listed under that category (? 


< 03). 
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Breadth of disorder showed a significant 
(p < 01) relationship with dropout. Those 
patients with a medium number of symptom 
categories showed the lowest attrition rate. 
Those with relatively few symptom categories 
had higher drop-out rates early in analysis 
while those with a wider range of symptoms 
had a disproportionately high percentage of 
dropouts late in analysis. Depth of disorder 
shows a similar trend. Those whose initial 
suffering was mild tended to drop out early 
while those whose suffering was extreme 
tended to leave prior to completion relatively 
late in treatment (p < .10). 

There was no evidence that a major ex- 
ternal situational stress influenced the rate of 
attrition. 


Discussion 
Comparison with Other Studies 


Strupp, Wallach, and Wogan (1964) stud- 
ied a group composed of analytic as well as 
other patients in dynamic psychotherapy, for 
the most part without differentiating between 
depths of therapy. Their data indicate a 
close correspondence with our own findings 
on percentages of patients with such symptom 
categories as somatic problems, anxiety, psy- 
choticlike signs, etc. Their patients included a 
noticeably smaller number with sexual prob- 
lems and fewer of our group were listed as 
acting out (alloplastic). They obtained a sig- 
nificant correlation between their therapist's 
estimate of initial motivation and overall suc- 
cess in therapy, a finding consonant with our 
own. Unfortunately, they did not enlarge upon 
what they meant by motivation. Motivation 
may be defined in a number of ways. Waller- 
stein, Robbins, Sargeant, and  Luborsky 
(1956), for example, defined motivation in 
terms of honesty, fees, goals, and secondary 
gains. These would not correspond to what 
we have considered as motivational matters. 

Luborsky (1962), who reported interim 
results of the Menninger Project findings, 
concluded that for patients mot greatly in- 
capacitated at initiation, “the higher the 
level of anxiety the greater the change that is 
likely." Our group, as outpatients with suffi- 
cient “ego strength" to be acceptable for 
analysis, would generally fit the criterion of 
not being greatly incapacitated. However, 
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presence of severe anxiety, as our analysts 
reported it, did not predict either success or 
attrition from treatment. 

The Central Fact-Gathering Committee of 
the American Psychoanalytic Association 
(Hamburg, 1967) reported a total of 43% of 
their patients as not completing analysis, a 
figure not unreasonably different from our 
attrition rate of 10+% per analytic year, If 
we were to reduce a theoretical 100 patients 
by the average drop-out rate found for each 
year, we would lose 52 by the fifth year. 


tom categories into nine exclusive rather than 
overlapping areas, and often presented results 
on analytic patients and patients treated by 
psychotherapy without differentiating between 
treatments. It found low completion rates for 
schizophrenics, paranoids, and sexual deviates 
and a high completion rate for psychosomatic 
patients, Our findings are similar except for 
the low rate for sexual deviates. It is possible 
that, since our categories were not mutually 
exclusive, a number of less habituated cases 
were listed within our deviate category while 
the Fact-Gathering report may have included 
only cases for which a sexual problem was # 
primary concern. 

In all, the differences in research method- 
ology between our own and other studies 
were so great that even such comparisons 4$ 
stated above are questionable. 


Present Findings 


It appeared, when we studied the diagnostic 
designations of our patients (Aronson & Wein- 
traub, 1968a), that the majority of patients 
were labeled as character disorders. Since 
analysis was developed largely as 
for neurotics, this was an interesting result. 
However, an examination of actual symptoms 
would suggest that, if our results are typica 
of analytic practice, the problems being 
treated in analysis may still largely be new 
rotic ones. A majority of our patients hat 
least one neurotic complaint. 

Our results bring us to conclude 
the kind of symptom presented is 
cially important in predicting imp 
or that analysts are already well groun | 
the methods they use to decide which T 
matic conditions can be improved by ™° 


ude that either 
not espe 
rovement; 


The Fact-Gathering Report divided symp- | 


a treatment 
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— of their techniques. Our data indicate, for ex- — terioration. In general, those who suffer more 


ample, that patients whose contact with real- 
ity is not strong are seldom accepted for 
analysis; when accepted, they seldom com- 
plete the analytic course. 

The major findings reported here indicate 
the importance of motivational factors both 
on improvement and on attrition from treat- 
ment. Patients who suffer more tend to bene- 
fit more from analysis both in the early and 
the later stages. Those whose symptoms range 
into a wider number of areas show particu- 
larly high rates of improvement early in 
treatment. Motivation by environmental stress 
is less effective in producing improvement than 
is the impetus of internal discomfort. 

The patient most likely to benefit from 
analytic treatment is one whose symptoms 
are not so severe as to include distortion of 
reality. His discomfort is of relatively acute 
onset, giving him a recent memory of ade- 
quacy and less adaptation to disorder. His 
symptoms cover sufficient areas of life to 
make it difficult to avoid them. Yet he does 
not have so many symptoms that he has little 
or no experience with symptom-free function- 
ing. His internal reaction to his lessened func- 
tioning is one of strong discomfort. 

For this ideal patient, it is relatively un- 
important to note through which specific 
symptom or symptoms his discomfort makes 
itself known. However, if his pathology in- 
cludes character traits, he may not remain in 
analysis after some initial improvement has 
been obtained; if his symptoms include psy- 
chosomatic ones, he is more likely to be a 
tenacious patient. 

Patients whose symptoms cover a narrow 
tange or who have little personal distress are 
not only less likely to improve but also more 
likely to drop out of analysis during the first 
years of treatment. Of those who suffer ex- 
tremely and of those whose symptoms cover a 
wider range, few drop out early in treatment; 
such patients, however, are more prone than 
less disabled patients to leave analysis fol- 
lowing a unilateral decision after a number of 
years. By that time either motivational pres- 
sures have been lessened by improvement or 
their drive to change has led to further de- 
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stay until some change has occurred. 

The question of what happens to patients 
who leave analysis short of completion has 
seldom been broached. We have found that, 
at the point of departure, their achievement 
of positive change could not be differentiated 
from that of patients with equal therapeutic 
experience who remained for further analytic 
examination. Cases reported upon by Freud 
are remarkable, by present-day standards, for 
their brevity. Perhaps Freud ended his treat- 
ment with the expectation that the patient 
would continue to improve once the process 
was set in motion. Perhaps he counted on 
the motivations noted here to bring about 
the necessary changes. Present-day analysts, 
instead, direct the ongoing changes, thereby 
increasing treatment time. Apparently they 
hold continuing motivation as insufficient to 
achieve the desired effects. No experimental 
studies have been reported that test the above 
contending hypotheses. 
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Paranoid and nonparanoid schizophrenic patients, as well as a normal sample, 
were compared on: (a) capacity to form a trusting relationship with an 
unknown other, and (b) the relative efficacy of a “reformed sinner” versus a 
“lapsed saint” strategy in inducing trusting and trustworthy behavior in these 
samples. The Ss played a two-person bargaining game for 30 trials and were 
randomly assigned to one of two experimental strategies. Several game-playing 
styles differentiated the three samples. The paranoids, as contrasted with the 
nonparanoids and normals, were more likely to be “cooperative” when dealing 
with a lapsed saint and more “exploitative” when confronted with a reformed 
sinner. The results reflected the paranoids’ conflicting attitudes toward dis- 


crepancies in power. 


Several psychoanalytic theorists have pro- 
posed that the primary deficit in schizophrenia 
is an impairment in “basic trust” (Arieti, 
1961; Erikson, 1953; Fromm-Reichmann, 
1948). While various subtypes of schizo- 
phrenia are characterized by ambivalence with 
respect to when and whom to trust, the para- 
noid shows pervasive projection and rigid in- 
terpersonal suspicion (Wiener, 1966). Several 
factor-analytic studies by Lorr (1955, 1960, 
1966) indicate that all types of paranoid pa- 
tients show, in common, a reliance on projec- 
tion, perceptual distortion, and overt hostility. 
Lorr (1966) defines projection as the “fixed 
attribution of hostile persecuting intent,” 
whereas distortion refers to “hallucinations 
that threaten or demand [p. 35].” 

The current practice of dichotomizing 
schizophrenia into paranoid and nonparanoid 
types (Katz, Cole, & Lowry, 1964) is conso- 
nant with research findings in the area of 
cognitive, perceptual, and psychomotor per- 
formance (Lynn, 1963; McGonaghy, 1960; 
Shakow, 1963; Silverman, 1964). Unfortu- 


1 This present report is based in part on a doctoral 
dissertation submitted by the first author to the 
graduate faculty, Boston University. This study was 
supported in part by Public Health Service Research 
Grant M-5353 from the National Institute of Men- 
tal Health. 

2 Requests for reprints should be sent to Thomas 
Harford, Massachusetts General Hospital, Boston, 
Massachusetts 02215, 


nately, few laboratory experiments have as- 
sessed the interpersonal motives and social 
emotional orientation of the paranoid schizo- 
phrenic. The present study seeks to determine 
how interpersonal trust and/or suspicion de- 
velops and how they can be modified in à 
sample of paranoid and nonparanoid schizo- 
phrenic patients. 

Deutsch (1958, 1962) introduced the use 
of a two-person, bargaining game for the il- 
lumination of some of the personal and situ- 
ational determinants of interpersonal trust and 
suspicion. Interpersonal trust is said to exist 
when a person (P) behaves on the expecta- 


tion that the other person (O) will do somè- - 


thing, and P perceives that O is aware that 
his behavior has motivational relevance for 
pS 


3 The essential ingredients of a bargaining relation- 


ship include (a) a conflict of interests between ir 
participants, (b) provision of more than one OU 
come (ie, a bargain may be struck or not), an 
(c) the possibility of cooperative strategies. 


^ tions of benevolence 
Trust refers to expectation: e 


a 
lence. Thus, for example, er trusts 


mechanic to repair his brakes properl; 


harm—an auto accident. The trusting osi- 
his trust is fulfilled, is in a more advantageous " 3 
tion than if he had not trusted. However, the 
vantages of trusting and having his trust unf ah 
are considerably greater (ie. auto accident) nat 
the advantages of having his trust fulfilled (ies 
appointments). 
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RED 
PERSON 1 


(SUBJECT) 


GREEN 


RED 
+3 
+3 
-5 
+5 
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PERSON 1 
(CONFEDERATE) 


GREEN 
+5 
-5 


Fic, 1. Game matrix. (Person 1 chooses a row; Person 2 chooses a column. 
Person 1’s payoffs are in the lower left hand corner of each cell.) 


Figure 1 presents the payoff matrix for the 
two-person bargaining game, the “prisoner’s 
dilemma.” In this game, interpersonal trust is 
defined as a cooperative choice based on an 
expectation of cooperation from the other 
player, whereas suspicion is defined as a non- 
cooperative choice. The paradoxical feature of 
the prisoner’s dilemma is that, from the indi- 
vidual player’s point of view, rational behav- 
lor is not possible unless a relationship of 
mutual trust exists. 

Klein and Solomon (1966) studied a sample 
of acute schizophrenic patients in a two- 
person bargaining game. Each patient was 
paired with an experimental confederate who 
assumed an “exploitative” strategy for the 
first 15 trials. In the second phase the confed- 
erate “reformed” and adopted a conditionally 
cooperative strategy. The results indicated 
that the paranoids maintained their noncoop- 
erative stance for a longer period in the face 
of the confederate’s reform. Klein and Solo- 
mon (1966, p. 507) questioned whether the 

sinner’s” initial exploitation served to rein- 
force the paranoid’s suspicion (thereby mak- 
ing them more impervious to a cooperative 
reform), or whether this rigidity reflected a 
cognitive inability to adapt to change. To 
study these issues it would be necessary to 
design an experiment in which the confed- 
erate’s strategy shifts from unconditional to 


conditional cooperation (“lapsed saint") as 
well as from unconditional noncooperation to 
conditional cooperation (“reformed sinner”). 
Studies by Bixenstone and Wilson (1963) and 
Scodel (1959) with college students demon- 
strated that a reformed-sinner strategy is 
more effective in eliciting cooperation than 
such strategies as initial cooperation followed 
by noncooperation (lapsed saint) or uncondi- 
tional cooperation (turn the other cheek). 

A study by Knapp and Podell (1968) com- 
pared the behaviors of college students, psy- 
chotic patients, and prisoners in a two-person 
game in which the other player, a confederate, 
adopted either a cooperative strategy (80% 
cooperation) or a competitive strategy (20% 
cooperation). Although the competitive pro- 
gram failed to differentiate the three groups, 
the students and patients responded with 
more cooperation to the cooperative program, 
in contrast to the prisoners. 

The present experiment sought to compare 
paranoid with nonparanoid schizophrenic pa- 
tients, and a normal sample on (4) capacity 
to form a trusting relationship with an un- 
known other and (5) relative efficacy of a re- 
formed-sinner versus a lapsed-saint strategy 
in inducing trusting and trustworthy behavior. 


METHOD 


Thirty-seven hospitalized schizophrenic patients 
from the Massachusetts Mental Health Center, and 
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11 patients from McLean Hospital, Belmont, Massa- 
chusetts, served as Ss. Resident male patients were 
selected according to the following criteria: (a) 
primary diagnosis of schizophrenia, without organic 
involvement, mental deficiency, or alcoholic addic- 
tion and (b) age range of 17-35. The selected pa- 
tients were subdivided, on the basis of staff diag- 
nosis, into paranoid and nonparanoid groups, with 
24 patients in each. The nonparanoid group included 
the following diagnostic categories: simple, 2; 
catatonic, 2; affective, 2; undifferentiated, 18. In 
addition, each patient received the Progressive 
"Matrices test (Raven, 1949) and a forced choice 
version of the Wechsler Adult Intelligence Scale 
(WAIS) Vocabulary test. 

The groups were matched so that there were no 
significant differences between the paranoid and 
nonparanoid samples with respect to WAIS (Forced 
Choice) Vocabulary test, Progressive Matrices, socio- 
economic status (based upon father's occupation), 
and education and age at time of testing. There 
were no significant differences between the two 
groups for premorbid level of adjustment based on 
the Elgin Prognostic Scale (Becker, 1955) and the 
Prognostic Rating Scale (Phillips, 1953). 

A comparison group of 24 male college students 
were drawn from the evening session division of 
Boston University. The student group differed from 
the patients in having higher average WAIS verbal 
intelligence scores, being on the average 4-5 yr. 
younger, and coming from higher socioeconomic 
class. 


Procedure 


The E met with each patient on the ward in 
order to solicit participation in the study. The game 
was described by providing the patient with a card 
containing an actual game matrix and then proceed- 
ing to describe the choice options, the payoffs, and 
the structural interdependence for the two players 
(see Figure 1). After the patient demonstrated un- 
derstanding of the game, E would ask if he would 
consent to come to the laboratory to participate in 
a game-playing study. 

The procedure for soliciting the college student 
sample was somewhat different. Students in the 
class setting were asked to volunteer to participate 
in a game-playing study. The laboratory was de- 
signed to accommodate six Ss at one time. The stu- 
dents were provided with the same experimental in- 
structions and rewards as the patients. A more de- 
tailed description of this laboratory setting has been 
presented elsewhere (Harford, 1965). 


Apparatus and Experimental Situation 


The laboratory setting consisted of a large rec- 
tangular room with two large mobile booths set up 
at opposite sides of the room. These booths shielded 
Ss from seeing or talking to each other. Each booth 
housed a small electrical switch box which was wired 
to a master panel, permitting E to intercept and 
transmit game choices to S without his knowledge. 
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The S's panel box was equipped with two rows oí 
lights (two lights per row) and two switches lo- 
cated directly below the lights. 

The payoff matrix of the game appears in Figure 
1. Player 1 (S) chose between red and green (rows) 
while Player 2 (the stooge) chose between red and 
green (columns). Depending on the combination of 
choices, the players either won or lost chips. Player 
1’s wins and losses were circled. At the end of the 
game Ss exchanged the chips they had won for cig- 
arettes or candy. The structure of the game was 
such that both players could win (via red choices), 
or both could lose (via green choices), or one could 
win and the other lose. 


Saint and. Sinner Strategies 


The Ss were randomly assigned to one of the two 
experimental conditions. In one condition (lapsed- 
saint strategy, Harford & Solomon, 1967) the con- 
federate chose cooperatively for the first three trials 
(unconditional cooperation) and then switched to 
conditional cooperation for the remainder of the 
game (the program cooperated only if S cooperated 
on the immediately preceding trial). On the other 
hand, in the second condition (reformed-sinner 
strategy) the confederate chose noncooperatively for 
the first three trials, then switched to unconditional 
cooperation for the next three trials, and finally 
changed to conditional cooperation for the remainder 
of the game. Both strategies are similar, the only 
difference being that the reformed sinner initiates 
the game with three noncooperative choices and 
thereafter repeats the strategy of the lapsed saint. 

The major dependent variables in the present 
study were trust, risk taking, exploitation, and sus- 
picion, as derived and defined from an analysis of 
S's expectations of the other's behavior, together 
with S’s own behavior in the game (Solomon, 1960). 
Trust was defined as a cooperative choice based on 
an expectation of cooperation from the other players 
whereas suspicion was a noncooperative choice bas 
on an expectation of noncooperation. Exploitation 
was defined by a noncooperative choice based on an 
expectation that the other player would cooperates 
whereas risk taking was a cooperative choice base’ 
on an expectation of noncooperation. 


RESULTS 
each 


Separate analyses were performed on 
trust, 


of the following measures: cooperation, 
suspicion, risk taking, and exploitation. These 
measures were not completely independent 0 
each other. For example, trust and exploita- 
tion have in common an expectatio 
eration, whereas trust and risk takin; 
common a cooperative choice. Table B 
a set of intercorrelations of the game 
measures based on the patient sample: 
intercorrelations among trust, suspicion, 
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TABLE 1 
INTERCORRELATIONS BETWEEN GAME INDEXES FOR THE SAMPLE OF SCHIZOPHRENIC PATIENTS 


Variable e Risk taking Suspicion Exploitation — |Cooperative choice 

Trust 1.00 
Risk taking —.33* 1.00 
Suspicion 61% —.30* 1.00 
Exploitation —.44** 13 —.25 1.00 
Cooperative choice .85** 20 —.80** —.38** 1.00 
Cooperative expectation .80** —.28 =.83** .18** .68** 

Note.—N = 48 patients. 

*p <.05, 


**p c0; 


cooperation were high, whereas those for ex- 
ploitation and risk taking were low. 


Trial 1 


A separate analysis was made for the first 
trial choice for the three experimental groups. 
Trial 1 provides a relatively pure measure of 
S's readiness to trust an unknown other. 
There were no significant differences between 
the three groups for any of the measures. 
Taking the sample as a whole the breakdown 
was as follows: trust 37%, risk taking 10%, 
suspicion 29%, and exploitation 24%. Fifty 
percent of the Ss chose to cooperate on the 
first trial. 


Effect of Strategy Change: Trials 
1-4 versus 4-8 


Experiments designed to test the effects of 
variations in the confederate's strategy on S 
indicate that S's strategy change may be a 
more sensitive indicator of treatment effects 
than the mean number of a particular type 
of response (Swingle, 1967). To assess S’s re- 
actions to the confederate's abrupt change (at 
Trial 4), analyses of covariance were applied 
to Trials 5-8.5 The number of choices made 
during Trials 1-4 served as covariates and 
the number of choices made during Trials 
5-8 served as criterion (Winer, 1962). The 
game measures were transformed by the com- 
mon logarithm. 

Cooperation. The analysis of covariance 
yielded a significant Groups X Strategies in- 
teraction (F = 4.48, df = 2/65, p < .05) for 


5 Although both the saint and the sinner abruptly 
change their strategy on Trial 3, S (who chooses 
simultaneously) made his trial choice without any 
knowledge of the confederate’s change. Thus Trials 1 
through 4 were treated as a common phase. 


cooperation on Trials 5-8. Table 2 presents 
the adjusted means for Trials 5-8, together 
with the unadjusted means for Trials 1-4 and 
5-8. Multiple tests between the means were 
based on the average effective error (Winer, 
1962). The results indicated that the students 
were more reactive to the abrupt change in 
the confederate’s behavior than either of the 
patient groups. The students were less coop- 
erative with the lapsed saint than with the 
reformed sinner (p< 01). The paranoid 
schizophrenics were significantly more coop- 
erative with the lapsed saint than either the 
nonparanoid schizophrenics (p < .10) or the 
students (p< .05). Although there were no 
significant changes within each patient group 
with respect to the strategies, it is of interest 
to note that the differential impact of the 
strategies on cooperative behavior was more 
similar for the nonparanoid schizophrenics 
and the students than for the paranoid schizo- 
phrenics. 

Trust. The analysis of covariance yielded a 
significant Groups X Strategies interaction (F 
= 3.72, df = 2/65, p< 05) for trust on 
Trials 5-8. The adjusted means for Trials 


TABLE 2 


Apjusrep MEANS ON COOPERATION FOR TniALS 5-8 
AND UNADJUSTED MEANS FOR 
Trrats 1-4 AND 5-8 


Paranoids | Nonparanoids Students 

Mean 
LS RS Ls RS Ls RS 

djusted 
Vos 14 38 | 44 | 51 | .36 | 39 27 
Trials 5-8 | 49 | 43 | 40 42 | .24 | 46 

djusted 
A iale 5-8 so | 39 | .32 | 44 | .25 | .54 


Note.—LS = lapsed saint; RS = reformed sinner. 
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TABLE 3 


ADJUSTED MEANS on TRUST FOR TRIALS 5-8 AND UN- 
ADJUSTED Means FOR TRIALS 1-4 AND 5-8 


Mean Ls | RS | LS RS | LS RS 
Unadjusted 
Trials 1-4 31 | 41 | .34 | 32 | .27 | .22 
Trials 5-8 43 | .37 | .37 | .35 | .24 | 45 
Adjusted 
Trials 5-8 43 | 31 | 35 | 34 | .27 | .51 


Note.—LS = lapsed saint; RS = reformed sinner. 


5-8, together with the unadjusted means for 
Trials 1-4 and 5-8, are presented in Table 3. 
The students were less trusting with the 
lapsed saint than with the reformed sinner 
(p < .05). The paranoids were less trusting 
than the students with the reformed sinner 
(p < 10). There were no significant differ- 
ences within each patient group with respect 
to the strategies. 

Exploitation. The analysis of covariance 
yielded a significant Groups X Strategies in- 
teraction (F = 4.07, df = 2/65, p < .05) for 
exploitation on Trials 5-8. Table 4 presents 
the adjusted means for Trials 5-8, together 
with the unadjusted means for Trials 1-4 and 
5-8. The nonparanoid schizophrenics were 
more exploitative with the lapsed saint than 
with the reformed sinner (p < .01). The non- 
paranoid schizophrenics were more exploita- 
tive with the lapsed saint than either the 
paranoid schizophrenics ( < .05) or the stu- 
dents (p < .10). Although not significant, the 
patternings of exploitation under both strate- 
gies were more similar for nonparanoids and 
students than for paranoids. 

Suspicion and risk taking. The results of 
the covariance for the measures suspicion and 


TABLE 4 


ADJUSTED MEANS ON EXPLOITATION FOR TRIALS 5-8 
AND UNADJUSTED MEANS FOR 
Trrats 1-4 AND 5-8 


Paranoids | Nonparanoids Students 


Mean 
ts | RS | Ls | RS 


Unadjusted 
Trials 1-4 | .25 | .12 | .21 | .19 
Trials 5-8 | .16 | .17 | .28 | .11 
Adjusted 
Trials 5-8 | .144 | .19 | .27 | 11 


Note.—LS = lapsed saint; RS = reformed sinner. 


_ their game behavior on the first 
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risk taking did not yield any significant dif- 
ferences. y 
Trials 9-30 Hn 

In order to examine the subsequent impact | 
of the strategies the game measures were " 
summed over Trials 9—30. 1 

Cooperation. The analysis of variance | 
yielded a significant main effect for groups 
(F = 4.29, df = 1/66, p < .05) for coopera- ] 
tion on Trials 9-30. The data indicated that — 
the patients were more cooperative (M= 
.98) than the students (M = T). 

Trust. The analysis of variance yielded a. 
significant main effect for groups (F = 4.23, 
df = 1/66, p < .05) for trust on Trials 9-30. | 
An examination of the means indicated that” 
the patients were more trusting (M = 89) 
than the students (M = .66). The interaction | 
Patients versus Students x Strategies ap“ 
proached significance (p < .10). The patients - 
displayed higher levels of trust with the lapsed 
saint (M = .99) than with the reformed sin- 
ner (M=.79). The students, on the other. 
hand, displayed higher levels of trust with. 
the reformed sinner (M — .78) than with the 
lapsed saint (M = .55). An examination of 
the means for each patient group indicated à 
greater differential response to the lapsed 
saint and reformed sinner for the nonpar "d 
noid schizophrenics (M = 1.04 versus :m 
respectively) than was true of the paranoid 
schizophrenics (M = .93 versus .87, respec 
tively). A 

Exploitation. The analysis of variance of 
the number of exploitative choices 
main effect for patients which approached 
significance (F = 3.0, df = 1/66, P € 10% 
The nonparanoid schizophrenics tend j 
higher in exploitation. (M — .55) than the 
paranoid schizophrenics (M — .36). 


Discussion 


Paranoid versus nonparanoid differences 


The two samples did not differ with Nes n 

rial. 
analysis of the effects of the abrupt change 
the confederate's strategy revealed Ee 
between the two patient groups. The A A 
saint produced higher levels of cooperatio 1 


1 oid 
the paranoid sample than in the nonparan 
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| sample. The reformed sinner produced lower 


levels of exploitation in the paranoid sample 


_ when compared with the nonparanoid sample. 


These initial reactions to the two strategies 
tended to be less salient during the remainder 
of the game (Trials 9-30). An apparent ex- 
planation stems from the merging of the two 
strategies into a similar strategy of condi- 
tional cooperation. Despite this, however, 
there were some subtle differences between 
the patient samples. The nonparanoids tended 
to be higher in exploitation than the para- 
noids, and were more responsive to the differ- 
ences between the two strategies. 

Perhaps the major issue at stake for the 
paranoid schizophrenics in the present experi- 
mental situation is not when and whom to 
trust, but rather a concern with the balance of 
power. Ovesey (1955) has proposed that in- 
ternalized conflicts with regard to achieving 
interpersonal power are a key feature of the 
paranoid orientation, The paranoids’ strategy 
is designed to test and redefine the existing 
balance of power. Thus, when the sinner is 
punitive and tough, the paranoid is docile 
and cooperative; with the sinner’s abrupt shift 
to a soft unconditional benevolence, the para- 
noids react by shifting in the direction of an 
exploitative and dominating strategy. With 
the saint the reverse is true. The paranoids 
are initially exploitative, but when the saint 
shifts to a strategy of punishing noncoopera- 
tion, the paranoids rapidly shift to a coopera- 
tive pattern. 

The game-playing strategies of the para- 
noids also fit the clinical description of the 
authoritarian personality (Adorno, Frenkel- 
Brunswik, Levinson, & Sanford, 1950). 
Adorno et al. described the authoritarian as a 
person who idealized strength and toughness 
but has contempt for weakness and tender- 
ness. They imply that the authoritarian is 
likely to be exploitative and domineering when 
he has greater power (or is not resisted), but 
submissive and ingratiating when in a subordi- 
nate relationship. The conception of the para- 
noid person as one who is conflicted with 
respect to power relationships has applicabil- 
ity to normal as well as schizophrenic sam- 


* ples. 


One key implication of these findings is 
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that the paranoid schizophrenic's interpersonal 
stance is not indiscriminate and can be modi- 
fied by variations in the strategy of the other 
party. In order to illuminate other clinical 
aspects of suspicion in paranoia, future re- 
search designs could systematically vary such 
interpersonal dimensions as the status, sexual 
identity, and perceived similarity of the other 
party. 

Patient versus student differences. The stu- 
dent sample was included in order to provide 
a comparative point of reference rather than 
as a control. The students represent a limit- 
ing case in the point of comparisons: Are the 
prisoner’s dilemma game and strategy manipu- 
lations sensitive to the differences in these 
two sample populations? The findings indi- 
cated three major differences between the com- 
bined patient sample and the students: (a) 
the students showed a higher level of reactiv- 
ity to the confederate's strategy shift than 
either of the patient samples; (b) the stu- 
dents showed higher levels of noncooperation 
than either of the patient samples; and (c) 
the reformed sinner tended to be more effec- 
tive in establishing a trusting relationship 
with the students than with, the patients. 
These findings contrast with the study by 
Knapp and Podell (1968) who did not obtain 
differences between psychotic patients and 
college students. The differences obtained in 
the present experiment were largely a function 
of (a) a more well-defined psychotic popula- 
tion and (b) type of strategy. Knapp and 
Podell (1968) pointed out that variations in 
cooperative strategies are more likely to reveal 
personality differences than the strictly com- 
petitive strategies. 

The finding that the patients as a group 
were more cooperative and trusting may re- 
flect the fact that the students adopted a more 
competitive set in the game. The students’ 
higher levels of reactivity to the strategy shift 
of a confederate suggests that they were more 
sensitive to interpersonal change in others. 
Moreover, they were selectively sensitive to a 
shift from a competitive to a cooperative 
strategy. Whether the differences between the 
student and patient samples can be attributed 
to the nature of the schizophrenic illness, or 
to other possible confounding variables such 
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as the effects of institutionalization (Goffman, 
1961; Silverman, 1965), social stigma (Goff- 
man, 1963), or the social psychology of the 
experimental situation (Rosenthal, 1964) can 
not be determined. 
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FALSE FEEDBACK AND LONGEVITY OF THE 
CONDITIONED GSR DURING EXTINCTION: 


SOME IMPLICATIONS FOR AVERSION THERAPY * 


KARL P. KOENIG? anp DAVID DEL CASTILLO 


University of New Mexico 


The effects of two cognitive variables on GSR extinction rates were investigated. 
Study I replicated earlier findings that Ss made aware of the onset of non- 
shocked trials exhibited extinction of a conditioned emotional response more 
rapidly than Ss not made aware. Studies I and II further demonstrated that 
extinction, even under conditions of awareness, can be significantly retarded 
when false feedback concerning the S's current emotional responsivity is sup- 


plied during extinction trials. 


Fundamental to most behavioral treatment 
of sexual deviance has been the association of 
aversion with stimuli thought to control sexual 
behavior (Feldman, 1966; Rachman, 1961). 
Thorpe and Schmidt (1964), for example, 
paired noxious shock with photographs of 
nude men in a case of male homesexuality. 
There have also been a number of attempts 
to elaborate the basic classical conditioning 
paradigm to include avoidance learning (Feld- 
man & MacCulloch, 1965), escape learning 
(Barker, 1965), and aversion-relief training 
(Fisher, 1968). Regardless of the several vari- 
ations, however, successful behavior modifica- 
tion must be presumed to depend upon the 
establishment and maintenance of a vigorous 
and negative emotional response to the 
conditioned sexual stimuli. 

The use of aversive conditioning as a 
therapeutic procedure rests on the assumption 
that the conditioned emotional response 
(CER) established in therapy will continue 
to resist extinction after S has left the treat- 
ment setting. In the absence of firm clinical 
evidence to the contrary, am alternative as- 
sumption is instead that the deviant client 
will learn to respond fearfully to sexual 
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stimuli when it is appropriate to do so (in 
the therapy setting) yet continue to respond 
nonanxiously in those circumstances where 
fear of aversive consequences would be in- 
appropriate (locales other than the clinic). 
One would expect humans to discriminate 
readily between those occasions with zero 
probability of aversive unconditioned stimulus 
(US) occurrence and those with a probability 
greater than zero, If the goal of aversion 
therapy is to generate specific conditioned fear 
under the control of particular sexual stimuli, 
but independent of other significant stimuli 
such as the presence or absence of shock 
electrodes, then present techniques would ap- 
pear inadequate for the task. Learned fear 
which persists across nonreinforced presenta- 
tions of the conditioned stimulus (CS) de- 
pends on the inaccessibility of precisely that 
information which enables S to make good 
probabilistic estimates of the occurrence of 
punishment. The variables suggested by Feld- 
man (1966) as useful in producing resistance 
to extinction (e.g., distributed versus 

trials) are likely to have a major effect only 
if Ss remain ignorant of the onset of non- 
reinforced trials. 

Empirical evidence is available which indi- 
cates that rapid, if not immediate, extinction 
of the CER occurs when Ss are informed that 
subsequent presentations of the CS are no 
longer to be followed by shock. The effect 
has been demonstrated in conditioned heart- 
rate studies (Chatterjee & Erikson, 1962; 
Notterman, Schoenfeld, & Bersh, 1952) and 
in GSR conditioning (Bridger & Mandel, 
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1965; Cook & Harris, 1937; Mowrer, 1938; 
Silverman, 1960; Wickens, Allen, & Hill, 
1963). Inasmuch as clinical patients are also 
“informed” (by implication) of when aver- 
sive events are probable and when not, 
investigators of therapeutic aversive condi- 
tioning should expect little resistance to 
extinction beyond the confines of the clinic. 

In view of the relationship between labora- 
tory studies of CER extinction and current 
therapeutic practices, the present investigation 
was designed to determine whether a CER 
could be maintained during extinction despite 
instructions which informed Ss of the perma- 
nent termination of shock following acqui- 
sition trials, The Ss also viewed, by means of 
a meter, experimentally manipulated false 
feedback represented as their emotional re- 
sponses to CS trials. Thus, each S was con- 
ceived to be serving as his own response- 
cuing model. Since it has been shown that 
emotional responses can be both acquired and 
extinguished through exposure to models 
(Barnett & Benedetti, 1960; Berger, 1962), 
it was predicted that Ss would continue to 
respond fearfully to a stimulus if supplied 
with evidence that they were already respond- 
ing in a fearful manner. 


Srupy I 
Method 


Subjects. The initial sample consisted of 97 under- 
graduates, Failure to meet either of two criteria 
resulted in the elimination of 17 Ss. Five did not 
meet the criterion of scorable records due either to 
apparatus failure or to erratic resistance changes 
which masked their responses. The remaining 12 
were eliminated for failure to respond above .1 log 
change in conductance to the test stimulus prior to 
extinction trials. The final sample consisted of 32 
women and 48 men assigned randomly to the eight 
experimental subgroups. 

Apparatus. A Lafayette dermohmeter (7601A) was 
used in conjunction with a Harvard Electronic Re- 
cording Module (350) to produce a continuous ink 
recording of resistance change. A second pen recorded 
time and stimulus events. The equipment, including 
timers, relays, and shock stimulator, was located in 
a control room connected by a two-way mirror to 
the S room. 

Nickel-plated finger electrodes were taped to S's 
first and third right hand fingers and connected to 
the dermohmeter. His right arm was secured by an 
elastic bandage to the arm of a recliner chair to 
reduce GSR artifacts. Shock was delivered to the 
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left hand in .5-sec. impulses in a range of .0-5.5 ma, 
through a 100 K © resistor in series with S. 

The S sat in a small but comfortable room. A 
headset and intercom permitted two-way communi- 
cation with E. The chair faced a panel of three jewel 
lights, 4.5 ft. away at eye level, and visible through 
the two-way mirror when illuminated. Lights were 
timed for 2-sec. durations. The reinforced light 
(positive conditioned stimulus, CS+) was orange, 
while the nonreinforced stimuli (negative conditioned 
stimuli, CSs—) consisted of two white lights equal 
in size and brightness to the CS+. The CS+ was 
easily discriminable from the CSs—. Also visible, 
when illuminated by E during extinction trials, was 
a 4X 3 in. black-on-white meter set 4 in, below the 
lights. Each needle movement of the meter started 
at zero and peaked halfway across the dial before . 
returning slowly to zero. 

Procedure. On arrival, S was seated, headphones 
and electrodes were attached, and instructions to 
remain very still were given. After apparatus adjust- 
ments were made the room lights were dimmed to 
enhance attention to the stimuli. The experiment 
consisted of the following four phases: d 

1. Adaptation. Ten positive conditioned stimuli 
(CSs+) and 20 CSs— were presented in a fixed- 
random order with a 25-sec. interval between 
presentations. f 

2. Shock determination. Each S was asked to assist 
in finding a barely tolerable level of shock, The 
individually determined level was then used through- 
out acquisition (obtained range — .5 to 5 ma;). 

3. Acquisition. Ten CSs+ and 20 CSs— were pre- 
sented in a new fixed-random order with the same 
intertrial interval. Each CS+ was followed immedi- 
ately by a shock of .5-sec. duration 100% of the 
time. After all 30 stimuli were presented, an eleven! 
CS+ without shock was presented to determine 
conditioning (test trial). 

4. Extinction. Twelve CSs+ and 24 CSs— 
presented in a new order with the same interval. | p 
shock was delivered at any time during extinction. 

No differential conditions were introduced until the 
extinction phase. tal, 

Information conditions. Following the test | 
but prior to extinction trials, one-half of the m 
(informed group) received instructions designed 
assure them that the remaining stimulus present” 
tions would not involve shock. Abstracted, instruc 
tions for the informed condition were as ie 
“I am coming in now to remove the shock electron 
From this point on, none of the lights will be n 
lowed by shock." The E joined S and removed 
electrodes. 

The remaining Ss received no indication that shock 
would be terminated. To control for E interru ats 
however, noninformed Ss were told that an adj 
ment was to be made in the amplifying system: suh 
E then entered the room and appeared to make 
an adjustment. f meter 

Meter conditions. The presence or absence 0 mu 
activity visible to Ss and different sets Of mE 
tions resulted in four variants of meter COP 


were d 
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These were balanced between the informed and non- 
informed conditions. Meter instructions were given 
following information instructions but prior to ex- 
tinction trials. Meter activity began immediately 
upon termination of each CS+ and was completed 
within 7 sec. after its inception. Meter activity 
followed only the first six extinction trial CSs+ and 
during this time the meter light was on constantly 
for all meter conditions which were the following: 

1. Meter-feedback groups. In addition to exposure 
to meter activity, instructions were given which are 
reproduced in part here. 


during this part of the experiment you will be 
able to see your own galvanic skin response to each 
light as it is presented. . .. If the needle rises 
above 10 units [one-eighth the possible distance] 
this will indicate you have given a fear or anxiety 
response to the light. 


The meter needle in all cases, of course, moved well 
beyond the mark signifying anxiety to S since it 
always peaked at approximately one-half the distance 
between O and 120. : 

2. Meter-task groups. Although the meter was 
manipulated exactly as in the above condition, the 
instructions differed: “you should pay attention to 
the meter as well as the lights because I will later 
ask you some questions about the activity of the 
meter needle." 

3. Meter-only groups. No instructions were given 
concerning the meter but the meter was operated 
as in Conditions 1 and 2. 

4. No-meter groups. Neither meter activity nor 
meter instructions were introduced to these groups. 

Groups under Conditions 1, 2, and 3 saw the same 
meter and, within limits of experimental error, the 
same meter activity. Only the instructions describing 
the purpose of the meter differed. The meter did not 
appear again after the last pairing of the meter with 
the sixth CS-- during extinction. No explanation for 
meter termination was offered. The purpose of termi- 
nating the meter after the sixth CS trial was to 
examine the transference of its effects to the last six 
CS trials. 


Results 


The GSR unit changes were converted from 
changes in resistance to log changes in con- 
ductance (log CCs) in a manner consistent 
with that suggested by Haggard (1949), 
Lacey and Siegel (1949), and Wickens et al. 
(1963). The raw measure consisted of the 
maximum resistance change (ohms) within 
an 8-sec. period after CS+ termination. 
Positive log CCs reflect decrements in skin 
resistance, A constant of one was added to 
each change value before the logarithmic 
transformation was made. The CS+ extinc- 
tion trials were grouped in six blocks of 
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TABLE 1 


Srupy I Anatysis or VARIANCE OF LOGARITHMIC 
CHANGE IN CONDUCTANCE BY INFORMATION AND 
METER CONDITIONS AND BLOCKS 


Source df MS F 


Information (I) 1 6.2794 17.946** 
Meter (M) 3 1.3022 3,722* 
IXM 3 0534 
Error 72 .3499 
Blocks (B) 5 1.0710 13.696"* 
BXI 5 3074 3.930** 
BXM 15 .0054 
BXIXM 15 .1522 1.946* 
Error 360 .0782 

* p < 025. 

** p <.005. 


two trials each for analysis and graphical 
presentation. 

An analysis of variance with repeated mea- 
sures was performed on the log CCs. The sum- 
mary of the analysis is shown in Table 1 and 
the associated functions are given in Figure 1. 
In addition, log CCs associated with the test 
trial alone were analyzed in order to test 
the assumption that no real differences existed 
among the experimental subgroups prior to 
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Fic, 1. Study I extinction curves (N=10 
in each group). 
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the introduction of experimental variables. 
Since no comparisons yielded an F greater 
than 1, the assumption of initial group com- 
parability was supported. A correlation of 
.51 (df = 78, p < .01) was obtained between 
test-trial responses to the CS and averaged 
responses to the extinction CSs. Therefore, it 
was of importance that no significant group 
differences existed initially. 

The significant effects for information and 
for Information X Blocks (see Table 1) sup- 
ported the prediction based on earlier studies 
that Ss made aware of the termination of 
shock would extinguish more rapidly and to a 
lower level than Ss not so informed (see Fig- 
ure 1). The log CC block means associated 
with the main effect were noninformed — .45 
and informed — .22. 

Although meter conditions resulted in a 
significant overall effect, it was not possible 
to interpret this finding without examining 
the individual group means. Duncan's multi- 
ple-range test (Edwards, 1960) was employed 
for this purpose. At the .05 level of signifi- 
cance, and within the informed-of-extinction 
condition, both the meter-task and the meter- 
only groups differed from the meter-feedback 
group in the expected direction. The no-meter 
group differed from the meter-feedback group 
in the predicted direction but only between 
the .10 and .05 levels of significance. The top 
drawing in Figure 1 shows that the meter- 
feedback group under the informed condition 
produced stronger conditioned responses than 
the no-meter group during the first three 
blocks of trials, but dropped steeply when the 
supply of false feedback was terminated prior 
to the last three blocks of trials. 

"Tests of individual group means within the 
noninformed conditions yielded effects similar 
to those within the informed condition: the 
meter-feedback group extinction rate was sig- 
nificantly retarded when compared with the 
meter-only and meter-task groups, but did 
not differ significantly from the no-meter con- 
dition. 

Support for the prediction concerning in- 
formation, and partial support for the ex- 
pected effects of meter feedback, stimulated 
the second study. The experiment reported in 
the following paragraphs was undertaken in 

order to test the prediction that meter feed- 
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back will result in slower extinction than a 
no-meter condition under a constant-informed- 
of-shock-termination condition, over extended 
acquisition and feedback trials. 


Stupy II 
Method 


Subjects. The initial sample consisted of 26 under- 
graduates enrolled in introductory psychology. Fail- 
ure to meet the criterion of scorable records led to 
the elimination of 3 Ss. Another 3 Ss were elimi- 
nated for failure to respond above .1 log CC to the 
test trial CS prior to extinction, The remaining 20 
Ss consisted of 11 men and 9 women assigned ran- 
domly to the two experimental conditions. 

Apparatus. The apparatus was identical to that in 
Study I. 

Procedure. Adaptation, acquisition, and extinction 
trials were conducted, as in Study I, but repeated on 
2 consecutive days for each S. On both days, prior 
to extinction trials, all Ss were informed that shock 
would no longer occur and the shock electrodes were 
removed. The no-meter group was not exposed to 
the meter on either day. The meter-feedback group, 
however, was shown the meter (with instructions 
identical to those used in Study I) following all 12 
CS-- presentations during extinction on Day 1, and 
following each of the first 6 CS+ presentations dur- 
ing extinction on Day 2. Thus, the following se 
quence was observed for all Ss. Day 1 consisted of 
10 CS+ and 20 CS— trials in fixed-random order 
during adaptation; 10 CS+ and 20 CS— trials in 
another fixed-random order during acquisition; and 
12 CS+ and 24 CS— trials in still another order 
during extinction. Day 2 followed the same pattern 
except that new fixed-random arrangements of the 
stimuli were generated. 


Results 


Skin resistance changes to the CS were com- 
puted and transformed into log CCs as de- 
scribed in Study I. Separate £ tests were per- 
formed on test-trial log CCs between the n0- 
meter and meter-feedback conditions for Day 
1 and Day 2. On neither day was there à Sig- 
nificant difference between the test-trial 
means. The absence of a test-trial difference 
between the two conditions on Day 1, prior f? 


the introduction of experimental treatments, 


supported the assumption that the two S sam- 
ples had been conditioned to equivalent levels. 

Analysis of variance with repeated me^ 
sures was performed separately for Day í 
and Day 2 data. The obtained Day 1F s 
4.13 (df = 1/18) for the main effect was "0 
statistically significant. The analysis of pe 
2 data, however, yielded an F of 7.55 ( j 
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Fic, 2. Study II extinction curves (N = 10 
in each group). 


1/18) which was acceptable beyond the .025 
level of significance. In neither analysis did 
the blocks or Blocks X Meter effects reach 
statistically significant levels. The results, 
compared with the no-meter and meter-feed- 
back means under the informed condition in 
Study I, supported the prediction that false 
GSR feedback would effectively retard ex- 
tinction when the number of feedback trials 
during extinction was increased. 

; The data are represented in Figure 2. Un- 
like meter feedback under the informed con- 
dition in Study I, the responses of compara- 
ble meter-feedback Ss in Study II did not 
show rapid decline in responsivity after the 
first three blocks of extinction trials. This 
was probably due to the fact that false meter 
feedback was continued across all extinction 
trials on Day 1. A relatively steep decline was 
noted on Day 2 over the first three blocks 
which: leveled off following discontinuation of 
feedback. 


DISCUSSION * 


Differential rates of extinction were ob- 
tained between informed and noninformed 
groups in Study I. The effect was in the di- 
rection predicted and was in general agree- 
ment with other experimental reports (e.g. 
Bridger & Mandel, 1965). Earlier in the paper 
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a parallel was drawn between experimental 
Ss aware of shock termination and clinic pa- 
tients who are also aware of when shock con- 
tingencies are likely to operate and when 
they are not. The finding attributed to infor- 
mation conditions in the present study, and 
the parallel described above, raises a serious 
question concerning the probable effectiveness 
of aversive therapy in maintaining a strong 
CER which is also resistant to extinction out- 
side the therapeutic setting. Nor is it immedi- 
ately apparent why instrumentally condi- 
tioned avoidance responses (vigorously pro- 
posed by Feldman, 1966) would be less sus- 
ceptible to rapid extinction in view of the 
average clinic S's ability to discriminate 
presence and absence of avoidance contingen- 
cies. The omission of systematic considera- 
tion of cognitive variables such as awareness 
seems to strengthen the recent criticism of 
conditioning therapies made by Breger and 
McGaugh (1965). 

False meter feedback, labeled explicitly for 
S as his own galvanic skin response to the 
previously shocked light, was shown to be an 
effective procedure for retarding extinction 
rates even in Ss aware of the onset of non- 
shocked trials. It will be recalled that dif- 
ferences between the false-feedback and no- 
meter conditions reached significance in the 
second study only with the addition of ex- 
tended acquisition and extinction trials. The 
requirement of additional trials to achieve a 
differential effect does not, however, seriously 
weaken the demonstration. Therapy by aver- 
sive conditioning has generally consisted of a 
series of alternated acquisition and extinction 
trials. Each time the patient leaves the office 
he is, in effect, entering extinction trials if 
either he imagines or physically encounters 
those stimuli to which aversion has recently 
been conditioned. An examination of the ex- 
tinction data in Study II indicates that re- 
peated exposure to nonshocked trials, even 
when interspersed with reinforced trials, led 
to successively diminished conditioned re- 
sponses in the absence of feedback, Con- 
versely, there appeared to be a lesser decline 
in the magnitude of the response during ex- 
tinction in meter-feedback Ss and an increase 
in response stability. Further research will be 
required in order to determine the feedback 
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format and values and the frequency of ac- 
quisition trials which tend to maximize re- 
sistance to extinction. 
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RECALL AND RECOGNITION FREE LEARNING 
IN SCHIZOPHRENICS * 
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Recall and recognition learning tasks were administered to 30 schizophrenic and 
30 nonschizophrenic psychiatric inpatients. The schizophrenics were inferior 


to the nonschizophrenics in the number 


of correct items reproduced on the 


recall tests (p< .05), but the two groups did not differ significantly in the 
number of correct items identified on the recognition tests. The schizophrenics 
made more intrusion errors on both the tests of recall (p< 01) and recog- 
nition (p< 05). Compared to the nonschizophrenics, the schizophrenics’ in- 
trusion errors were more marked on the recall than on the recognition tests: 
the Groups X Tasks interaction was significant (p<.01). The results were 
related to a theory of overinclusion in schizophrenia, previously developed to 
account for schizophrenic deficits in a communication task. 


Cameron (1938) described cognitive per- 
formance in schizophrenia as overinclusive. 
This is the notion that schizophrenics have 
difficulty maintaining the conceptual bound- 
aries of task situations; hence, responses in- 
appropriate to the task at hand are likely to 
intrude into the schizophrenic’s thinking and 
communication. As a consequence, thought 
and speech become less precise and coherent, 
more vague and fragmentary. 

There have been a number of attempts to 
explain the phenomena of overinclusion in 
schizophrenia by relating them to faulty in- 
formation processing mechanisms (Lang & 
Buss, 1965; Yates, 1966). Such models typi- 
cally construe schizophrenic patients as suf- 
fering from some defect in a hypothetical 
“screening” mechanism. Normally, this mech- 
anism functions to filter-out “noise” from in- 
ternal sources (irrelevant associations), or 
from external sources (irrelevant stimuli). 
Some specific models are those proposed by 
Payne, Matussek, and George (1959), Mc- 
Ghie and Chapman (1961), and Shakow 
(1962). 

A more recent model, also including the 


1 This research was supported by National Science 
Foundation Grant No. GS 723 for research on 
Verbal behavior as interpersonal communication. The 
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notion of a filter mechanism, is one proposed 
by Rosenberg and Cohen (1966) and applied 
to schizophrenic performance in a communi- 
cation task by Cohen and Camhi (1967). The 
Cohen and Camhi study was designed to 
compare the performance of schizophrenic and 
normal Ss in “speaker” and in "listener" 
roles. In the communication task, the speak- 
ers job was to provide clue words that 
would make it possible for his listener to dis- 
tinguish referent from nonreferent stimuli. 
The listener’s job was to choose the referent 
stimulus from each two-stimulus array, bas- 
ing his choices on the speaker’s clue words, 
Using as a measure the proportion of correct 
listener’s choices of the speaker’s referents, 
the results indicated that schizophrenic speak- 
ers were inferior to normals, but schizophrenic 
listeners approached the accuracy of normals. 
The communication deficit, therefore, ap- 
peared to rest mainly in the role of the 


speaker. 
Speaker and Listener Models 


What is there about the particular require- 
ments of the speaker’s task such that a schizo- 
phrenic deficit was found in the speaker role 
but not in the listener role? Rosenberg and 
Cohen (1966) proposed that the speaker role 
involves a two-stage process, while the listener 
role involves a one-stage process. According 
to this theory, the speaker’s role begins with 
a sampling stage in which a response from the 
speaker’s repertoire is selected randomly from 
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the distribution of possible clue words associ- 
ated with the referent stimulus alone. In the 
comparison stage, which always follows the 
sampling stage, the relative associative 
strengths linking each of the stimulus objects 
(referent and nonreferent) to the sampled 
response determines the probability that the 
speaker will decide to emit or reject the sam- 
pled response. If, as a result of the comparison 
stage, the sampled response is rejected as an 
inadequate clue word, the speaker again sam- 
ples a clue word and then repeats the com- 
parison process. This hypothetical sequence 
(sampling stage — comparison stage) contin- 
ues to recycle until a satisfactory clue word 
is decided on by the speaker, and he emits this 
word as a response. It is in the comparison 
stage that the speaker must reject ambiguous 
and misinformative associations. What is im- 
plicit in the concept of the comparison stage 
is its function as a filter mechanism. The 
speaker must reject clue words that will not 
have, for the listener, discriminably higher 
associative strength to the referent than to 
the nonreferent. The listener’s choice of the 
speaker’s referent was also assumed to be 
governed by the relative associative strengths 
of the speaker’s clue word to each of the two 
stimulus objects (the referent and the non- 
referent). 

It will be recalled that in the Cohen and 
Camhi study, the schizophrenics were found 
to be defective as speakers, but approached 
normal accuracy as listeners. When, according 
to the theory, sampling and comparison stages 
were both involved (speaker process), schizo- 
phrenic performance suffered. However, when 
only the comparison stage was thought to be 
operative (listener process), the schizophren- 
ics showed no significant inadequacy. Cohen 
and Camhi (1967) speculated that the specific 
weakness of the schizophrenic was in his 
ability to make comparison stage decisions in 
the speaker role, but not in the listener role. 
They explained this apparent contradiction in 
the following way: 

Faulty comparison in the speaker role . . . may 
reflect deficiencies in the ability of schizophrenics to 
integrate the comparison with the sampling stage of 
the two-stage speaker process. Thus, comparison 
combined with sampling (speaker comparison) may 
be defective, even though comparison alone (listener 
comparison) remains relatively unimpaired [p. 245]. 
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Recall and Recognition 


The speaker and listener tasks are analo- 
gous, respectively, to the traditional recall and 
recognition tests used in studies of rote learn- 
ing. The essential operational difference be- 
tween tests of recall and recognition is that in 
recall S is required to reproduce items from a 
previously presented training list, while in 
recognition S is required only to judge whether 
items, produced for him by E, were or were 
not on the previously presented training list. 

Rosenberg and Cohen (1966) suggested 
that performance on recall tasks is determined 
by a two-stage process similar to the speaker 
model, while performance on recognition 
tasks is determined by a one-stage process 
similar to the listener model. In recall, S first 
samples an item from an underlying set that 
consists of list and extralist subsets. The $ 
then goes on to the comparison stage which 
determines the probability that the sampled 
item will be emitted as a response. In the 
comparison stage, S decides whether the sam- 
pled item was, or was not, in the list subset. 
Hypothetically, this is done by “comparing 
the associative strengths of the sampled items 
with a constant representing à threshold 
value of associative strength [p. 228].” In 
recognition, only the comparison stage is pe 
volved, as E himself performs the sampling 
for S. 

One might expect, therefore, that schizo- 
phrenics will show deficit on recall tasks but 
not on recognition, because recall, like the 
speaker task, presumably involves an inte- 
gration of both sampling and comparison 
stages; while recognition, like the listener 
task, only involves a comparison stage. e 
schizophrenic’s faulty ability to make com- 
parison stage decisions in the two-stage i 
task implies that his judgments of an items 
list membership will be more unreliable than 
such judgments by a nonschizophrenic 
This can result in (a) overexclusion (Chap- 
man, 1961): frequent erroneous decisions t 
reject a correct item—leading to lowered p 
rect response scores; and (5) overinclusion: 
frequent failures to reject incorrect, extr i 
items—leading to increased intrusion ee 
scores. In contrast, neither of these effects a 
expected on the one-stage recognition task. 


| 


LEARNING IN SCHIZOPHRENIA 


Although no research has been conducted 
which has directly compared recognition ver- 
sus recall tests of learning in schizophrenics, 
Spence, Goodstein, and Lair (1965) observed 
that schizophrenics performing a recognition 
task were less distracted by verbal reinforce- 
ment than were schizophrenics performing a 
recall task. Spence et al. suggested that the 
recognition task demanded less active effort 
and less sustained concentration, and could, 
therefore, have made the schizophrenic Ss’ 
performance less vulnerable to any distracting 
effects the reinforcers may have had. 

The purpose of the present study was to 
provide data comparing schizophrenic and 
nonschizophrenic Ss on recall and recognition 
tests of retention in a free learning task. 


METHOD 
Test Materials 


The items for the recall and recognition tasks 
were obtained in the following manner. Three hun- 
dred Rutgers undergraduates, all males, in intro- 
ductory psychology classes, were asked to list within 
a 60-sec. period, all of the items they could think of 
in each of two categories: one was “states” of the 
United States and the other was “edible plants.” 
This was done to establish frequency norms for two 
conceptual sets. In all, 50 items (the 50 states) 
were obtained for the states category, and 54 items 
for the edible plants category. The percentage of 
students listing each item was tabulated for the 
items of both distributions. The distributions were 
then divided into deciles, and 15 items were selected 
from each distribution to make up the two training 
lists used in the study; for each conceptual set, one 
item was randomly selected from each of the 10 
deciles, while the remaining 5 items were randomly 
selected from the middle three deciles (the fourth 
through sixth). 


Subjects 


The Ss were 30 schizophrenic and 30 nonschizo- 
phrenic male psychiatric inpatients at the Veterans’ 
Administration Hospital in East Orange, New Jersey. 
Diagnoses for all of the patients were obtained at 
psychiatric staff conferences approximately 1 wk. 
after the patient was admitted to the hospital. The 
two groups of patients were comparable in age and 
education (see Table 1) as well as medication. All 
of the Ss were being treated with comparable dos- 
ages of Mellaril (dose to body weight). No S had a 
history of more than three psychiatric hospitaliza- 
tions since discharge from military service. All of the 
schizophrenics had a history of psychotic symptoms 
since military service. The schizophrenics were non- 
Paranoid, chronic, undifferentiated. The nonschizo- 
Phrenic group was composed of Ss diagnosed as 
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TABLE 1 


AGE AND EDUCATION OF THE SCHIZOPHRENIC 
AND NONSCHIZOPHRENIC GROUPS 


Age Education 
Group 


Schizophrenic 321 | 48 10.1 1.6 
Nonscite eoe 33.9 5.9 11.8 19 


Note.—N = 30 for each group. 


anxiety neurotics with no indication of psychotic 
symptoms. 


Procedure 


Training. The S was told that he would see a list 
of words, one at a time, As each word appeared, he 
was instructed to say it aloud and try to remember 
it. Each word was presented for 2 sec, with a 1.5- 
sec. interstimulus interval. After a practice period 
with four unrelated words, each of the 15 items of a 
given list was presented on a Stowe memory drum, 
Immediately after the list was presented S was given 
either a recall or a recognition test. During any 
single experimental session, the same test, recall or 
recognition, was administered for a total of three 
trials; each trial followed presentation of the same 
set of 15 items, but in a different random order. 

Recall tests. Immediately following list presenta- 
tion, S was asked to say out loud as many of the 
words on the list as he could remember, He was 
given 2 min. to do so. The E recorded S's responses 
verbatim. Each subsequent trial was begun at the 
end of the 2-min. time limit. 

Recognition tests. Following list presentation, S 
was given a 90-item printed page which contained 
all (50 or 54) of the items from the category used 
to make up the training list, and the remainder 
consisted of unrelated items (nonset items), such as 
rabbit, Cadillac, violin, rust, china, oak, etc. The S. 
was instructed to check off all of the items which he 
remembered as having been on the líst. Again he was 
given 2 min. to do so, after which the next trial 


was begun. 
Experimental Design 


Half of the Ss in each diagnostic group was given 
the recognition task first, and 1 wk. later the recall 
task. The remaining Ss were given the tasks in 
reverse order. For the first session, half of the Ss in 
each experimental condition (recall or recognition) 
was given items from one of the conceptual sets, 
while the other half was given the alternative con- 
ceptual set. One week later, in the second session, 
when S took the other experimental test, he learned 
the alternate conceptual set. Thus, if S was first 
exposed to a recognition procedure with edible plants 
as the conceptual set, the following week he would 
take the recall procedure with states as the conceptual 
set. Each S was given three trials per list, with either 
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the recognition or the recall test administered after 
each trial. The results were analyzed by a within-Ss 
factorial analysis of variance. 


RESULTS 


Correct responses. As shown in Figure 1, 
the results for the recall tests indicated that 
the schizophrenics produced, on the average, 
appreciably fewer correct responses, than the 
nonschizophrenics. This difference was signifi- 
cant (F = 4.05, df = 1/56, p < .05). For the 
recognition tests, however, although the direc- 
tion of the obtained difference was the same 
as that found on recall, the difference between 
the two groups was much smaller and non- 
significant (F = .38, df = 1/56). 

On each task, the improvement with trials 
was significant and the same for both groups: 
(recall, F — 50.27, df — 2/112, p< .005; 
recognition, F = 56.32, df = 2/112, p < .005). 
That is, the schizophrenics learned as rapidly 
as the nonschizophrenics, as measured by both 
the recall and recognition tests. This can be 
seen in Figure 1, in the parallel curves for 
the two groups, with both tasks. Except for 
the obvious task effect (recognition being 

. superior to recall), no other significant effects 
occurred, such as those due to different lists 
(conceptual sets) or order of task presentation. 

Thus far the results for correct responses 
are clearly in line with the predictions gen- 
erated by the main experimental hypothesis: 
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The schizophrenics were inferior on the recall 
task, but approached the nonschizophrenics’ 
accuracy level on the recognition task.* 

Intrusion errors. All overt, incorrect re- 
sponses were considered intrusion errors and 
they were examined with respect to two cate- 
gories: intraset intrusions (items from the 
same set as those which appeared on the 
learning list) and extraset intrusions (items 
unrelated, in terms of conceptual set, to those 
which occurred on the learning list). 

On both tasks there were significant differ- 
ences in the frequency of intraset intrusions, 
between the schizophrenics and the nonschizo- 
phrenics. That is, the schizophrenics made 
more intraset intrusions than the nonschizo- 
phrenics on recall (F = 34.61, df — 1/56, 
p < 01), and on recognition (F = 5.23, af= 
1/56, p < .05). As indicated in Figure 2, how- 
ever, the increment in intrusion errors by the 
schizophrenics was considerably larger on the 
recall than on the recognition task; the 
Tasks X Groups interaction was significant 
(F = 8.63, df = 1/56, p < .01). 

Another feature of these intraset intrusion 
error results was the finding of a significant 
Groups X Trials interaction for the recogni- 
tion task (F = 3.33, df = 2/112, p < 09). 
This indicates that with training, the schizo- 
phrenics approached the smaller number of 
recognition intrusions made by the nonschizo- 
phrenics. As indicated in Figure 2, by the 
third trial the schizophrenics’ recognition m- 
trusions were almost at the level of the non- 
schizophrenics. 3 

Extraset intrusions occurred so rarely 1 
either group that detailed statistical compati- 
sons were not feasible. On the recall 


8 The differential power of the recall and recogni- 
tion tasks to separate the two diagnostic groups 
might be explained, without recourse to the m i 
outlined earlier in this paper, if the two tasks Mr 
not of equivalent difficulty. That is, the 


ognition task to some optimum level for a 3 p 
task, were 
trial. 

intertrial im 
though the scores showed steady inter! ai 1). 
This suggests that the between-group 
were independent of difficulty level, at Teas 
the limits of the observed ranges. 
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the mean extraset intrusions for the schizo- 


phrenics was .30 on Trial 1, .40 on Trial 2, 
and .16 on Trial 3; the nonschizophrenics 
gave no extraset intrusions on any of the 
recall trials. On the recognition task, the mean 
extraset intrusions for the schizophrenics was 
.06 on Trial 1, .10 on Trial 2, and .00 on 
Trial 3, while the nonschizophrenics produced 
means of .03 on Trial 1, .00 on Trial 2, and 
10 on Trial 3. 

One schizophrenic S checked off every item 
on the recognition list. His results were ex- 
cluded because it was apparent that he was 
not paying attention to the instructions. In 
all other instances, Ss made so small a propor- 
tion of extraset intrusions that it was con- 
cluded that they were attending to the task, 
that is, not responding indiscriminately. 


Discussion 


The results for correct responses parallel 
those reported by Cohen and Camhi (1967) 
for communication accuracy. They found that 
schizophrenics were significantly inferior to 
normals in the speaker role, but not in the 
listener role. Similarly, the present findings for 
Correct responses showed the schizophrenics to 
be inferior to the nonschizophrenics on the 
recall task, but not on the recognition task. 

Using as a frame of reference the two-stage 
recall model outlined earlier in this paper, 
these findings are consistent with the interpre- 
tation that schizophrenic Ss are defective in 
the ability to make reliable comparison-stage 
judgments when the task requires them to 
integrate comparison with the sampling stage 
of the two-stage response selection process 
posited for recall. As predicted, overexclusion 
(significantly lower correct response scores 
for the schizophrenic Ss) was found on the 
recall tests, and not on the recognition 
tests. The results for overinclusion were also 
in the predicted direction insofar as the in- 
Creased schizophrenic tendency to make intru- 
Sion errors was significantly more prominent 
on the recall than on the recognition tests. 

The fact that some degree of schizophrenic 
Overinclusion was found on the recognition 
tests, however, does not fully conform to the 
theory, According to the theory, the schizo- 
Phrenics were expected to make no more in- 
trusion errors on the recognition tests than 
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the nonschizophrenics. A more adequate 
theory would have to account for the moder- 
ate but nonetheless significantly larger number 
of schizophrenic intrusion errors found on the 
recognition tests. 

A theory recently proposed by Broen and 
Storms (1966) would seem to account for the 
prevalence of schizophrenic intrusion errors on 
both the recognition and recall tasks. Their 
theory, a variation of one proposed earlier 
by Mednick (1958), postulates “heightened 
arousal” in schizophrenics resulting in in- 
creased strengths of normally weak, non- 
dominant responses. The Broen and Storms 
theory is, however, inconsistent with the de- 
crease in intrusion errors over trials seen in 
the present experiment on both recall and 
recognition tasks, and does not account for 
the differential schizophrenic deficit on the 
recall versus the recognition tasks. 
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Reported anomalies in verbal communication by parents of schizophrenics 
were investigated. Content structured speech samples from 15 sets of parents 
of good premorbid (GPM) and poor premorbid (PPM) schizophrenic daughters 
and nonpsychiatric controls (NPC) were obtained. These samples were analyzed 
by content area (neutral, aggression, dependency, and sex) for communicability, 
amount of speech anxiety, and source pathology. Communicativeness was 
assessed by cloze procedure using male and female college raters. Contrary 
to prediction, PPM parents tended to have the least amount of speech 
anxiety and the highest predictability, whereas NPC parents tended to have 
the highest speech anxiety and lowest predictability. However, raters tended 
to correctly identify which parents had the most disturbed offspring. 


The etiological role of families in psycho- 
pathology remains controversial (e.g., Frank, 
1965; Rimland, 1966; Zuckerman, 1966). 
Since Bateson, Jackson, Haley, and Weak- 
land (1956) postulated the “double-bind” 
hypothesis, distorted parental communication 
has been increasingly implicated as an impor- 
tant etiological factor in schizophrenia. While 
the tenability of the double-bind hypothesis 
has been seriously questioned (Schuham, 
1967), subsequent investigations of cognitive 
and communicative aberrations in parents of 
schizophrenics have yielded reasonably prom- 
ising results. Consistently positive indications 
of deviant thought processes in parents of 
schizophrenics have been derived from analy- 
ses of parental Rorschachs and Thematic 
Apperception Tests (e.g, Meyer & Karon, 
1967; Singer & Wynne, 1963, 1966), and 
application of these verbal cognitive analyses 
to object-sorting tasks have likewise yielded 
encouraging results (Wild, Singer, Rosman, 
Ricci, & Lidz, 1965). 


1 This investigation was supported in part by a 
grant from the Psychiatric Training and Research 
Fund of the Illinois Department of Mental Health, 
and in part by State of Washington Initiative 171 
Funds for Research in Biology and Medicine. Ap- 
preciation is expressed to Kashinath D. Patel, Sta- 
tistical Consultant, University of Washington Primate 
Center, for guidance in the analyses of these data. 

2 Requests for reprints should be sent to Joseph 
Becker, Department of Psychiatry, University of 
Nb e School of Medicine, Seattle, Washington 
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Investigators of cognitive deviance in 
parents of schizophrenics contend that vul- 
nerability to schizophrenia is partially medi- 
ated by defective symbolic functioning, the 
core feature of schizophrenic disorders. Par- 
ents of schizophrenics are reputed to have 
disturbed symbolic operations complementary 
to their disturbed offspring. More specifically, 
the verbalizations of parents of schizophrenics 
have been characterized as (a) lacking clo- 
sure—speech is fragmented, unintelligible, in- 
definite, contradictory, (5) disrupted—sur- 
feited with irrelevancies, and non sequiturs, 
and (c) peculiar—idiosyncratic logic, and 
usage (Singer & Wynne, 1966). These parental 
anomalies are assumed to impair the develop- 
ment of a capacity for sustained attention, 
shared meaning, and effective communication 
within the preschizophrenic child. Singer and 
Wynne (1966) assert that deviant verbaliza- 
tions in parents of schizophrenics are not test 
specific, but are readily identifiable in “almost 
any verbal material from [schizophrenic] 
parents [p. 262],” an assumption supported 
by Morris and Wynne’s (1965) postdictive 
study of family therapy recordings. 

The present study investigated the com- 
municativeness of speech from parents of 
schizophrenics by using cloze procedure 
(Osgood, 1959, pp. 78-88; Taylor, 1954). 
This procedure involves substituting blanks 
for words in verbal messages and requiring 
receivers to supply the missing words. Cloze 
procedure is reputed to measure the corre- 
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spondence between the total language systems 
of its source and receivers. Osgood (1959) 
stated that “Cloze works where other methods 
won't... because it is sensitive to just 
about all of the determinants (semantic, asso- 
ciational, grammatical, and syntactical) which 
affect encoding by a source and decoding by 
a receiver [p. 82]." Osgood (1959, p. 88) 
has speculated that cloze procedure might be 
a useful tool for assessing thought disturb- 
ance. Cloze procedure has discriminated mean- 
ingfully in a number of studies related to 
personality disturbance (e.g, Feldstein & 
Jaffee, 1963; Osgood & Walker, 1959; Platz 
& Honigfeld, 1965; Salzinger, Portnoy, & 
Feldman, 1963). 

Neither previous empirical findings with 
cloze procedure, nor their theoretical rationale, 
provided a firm basis for predictions. Osgood 
and Walker (1959), in effect, postulate an 
inverted curvilinear relation between the drive 
state of the speech source and the intelligibil- 
ity of his speech. Moderately high drive 
maximizes the probability of occurrence of 
dominant responses within the source’s speech 
hierarchy resulting in greater stereotypy and 
intelligibility. Extremely high drive pushes 
multiple responses within the hierarchies to 
asymptotic strength thus producing blocking, 
interference, disorganization, and lower cloze 
scores. Low drive produces flexibility and 

` variability, hence less redundancy, and lower 
cloze scores. To partially assess the drive 
state associated with parental speech in the 
present study, nonlexical speech anxiety levels 
were determined (Mahl, 1959). 

Wynne et al.’s descriptions of the verbaliza- 
tions of parents of schizophrenics seem analo- 
gous to the disorganization Osgood posits for 
high-drive states. Furthermore, if Wynne et 
al's assumed complementarity between the 
speech of parents of schizophrenics and their 
psychotically diagnosed offspring obtains, then 
the finding of lower cloze scores for schizo- 
prenic sources by Salzinger et al. (1963) 
would lead to a similar prediction for their 
parents. ; 


METHOD 
Subjects 


` Patient criteria required that Ss be cooperative, 
hospitalized, Caucasian females aged 17-40, non- 


retarded or brain damaged, diagnosed as schizo- 
phrenic with both biological, native-born parents 
willing to participate. Parents were below age 68, 
and in reasonably good health. Nonpsychiatric con- 
trol families (NPC) met the same criteria as pa- 
tient families, but reported no history of serious 
personality disturbance. Controls were obtained 
through local ministers, and received $10.00. Most 
patient and control families came from large urban 
areas (Chicago or Seattle), and were of British, 
German, or Scandinavian background as judged by 
surnames. 

As previously described (Becker & Siefkes, 1969), 
parents of patients were split into good and poor 
premorbid groups (GPM and PPM, respectively) on 
the basis of their daughters! scoring on the Phillips 
Premorbid Scale (Farina, Garmezy, Zalusky, & 
Becker, 1962), and the Social Competence Scale 
(Phillips & Zigler, 1961). 

Median length of patient hospitalization was 2 mo. 
The sample included 15 pairs of GPM, 15 pairs of 
PPM, and 15 NPC parents. Parent groups differed 
significantly (p< 05) in age, education, and esti- 
mated IQ (Wechsler Adult Intelligence Scale, WAIS, 
Vocabulary subtest), but not in socioeconomic class 
(Hollingshead, 1957). The PPM parents had less 
education and lower vocabulary scores than GPM 
and NPC parents. 


Procedure 


Separately and in private, each parent was asked 
to tape-record 4 min. of speech in each of four 
content areas. The first area was a relatively neutral 
unstructured one; parents spoke on any aspect of 
family life they cared to. Instructions were: 


I'm going to ask you to do something that may 
be a little difficult, but I'd like you to do it as 
well as you can. We're interested in what sort of 
ideas occur to people about their family life with- 
out thinking it over. For four minutes, I’d like 
you to speak whatever thoughts come to your 
mind about your family life, I will listen and jot 
down some notes as you speak. I would like for 
you to do all the talking during this four minutes. 
In order not to influence your stream of thoughts, 
I will try to have as little facial expression as 
possible. 


The other three content areas (dependency, aggres- 
sion, and sex) related to childhood socialization prac- 
tices with their schizophrenic (or control) child. 
Order of presentation of the socialization content 
areas was counterbalanced. As a preface to the 
Socialization content areas the following introduction 
was given: 


Okay, this time Pd like to hear you talk for about 
four minutes about how you handled some more 


*Some parents of the schizophrenic patients had 
been previously hospitalized with schizophrenic diag- 
noses. None of the parents was overtly psychotic 
when this study occurred, and none was excluded 
on this basis. 
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general problems that all parents have in raising 
their children. Let’s take the problem of how you 
dealt with [first name of patient or control] 
during her first few years of life when you were 
[dependency] trying to get her to show her 
wishes for help, attention, and affection in an 
acceptable way, [sex] trying to teach her about 
sexual matters such as sex play with either herself 
or other children, and about going around with 
clothes on, [aggression] trying to get her to control 
her anger, destructiveness, and fighting. 


After each 4-min. section, E would say, “Alright, 
fine. Now how did you deal with [daughter's name] 
when you were... .” 


Preparation of Cloze Protocols 


The first 150 words of each speech sample were 
transcribed with every fifth word deleted and re- 
placed by a line of standard length. All nonlexical 
verbal expressions were included in the protocol but 
not counted as words. Since previous work (Becker, 
Tatsuoka, & Carlson, 1965) had suggested that sex 
of rater might interact with sex of speech source, 
two sets of 30 protocol booklets, each containing 
12 protocols, were compiled. One set of 30 booklets 
was administered to male raters, the other set to 
female raters. Speech protocols were randomly as- 
signed to booklets within sets except that half of the 
protocols in each booklet came from each sexed 
parent. 


Raters and Instructions 


Basic raters were 30 male and 30 female University 
of Washington enrollees in introductory psychology 
who were drawn randomly from the S pool. A 
female graduate research assistant met with student 
raters in groups of 8-12. Rater groups were 
instructed as follows: 


This study is part of a larger one concerned 
with the language of parents who have or do not 
have emotionally disturbed offspring. Your booklet 
contains twelve pages. On each page there is a 
sample of speech from a different parent. Approxi- 
mately every fifth word in each sample has been 
deleted and replaced by a blank of uniform length. 
Vour task is to guess what the missing word is 
and to write your guess in the blank space. No 
blanks were substituted for proper nouns, numbers, 
Or exclamations. All proper nouns have been 
Changed to preserve confidentiality. Contractions 
like “don’t” have been counted as one word in 
substituting blanks. A series of dashes in paren- 
theses (- — —) indicate that the tape recording 
Of the parent's speech was unintelligible at that 
Doint. Each dash indicates about one syllable of 
missing speech. à 

Before attempting to fill in the blanks on a page, 
read the entire page first. This will provide a feel 
for the person's language habits. You will have 
five minutes for each page which is quite ample. 
It is very important that every blank be filled in. 
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One final request—at the bottom of each sheet 
you will notice a rating scale like so: 
Please check the one category that you think 
is most likely correct: 
"This parent has a severely disturbed child. 
This parent has a moderately disturbed 
child. 
This parent probably does not have a dis- 
turbed child. 
Please fill in this scale for each sheet. 
When you have completed each sheet, please do 
not go on to the next one until asked to do so by 
the investigator. 


For a protocol to be accepted as suitably rated, at 
least 12 of the 15 blanks had to be filled in; and 
for a booklet to be accepted, 10 of the 12 protocols 
had to meet the criterion. These criteria were adopted 
to help ensure adequate rater motivation. Three 
booklets and 7 additional protocols failed to meet 
the criteria. These “failures” seemed randomly dis- 
tributed over the protocols. Additional raters were 
obtained to fulfill the requisite number of rated 
protocols by each sex. 


RESULTS 
Cloze Analyses 


Arcsin transformations were made on the 
raw score cloze proportions. To determine 
whether covariance adjustments were indi- 
cated, correlations were determined for each 
sexed parent between each of the supplemen- 
tary variables on which the parents differed 
(age, education, and WAIS Vocabulary), and 
the cloze scores for each of the speech content 
areas. Since only 2 correlations of 48 were 
significant (p < .05), covariance adjustments 
were not used. Likewise, Mahl speech anxiety 
scores were unrelated to cloze scores, Data 
for the cloze scores of each content .area 
were analyzed separately by partially nested 
repeated-measures analyses of variance. This 
design permitted testing of group effects, and 
Sex of Parent X Group interactions. Com- 
parisons of ratings by male versus female 
raters and of mothers versus fathers were 
made by ¢ tests. Results of the analyses of 
variance are presented in Table 1. Mean cloze 
scores are depicted in Figure 1. 

Overall findings provided no support for 
the predicted low cloze scores of PPM parents. 
On the contrary, PPM mothers consistently 
obtained the highest cloze scores and NPC 
mothers the lowest. Significant interactions 
occurred for all socialization content areas. 
These interactions were chiefly due to the 
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TABLE 1 
ANALYSES OF VARIANCE OF CLOZE SCORES 


Content area 


Source df Neutral Aggression Dependency Sex 
MS F MS F MS F MS F 
Groups (A) 2 | 243| —* |320.01|4.975* | 130.77 | 1.86 2178| — 
F amilies within groups (B in A) 42 | 63.43 64.36 70.28 94.65 
No. parents within family (Cin B) 45 | 58.63 | 1.33 | 56.06 | 3.235*** | 77.83 | 2.85b*** | 111.76 | 2.08>** 
No. raters within parent (D in C) 90 | 44.17 17.35 27.32 53.80 


a MS for A divided by B in A. 
b MS for C in B divided by}D in C. 
*p «.025. 

+> < 01. 

p < 001. 


relatively high cloze scores of PPM mothers 
and the relatively low scores of their hus- 
bands. Male and female raters’ scores did not 
differ for either parent in any content area. 

Neutral speech. There were no significant 
group differences. 

Aggression speech. Duncan multiple-range 
tests yielded lower cloze scores for combined 
NPC parents than for GPM (p< .05) and 
PPM parents (p < .01). Among the mothers’ 
groups, NPC groups were lower than PPM 
groups (p< .05); and among the fathers’ 


groups, GPMs were higher than NPC 
A. Neutral B. Aggression 
a 
50. NS 
48 ‘e--- 
46: 
Omo mother 
44. e-- e fother 


C. Dependency 


NPC GPM PPM 
i 
3 


NPC GPM PPM NPC GPM PPM 


Fic. 1, Mean cloze scores for mothers and fathers of 
NPC, GPM, and PPM daughters. 


(p < .05) and PPM groups (p< .05). Al- 
though the interaction (p < .001) is largely 
due to differences between the PPM mother 
and father groups, these two groups did 
not differ significantly between themselves 
(t — 1.11, df = 14). 

Dependency speech. 'The trends of results 
were similar to those for aggression with the 
interaction (p < .01) due mainly to the diver- 
gence between PPM mothers and fathers 
(t = 2.15, df = 14, p < .05). There were no 
differences among the mother groups or 
among the father groups by Duncan multiple- 
range test. 

Sex speech. While there was no diagnostic 
group difference in cloze scores for sex, 
mothers overall had higher scores (¢ = 2.41, 
df = 88, p < 05) than fathers. The inter- 
action (p < .01) was due to the tendency 
of NPC and PPM mothers to have higher 
scores than NPC and PPM fathers, whereas 
GPM parents were similar. 


Mahi Analyses 


To determine interrater reliability, 12 of 
the speech protocols were randomly selected 
and rescored blindly by an independent rater. 
A Spearman rank correlation rho of .95 was 
obtained. Intercorrelations between the Mahl 
and concomitant variables yielded no signifi- 
cant correlations (p < .05), hence no covari- 
ance adjustments were used with the two- 
factor mixed design with repeated measures 
(Groups x Parents). Means and standard de- 
viations of the groups for each content area 
are presented in Table 2. No significant 
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within-Ss differences were obtained either be- 
tween mothers and fathers or as an inter- 
action between parents and groups. The PPM 
parents, especially the fathers, consistently 
tended to show least speech anxiety and NPC 
parents most. Significant between-group 
differences were found for neutral speech 
(F = 6.03, df = 2/42, p < .01) and depen- 
dency speech (F = 3.38, df = 2/42, p < .05). 

Neutral speech. Both NPC and GPM 
parents evidenced more speech anxiety than 
the PPM group (? < .01) by Newman-Keuls 
test. Whereas the mother groups did not dif- 
fer among themselves, the father groups did, 
with both NPC and GPM fathers showing 
more speech anxiety than PPM fathers 
(p < .005 and « .05, respectively) by Duncan 
multiple-range test. 

Aggression speech. A Newman-Keuls com- 
parison of parent groups yielded lower speech 
anxiety for PPM than NPC groups (? < .05). 
Again mothers did not differ, but NPC and 
GPM fathers had more speech anxiety than 
PPM fathers (p < .005 and < .01, respec- 
tively) by Duncan multiple-range test. 

Dependency speech. Nonpsychiatric control 
parents were more anxious than PPM parents 
(p < .05), but separate comparisons among 
mother groups and among father groups 
yielded no differences. 

Sex speech. There were no differences be- 
tween groups. 


Severity of Illness Ratings 


Despite the PPM parents’ trend toward 
higher cloze scores and lower speech anxiety, 
raters tended to infer correctly that PPM 
parents had reared the most disturbed chil- 
dren. Neither male versus female raters' nor 
mothers’ versus fathers’ scores differed. Par- 
tially nested repeated measures analyses of 
variance were used. For neutral speech, groups 
differed (F = 4.16, df = 2/42, p < .025) with 
combined NPC groups rated as having less 
disturbed children than either GPM or PPM 
parents (p < .05); the latter two groups did 
not differ. The same trend appeared for ag- 
gressive and dependency speech, but were not 
statistically significant. Statistically significant 
but small and inconsistently patterned inter- 
actions occurred for Sex of Parent X Diag- 
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TABLE 2 
MAHL Anxrety MEANS AND STANDARD DEVIATIONS 
Mothers Fathers 
Content area 
m SD x sD 
Neutral 
NPC 10.05 3.89 11.90 3.23 
GPM 10.92 2.77 10.57 3:32 
PPM 9.57 2.89 8.07 2.82 
Aggression 
NPC 11.10 4.88 13.75 3.73 
GPM 11.64 443 10.75 2.10 
PPM 10.67 2.92 9.79 2.49 
Dependency 
NPC 12.75 3.14 12.87 4.99 
GPM 12.63 2.81 11.47 3.71 
Bae 10.63 3.80 10.57 2.99 
NPC 12.50 3.81 13.50 2.88 
GPM 12.47 2.68 12.73 3.63 
PPM 11.53 4.49 11.83 3.17 


Note.—NPC = nonpsychlatric controls; GPM = good pre- 
morbids; PPM = poor premorbids. 


nostic Group for both aggressive (F = 1.64, 
df = 45/90, p < .05) and dependency speech 
(F = 1.41, df = 45/90, p < .05). 


Discussion 


The findings of relatively high speech pre- 
dictability in PPM mothers and low anxiety 
in PPM mothers and fathers were contrary to 
expectation. Probably the high cloze scores of 
PPM mothers were due to considerable stero- 
typy which Osgood associates with high drive 
state (D) as construed within the Spence- 
Hull framework. The findings of the present 
study indicate that either Mahl's speech anx- 
iety measure is not coordinate with D or 
that Osgood’s theory requires modification. 

Post hoc, earlier work of Wynne’s (Wynne, 
Ryckoff, Day, & Hirsch, 1958) group on the 
concept of pseudomutuality could be recon- 
ciled with an expectancy of higher cloze scores 
in PPM parents. They emphasize their lack 
of spontaneity and novelty, their excessive 
use of bland indiscriminate approval, their 
massive use of denial, and constricted pur- 
view. Further support for greater constriction 
and stereotypy in PPM mothers than fathers 
derives from Becker and Siefkes’ (1969) find- 
ing that the PPM mothers reported a nar- 
rower range of disciplinary measures than any 
of the other parental groups. 
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Likewise, the low anxiety of PPM parents 
can be rationalized. Lidz, Fleck, and Corneli- 
son (1965) in summarizing observations on 
parents of female schizophrenics emphasize 
the “aloof” qualities of their fathers (pp. 
104-105) and mothers (p. 264). Baxter, 
Becker, and Hooks (1963) reported greater 
use of denial than obsessional mechanisms in 
PPM parents. Recent work with Byrne’s 
(1964) Repression-Sensitization (R-S) scale 
suggests that less overt anxiety is asso- 
ciated with denial and repression than with 
obsessiveness. 

Also, it is possible that PPM parents, per- 
haps especially the mothers, have been queried 
more about their socialization practices, and 
have thought more about them. Hence they 
may have been more rehearsed and less 
anxious. 

Finally, the data in the present study were 
obtained under somewhat more structured 
conditions than Wynne et al.’s conjoint family 
therapy sessions or projective testing. Degree 
of structuredness may be a critical determi- 
nant of the coherence and cohesiveness of 
PPM parents’ speech and thought. 

It appears that the generality of Wynne 
et al's findings must be viewed with cau- 
tion, The tendency of raters in the present 
study to correctly identify PPMs as parents 
of more severely disturbed children is, of 
course, congruent with Wynne et al.’s find- 
ings. Increased concern with varied mea- 
sures of assessment and further delineation of 
dimensions of thought disorders are clearly 
indicated. 

The most valid generalization to be drawn 
from the present findings is that interesting 
and unexpected results have been obtained 
which require further explication. 
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CONSTRUCT VALIDITY FOR THE EVALUATION OF 
THERAPY OUTCOMES * 
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Evaluation is a matter of judging conformity to certain values and the judg- 
ment of how healthy or diseased a person is is an evaluation. The first step 
in doing outcome evaluation studies of psychotherapy is, therefore, to ex- 
plicitly choose a reference group or public and find out what its relevant 
values are. Patients, clients, sponsors, and therapists constitute four such 
publics with somewhat different values. Probable value criteria for these 
publics were proposed and the problems of defining and collecting valid data 


for these criteria were discussed. 


The impasse between clinicians and re- 
searchers over the value of psychotherapy 
exists primarily because of the widespread 
misconception that valid evaluative measures 
of outcome can be discovered in empirical 
studies of or experience with therapy or can 
be derived from theories of personality. In- 
stead, they must be derived from or proved 
by the values held by certain groups of per- 
sons. Why this is so, what these values may 
be, and how outcomes must therefore be eval- 
uated are the topics considered in this paper. 


VALUES AND PUBLICS 


There can be no evaluation without prior 
standards for what is good or what is better. 
Such standards cannot be logically deduced 
(or invented) without a priori value proposi- 
tions to deduce them from. Likewise, they 
cannot be empirically discovered without a 
priori criteria, that is, value propositions, for 
recognizing which differences in outcome be- 
tween treatments are relevant to evaluation 
and the degree to which each criterion is im- 
portant (see, e.g., Margolis, 1966, pp. 56-66; 
Wootton, 1959, pp. 203-226). Since an evalu- 
ation is always done for some audience or 
public, it can only be convincing if it is done 
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James Breedlove, Jay Hirsch, Kenneth Howard, 
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Street, Chicago, Illinois 60611. 


in terms of that public’s relevant values. If 
such a public has no explicit relevant values 
or no adequate consensus on what these values 
are, no evaluation done for it can probably 
be decisive or even acceptable, The achieve- 
ment of an adequate value consensus is a 
social process wherein certain values come to 
be widely held through persuasion, negotia- 
tion, legislation, or the development of dog- 
matic traditions. New factual discoveries, 
however, may indicate to some members of a 
public that a certain value commitment is in- 
adequate, in that it is too narrow or internally 
contradictory, for example, but they cannot 
imply what it should be. 

The evaluation of therapies intended to en- 
hance health or reduce disease in humans 
must be done in terms of the values which 
define these human conditions. The public 
whose values must be satisfied in any particu- 
lar case includes those persons in whose in- 
terest the therapy is performed or provided. 

*The concept of disease is used quite broadly to 
include all sorts of unhealthiness, Thus, injury and 
malaise are included just as are infectious diseases 
and neurosis. Even character disorders, disabilities, 
and retardations are covered, The values that generi- 
cally define disease are ultimately cultural or social 
rather than medical or other professional ones (e+ 
Dubos, 1961, p. 214; Jaspers, 1962, pp. 779-788; 
Mechanic, 1962). The matter could be relegated to 
some profession, but, if it is, such a delegation of 
authority should first be well and openly considered. 
To be judged ill is often tantamount to some loss of 
freedom and other rights, especially if mental illness 
is imputed. The author does not agree with Szasz 
(1961, pp. 39-45) that the answer lies in limiting the 
definition of illness to physiochemical disorders. 


Instead, a full scale analysis of the concept of illness 
is in order. 


524 


"This means that the values of (a) the persons 
receiving the therapy (the patients), (5) the 
"persons whose complaints will be remedied by 
the therapy (the clients), and (c) the persons 
"whose planning and material support is respon- 
ble for the provision of the therapy (the 
sponsors) are pertinent to the therapy’s eval- 
"uation. In some cases these are all the same 
"person, such as the affluent self-referral for 
— psychotherapy. In other cases they may all be 
— different people, as in a school referral of a 
"fire-setter to a public child guidance agency. 
But all the variations are possible, for exam- 
“ple, that of a housewife who brings in her 
genployed alcoholic husband to a private fam- 
ily counseling agency. 
$ - The only other obviously interested parties 
to therapy transactions are (d) the profes- 
‘sionals who “perform” them, but these pro- 
fessionals are almost always selling their 
"Services as means to someone else's ends as 
- well as applying their own professional norms 
"concerning means and ends. Therefore, as re- 
-sponsible vendors, they should subscribe to 
the values of their sponsors; as honest con- 
— tractors, to the values of their clients; and, as 
- humanitarians, to the values of their patients. 
Professional norms do further complicate the 
therapists’ values but can be considered sepa- 
3 rately where different from sponsor, client, or 
patient values (as in the judgment the opera- 
tion was a success but the patient died). It 
may be that the outcome of a therapy case 
could be evaluated differently in terms of the 
different values of these four publics. And it 
8 not at all clear that the values of any one 
Iblic, whatever these may be, are superior to 
or more relevant than the values of any other. 
lus, issues such as whether or not “mental 
Iness” is disease are cultural issues, not 
merely scientific ones, and so are settled ulti- 
ately by whatever means produce a relevant 
alue consensus. 
—— Health and disease are both open concepts, 
that is, only partially defined. It is obvious 
that they are extensionally open (that all 
their instances are not known) because new 
- Cases of disease are regularly discovered, and 
Leyen altogether new diseases and degrees of 
health are occasionally identified. It is not 
— *'Thus, there are those who believe that a client's 
- failure to realize his potential, to actualize, develop, 
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quite so obvious that they are also intension- 
ally open concepts, that is, that the criteria 
for distinguishing a diseased from a healthy 
person cannot be completely stated to every- 
one's satisfaction. There are, however, some 
definitional constraints that are, we believe, 
generally respected. Since this is a methodo- 
logical analysis, rather than an empirical sur- 
vey of definitions in use, this purportedly 
consensual theme is developed. 


HEALTH AND DISEASE 


Whatever other characteristics it may have, 
every disease involves or portends bad conse- 
quences for at least the person who has the 
disease. A sine qua non of disease is the 
presence of or disposition to disvalued (i.e. 
bad) experiences of, acts by, or physical 
events in the person which tend to incapaci- 
tate him for the role performances expected 
of him by the relevant publics (see Parsons, 
1964, pp. 258-262). Pain is the paradigmatic 
experiential event; destruction, the paradig- 
matic behavioral event; and loss of, or inade- 
quate, form or function, the paradigmatic 
physical or characterological events, respec- 
tively. These disvalued and incapacitating 
events are the primary characteristics of all 
diseases, mental and physical, from their rela- 
tion to which the rest of any disease's char- 
acteristics acquire their relevance to that 
disease. These primary characteristics are 
weakly disjunctively necessary conditions for 
the presence of manifest disease. Although at 
least one of them is necessary, no specific com- 
bination of these three characteristics is gen- 
erally sufficient to define a state of disease. 
This is so, in part, because tolerances for 
pain, destruction, and losses of form and 
function vary, and because tolerances for dif- 
ferent degrees of incapacitation in different 
roles vary as well. However, it is also due in 
part to the priority of certain alternative ex- 
planations for some instances of these primary 
characteristics, for example, overexertion, im- 
maturity, or aging. 

One GRE these primary disvalued 
characteristics being necessary to the defini- 
tion of any disease is the evaluative nature of 


and express himself is an index of mental illness 
(e.g., Maslow, 1962, pp. 141-174). 
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the disease concept itself.° To diagnose a per- 
son as having a disease is tantamount to a 
decision to take therapeutic action (condi- 
tional upon the availability of the resources 
for doing so, upon competing claims on these 
resources, and upon the probable net advan- 
tage to patients, clients, or sponsors involved). 
Thus, the identification of an instance or case 
of disease by a medical or other therapeutic 
professional in the line of duty (ie., in re- 
sponse to a bona fide client) is not a mere 
perceptual or cognitive achievement. It is 
also, properly, often a cue to some more or 
less remedial response, because a state of 
disease is always a disvalued state of any 
person. 

Within the context of disvalued and inca- 
pacitating personal events, as the class of 
(necessary) disease-defining events, the sev- 
eral possible publics for evaluative research 
can differ in their values. They can also differ 
in their skill at recognizing dispositions to, if 
not the presence of, these primary character- 
istics. It is here that therapy professionals 
may properly claim some technical advantage, 
though their degree of advantage is an em- 
pirical matter. Evaluative research, however, 
need not endow these prognostications with 
any privileged status so long as extended or 
recurrent contact with (former) patients and 
clients is feasible. 


MENTAL HEALTH Status 


Some disease, broadly conceived, seems to 
be caused at least in part by mental or per- 
sonality characteristics of the patient. This 
kind of “disease” has been taken by the 
“mental health” professions as their responsi- 
bility. In no way, however, should the logical 


5 Therefore, statistical abnormality per se is in- 
sufficient as a defining property of disease, because 
valued states of the person may also be rare, It 
becomes absurd as a defining property of seriousness 
of disease or degree of unhealthiness, because it 
makes more frequent diseases ipso facto less serious. 
Thus, without arbitrarily specifying some reference 
frequency or population, burgeoning epidemics make 
the individual patients improve, and effective pre- 
ventive and curative procedures make them deteri- 
orate. Only by affecting values can abnormality in- 
fluence the definition of disease, but it has no privi- 
leged status among the many factors that can in- 
fluence values. 
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problem of evaluating the outcome of the 
psychosocial therapies used by these profes- 
sionals differ from that of evaluating the out- 
come of any treatment of any disease (see 
Margolis, 1966, pp. 71-82, on this point). The 
previous discussion shall merely be extended 
to more nearly depict operational assessment 
procedures for mental health (or disease) 
status. 

Besides physical pain there are equally dis- 
valued moods or affects, such as anxiety, 
despair, and guilt, which can be quite inca- 
pacitating. The destructive acts to self or oth- 
ers which can indicate disease are more in- 
digenous to mental than to physical illness, 
though here it is their irrational quality 
which is often stressed. The inadequate at- 
tainment or the impairment of form and 
function are, in this realm of disease, usually 
in social or mental affairs, rather than in 
physical structures. The incapacitation for 
role performance, however, can be as severe as 
in any other realm of disease. How unhealthy, 
ill, sick, or diseased a person is, and, conse- 
quently, how much better or worse he is, can, 
of course, only be determined from some value 
position (although his prognosis, in terms of 
whatever normative framework is adopted, is 
in part a matter of scientific knowledge). 

Each patient has his own tolerances for 
various moods and affects, his special personal 
and environmental resistances to ostensibly 
destructive behavior, and his particular per- 
sonal and environmental tolerances for inade- 
quate or reduced mental and social form and 
function. Furthermore, the roles which he 
must, or hopes to, undertake and the demands 
they make upon him are also specific to his 
life. Therefore, any evaluation of his therapy 
from Ais standpoint must concern changes in 
experiences, acts, events, and capacities as 
judged by him to be better or worse on bal- 
ance. 

A client who is not a patient (i.e., someone 
who refers a patient at least in part because 
of his bad effect on their lives or jobs) can 
make similar kinds of appraisals but only 
from his own standpoint. These appraisals by 
collateral persons can diverge considerably 
from those of the patient himself. Their rec- 
ognition and valuation of the patient’s moods, 
destructive acts, functioning, and capacities 
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are quite independent of his. Perhaps his re- 
ports of his own affective experiences are bet- 
ter than theirs, but this does not hold for 
impressions of the effectiveness, finesse, effi- 
ciency, or public consequences of his role per- 
formances. They may see problems for them- 
selves, created by the patient, which the pa- 
tient either does not see or interprets differ- 
ently. 

A sponsor, who is neither a patient nor cli- 
ent, may, as an enlightened humanitarian, 
share a concern with patients for their ex- 
periences, self-destructiveness, and the func- 
tioning and capacities they value specifically. 
He may also be concerned about the problems 
that patients create for various collateral 
persons. However, he may, further, be con- 
cerned with law and order, public decency, 
responsible citizenship, escapism, economic 
self-reliance, etc., in ways which patients and 
clients do not see as relevant to therapy. 
(What, for example, does government foresee 
as the desirable consequences of public mental 
health centers?) 

The mental health professionals are un- 
doubtedly concerned to optimize the patient's 
outcome from his standpoint (even, some- 
times, at the expense of altering that stand- 
point a bit) and from the standpoint of cer- 
tain other clients as well. The parents in 
child guidance cases, the children of a senile 
parent, the child's analyst who has referred 
the parents for counseling (but not so often 
the wife of an alcoholic or the dependent 
mother of a bachelor), may find the therapist 
sympathetic to their value position. Thera- 
pists probably are also relatively aloof to 
purely sponsor preferences. Most of all, 
therapists rely on their own special standpoint 
for judging mental health status. Unfortu- 
nately, what their value position is varies 
Considerably both within and between the 
therapy professions (e.g., see Offer & Sabshin, 
1966), so it is discussed here in quite broad 
terms. 

The major novelty in the therapist’s evalua- 
tive standpoint is its inclusion of prognostic 
indicators. Thus, judgments about ego 
Strength, object relations, superego severity, 
Opportunities for impulse gratification, skill 
repertoire, family dynamics, environmental 
Stresses, etc., influence their evaluations of 
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mental health status. The only justification 
for such a use of prognostic indicators is in 
the degree to which they actually do tend to 
predict the state of the several primary char- 
acteristics of disease already discussed. An- 
other novelty of the therapists’ evaluative 
standpoint, before we pursue the first more 
deeply, is its usual restriction to data based 
on observations made in the office, clinic, or 
hospital, environments quite foreign to the 
life for which the patient is being treated. 


Speciric DISEASES 


Among the persons who are considered 
diseased (sick, ill, or injured) by a medical 
or paramedical professional, discriminations 
are made regarding what disease or diseases 
they “have.” In our society the recognition 
and nonrecognition of disease by medical pro- 
fessionals has acquired a special legitimacy, 
although their special experience and skills jus- 
tify such legitimacy only for their discrimina- 
tions of specific diseases. These latter are de- 
fined most securely when they are defined 
genotypically, that is, when their course, eti- 
ology, prevention, and cure are understood 
(see Temkin, 1963). Extensive scientific re- 
search is necessary to achieve such an under- 
standing. Short of such a genotypic definition, 
and prerequisite to it, is phenotypic defini- 
tion, which depicts the configuration and co- 
variation of the general and primary disease 
characteristics with any others. Such reliable 
clusters of characteristics are the bases upon 
which discriminable kinds of diseases are 
recognized and their genotypic definitions 
sought. The crucial point, however, is that the 
criterion indicators of disease in general are 
the primary characteristics previously dis- 
cussed, whereas the criterion differential indi- 
cators of specific diseases may be anything. 
What these latter properly are is clearly an 
empirical rather than a value or stipulative 
matter, but it is quite irrelevant to evaluating 
health status unless these differential diagnos- 
tic indicators are well-evidenced predictors of 
future stability or change in the primary 
characteristics of disease. Such demonstrated 
prognostic utility is practically unknown in 
the archives of mental health research. There- 
fore, evaluative studies of psychosocial ther- 
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apy should not rely upon the inadequately 
predictively validated prognostic indicators. 

The exception to this rule quite properly 
occurs when the public for which an evalua- 
tive study is performed is that of therapists. 
(The happenstance that outcome studies have 
been predominantly, if not exclusively, done 
by and for therapists and allied professionals 
may make this logical exception seem more 
the rule.) If the goals of therapy are cast in 
terms of such prognostic indicators, then it is 
of interest to study when and how these goals 
are attained. Such technical or professional 
studies, or those devoted to theoretical or 
predictive validity, however, ought not be mis- 
taken for true evaluative studies of mental 
health status outcomes, Likewise, studies con- 
cerned with the stability or change of a spe- 
cific disease in patients must not be equated 
with studies of general health status, because 
one disease may improve and yet the patient 
may be more unhealthy. Since the seriousness 
of any two specific diseases can logically be 
negatively correlated within patients and since 
all disease has not been classified into a 
closed set of specific diseases, the course of 
any one specific disease is no sure determinant 
of health status. 


Construct VALIDITY * 


The first problem of evaluative data is their 
semantic construct validity, that is, to what 
extent they are relevant to or can reflect 
health status for a given public. The line of 
argument set forth so far was intended to 
explicate the concepts of health and illness. 
Although the concept of mental health has 
been used, it is certainly not clear how to dis- 
tinguish mental health from physical health 
unless, for example, any genotypically defined 
physical diseases which are demonstrably un- 
responsive to psychosocial therapies be ex- 
cluded. But what could be the purpose of 
such a distinction for evaluative research, ex- 
cept to create two subprofessions out of one. 
For the primary criteria of outcome evalua- 
tion no different background or training is 
necessary to do research on treatments for 
tuberculosis or fractures as to do it on schizo- 


6 The methodological grounds for this section are 
discussed in Krause (1967). 
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phrenia, homosexuality, or low back pains. 
And these primary criteria are, in addition to 
cost, what matter in treatment evaluation. 
Since the explication includes considerations 
popular among therapy professionals, one ex- 
pects that it should be more or less acceptable 
to them, at least as far as it goes. Neverthe- 
less, there can be no rational evaluation with- 
out specifying the public for which it is ex- 
pressly designed, and so empirical research on 
that public's value position is prerequisite. 

The second problem of evaluative data is 
that their public may be committed to certain 
theoretical propositions so firmly that out- 
come findings contrary to these would be con- 
sidered invalid. Thus, psychoanalysts might 
find it implausible that good analysands would 
not be healthier on the average after 2 years 
of apparently successful analysis, and client- 
centered therapists might feel the same about 
a similar exposure to perceived unconditional 
positive regard, empathic understanding, and 
congruence in the relationship, etc. The only 
insurance an evaluative researcher can have 
against such inconclusiveness is to achieve and 
maintain his chosen public’s faith in the va- 
lidity of his data. It must be his goal to 
collect sufficiently convincing data to jar the 
prior convictions of even the most theoreti- 
cally committed of his public, should discon- 
firmative results turn up. That is exactly the 
upper bound of how much validity he needs. 

A usual line of attack on the validity of 
data, and so the third problem for an evalu- 
ator, concerns technological construct valid- 
ity. There are always possible biasing factors, 
not colinear with actual mental health status, 
which may have influenced the data. Respon- 
dent, test taker, interviewer, test administra- 
tor, and E biases have been demonstrated, 
but may be stochastically equated between 
experimental and control groups. What cannot 
be avoided through better assessment instru- 
ments must be rendered irrelevant by better 
experiment design, For this reason if no other, 
control groups are generally necessary in 
evaluative research. 

Although most “evaluative” work is done 
for therapists, and some ostensibly for (espe- 
cially Federal) sponsors, it is conceivable that 
consumers’ organizations will someday be 
strong enough in this country to purchase 
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their own research for patients’ and clients’ 
standpoints. In the meantime we must be 
content with exploring logical possibilities 
and encouraging the development of counter- 
vailing forces in the market of health services. 


PRACTICAL EVALUATION ' 


Once a random or adequately matched 
assignment of patients between a control 
“treatment” and the therapy being evaluated 
has been achieved, the conditions for some 
evaluation are present. In whatever terms are 
acceptable to the evaluator's public, the roles 
and standards of performance of the patients 
must be assessed. Instrument development 
must be based upon empirical research pri- 
marily on the relevant public. There are no 
scientific standards for the content of evalu- 
ative assessments, unless "well-evidenced" 
predictors of future stability or change in 
mental health status already exist. Prior re- 
search on, or experience with, patients and 
their lives can, otherwise, only serve to suggest 
possibilities to the relevant public. 

Whatever is the appropriate content for an 
evaluative assessment of patients, some ade- 
quate means of obtaining the pertinent data 
must be developed. The criterion data, ac- 
- cording to our analysis of disease, are on 
patient-role performance capacities (which 
are dispositional properties of persons) and 
On patient experiences, acts, and events 
lending to alter these capacities (which are 
state properties of persons with certain dis- 
positional and causal properties of their own). 

A person's role-performance capacities are 
fully manifest by his role performance only 
when his motivation to perform is optimal 
and his opportunities to perform or conditions 
Of performance are sufficiently similar to 
those of interest. Thus, proper test condi- 
tions for mental health status require suf- 
ficient motivation of the patient for adequate 
tole performance and “real life" situations 
appropriate to this performance. Only patients 
themselves, their natural associates, or a very 
intrusive evaluation team (made more subtle, 
Perhaps, by modern electronic apparatus) 


* An example of an evaluative study conducted as 
an application of the logic of this essay, and par- 
ticularly of this and the following section, is Krause, 
Breedlove, and Gonen (1967). 
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could observe patients under these condi- 
tions, Although it may be reasonable to 
suppose that inadequate role performance in 
one situation is predictive of similar inade- 
quacy in other situations, whether this is so 
or not is an empirical matter. Certainly per- 
formance in the unfamiliar role of patient, 
test taker, applicant for help, or prisoner 
may not be a good predictor. Since therapists 
typically know only what they are told by 
patients and collaterals and what they see in 
the therapy or testing situation, their assess- 
ments are either (adjusted) hearsay evidence 
or prognostications, some of which are gen- 
eralizations to other situations. Therefore, 
evaluative research requires that good work- 
ing relations be established with patients and 
their natural associates for interviewing or, 
better, observation by the evaluation team. 
The subtle problems of bias, reactivity, and 
indirect measures of role performance are not 
taken up here. 

As noted in the preceding paragraphs, the 
presence of any experiences of, acts by, or 
events in the patient which tend to incapaci- 
tate him are also within the evaluative pur- 
view. Their disposition to incapacitate is an 
empirical matter on par with therapists’ 
prognostications, but some tend to be some- 
what incapacitating by definition, Thus, in- 
tolerable anxiety or strain, serious and ap- 
parently insoluble “life problems,” unpleasant 
irrational behavior, and flagrant antisocial be- 
havior are prime examples. Whatever the 
whole gamut of these “pathogenic” states of 
the person may be, those which are acts or 
(nonexperiential) events are accessible to 
natural associates, evaluation teams, thera- 
pists, and some patients if they observe the 
patient for a sufficient length of time and 
over a sufficient variety of circumstances. The 
likely error is that of failure to note transitory 
relevant behaviors. Although the same is true 
for relevant patient experiences, only the pa- 
tient himself has access to these and so for 
this data his therapist should have a con- 
siderable advantage over many of the patient’s 
natural associates and even over many evalua- 
tion team interviewers or observers. Again in 
the case of these several states of the patient, 
any further validity or reliability issues are 
not considered. 
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DECIDING BETWEEN TREATMENTS 


Once it is admitted that several legitimate 
standpoints for evaluation exist and, further, 
that each involves several criteria, the prob- 
lem of how to choose among treatments on 
the basis of their outcomes becomes very 
complicated. Only when these outcomes are 
superior for one treatment on all criteria from 
all standpoints is a choice simple. Obviously, 
it cam happen that what is an improvement 
from one standpoint (e.g., to government au- 
thorities a reduction in unconventional be- 
havior) may be taken to be worse from 
another (e.g., to humanistic therapists a re- 
duction in self-actualization). And evaluative 
research cannot avoid the implications of such 
conflicts of values in pluralistic societies. 
Perhaps the easiest solution derives from 
using the values of those who sponsor the 
research (and see some decisions to be made 
on the basis of its results). Less interested 
parties’ values may be involved if there are 
sufficient resources to collect the relevant 
data and if they too can be informed of the 
results. This seems a more socially responsible 
solution, but it demands that these various 
parties be identified and their standpoints 
defined first. 

Even within a single standpoint several 
noncolinear or even negatively correlated 
criteria may occur. If each could be assigned 
some proper weight, then a unique outcome 
score could be calculated for each case. But 
it is difficult to see, without empirical study, 
how important each criterion is in each stand- 
point or whether the criteria are properly 
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additive at all. In the meantime, all that is 
clear is that a case that improves on some 
criteria and becomes worse on none is better 
than another case which improves less or gets 
worse on any of these same criteria and does 
not improve more on any of them. Where 
other patterns of differences occur, choices 
between treatments require prior weighting 
rules for the criteria. 
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The equivalence of 12 psychotic syndromes established in the United States 
data was sought in data from six other countries. Samples of drug-free, newly 
hospitalized patients in England, France, Germany, Italy, Japan, and Sweden 
were rated after a semistructured interview. The pooled ratings of men and 
women in each sample were correlated and factored. A hypothesis-testing 
procedure and the normal Varimax were applied to yield two roughly 
equivalent solutions. With but a few exceptions each of the syndromes was 
confirmed in each of the six samples. The sample syndrome score profiles were 


also examined and discussed. 


Beginning in 1933 with Moore, a group of 
investigators have sought to isolate, by the 
method of factor analysis, the major psychotic 
syndromes reflected in currently observable 
symptoms and behavior, The need arose from 
the prevailing dissatisfaction with the psychi- 
atric classes and from the absence of any 
rigorous alternative procedure for correcting 
or verifying the symptom groupings reported 
by clinicians. Wittenborn (1951), Guertin 
(1952), Lorr, Jenkins, and O’Connor (1955), 
and Lorr, McNair, Klett, and Lasky (1962) 


: have among them isolated and confirmed 10 


or more psychotic syndromes. Since a critical 
review of these and other studies has been 
reported (Lorr, Klett, & McNair, 1962), none 
is needed here. 

The present study proposed to ascertain 
whether or not the psychotic syndromes de- 
fined by the factor analysis of interview 
ratings were the same in other cultures than 
our own. The major goal was to determine the 
number and kinds of syndromes exhibited by 
drug-free and newly hospitalized men and 
women in each of six countries. The results 
were then compared across samples and with 

1 This study received partial support from National 
Institutes of Health Grant MH 113212-02. Grateful 
acknowledgment is made to Horst Flegel, Franco 
Giberti, Max Hamilton, Carl Jonsson, Ziro Kaneko, 
and Pierre Pichot who made this study possible. 
Ronald Bonato and Kenneth Yang of the Biometrics 
Laboratory, George Washington University, con- 
ducted the initial factor analyses. 

? Requests for reprints should be sent to Maurice 
Lorr, Department of Psychology, Catholic University 
of America, Washington, D. C. 20017. 
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the United States findings to assess the degree 
of constancy or similarity. At the present time 
there is no internationally accepted yardstick 
for measuring the kinds or degrees of psy- 
chotic behavior manifested by the severely 
disturbed. Qualitative descriptions and cate- 
gories constitute the major vehicles for com- 
municating research information concerning 
the behavioral disorders. In brief, the investi- 
gation could provide a test of the uniformity 
of the syndromes across several cultures. The 
end results would perhaps yield a set of 
measures and a conceptual scheme capable of 
facilitating epidemiological, genetic, and bio- 
chemical research. 

The 12 syndromes or constructs hypothe- 
sized were as follows: Excitement, Hostile 
Belligerence, Paranoid Projection, Grandios- 
ity, Perceptual Distortion, Obsessional-Phobic, 
Anxious Depression, Functional Impairment, 
Retardation and Apathy, Disorientation, 
Motor Disturbances, and Conceptual Disor- 
ganization. A Somatic Complaints syndrome 
was also framed but dropped for reasons to 
be given later. Each of these patterns has 
been identified two or more times. They also 
combine into five more inclusive higher-order 
syndromes (Lorr, Klett, & Cave, 1967) of 
which four resemble the well-known dis- 
orders referred to as schizophrenia, paranoid 
state, psychotic depression, and manic excite- 
ment, At the same time it should be under- 
stood that the syndromes hypothesized, based 
as they are on current symptoms and signs, 
are not synonymous with the usual diagnostic 
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classes. Conventional psychiatric categories 
include additionally such life history variables 
as age, type of onset, precipitating event, 
course of the disorder, and tendency toward 
recurrence. 

METHOD 
Data Collection 


Conferences were held with each group of col- 
laborating psychiatrists in England, France, Germany, 
Italy, and Sweden. Arrangements with the Japanese 
representative, however, were conducted by mail. 
All psychiatrists responsible for data collection held 
university appointments. The meetings were con- 
cerned with the nature of the sample, with the 
interview structure, and with the rating process. 
Several patients were interviewed and rated in order 
to familiarize each team with the rating schedule 
and with the rating process. Subsequently arrange- 
ments were made to translate the rating form of the 
Inpatient Multidimensional Psychiatric Scale (IMPS) 
into French, German, Italian, Swedish, and Japanese. 
The forms were then retranslated back into English 
by another competent person. Finally the original was 
compared with the retranslation by the investigators. 

Each team was asked to interview and rate a 

* representative sample of 100 males and 100 females 
diagnosed psychotic and newly hospitalized. A fur- 
ther requirement was that patients interviewed either 
be free of drugs or be on a minimum drug-dosage 
schedule. This requirement arose from research 
showing that the tranquilizers reduced or eliminated 
characteristic symptom profiles. Cases with central 
nervous system disorders or neurological disease as 
well as alcoholics and drug addicts were to be 
excluded. 

In addition to the ratings, data were collected on 
the following: age, marital status, highest grade 
completed, occupation, place of residence (rural 
or urban), initial psychiatric diagnosis, religion, 
duration of psychotic episode, and number of prior 
hospitalizations. 


Interview and Rating Schedule 


The standardized 75-item rating schedule IMPS was 
designed to measure 11 psychotic syndromes isolated 
in repeated factor analyses (Lorr, Klett, McNair, & 
Lasky, 1962). To the standard set, 15 new items 
were added to measure two depression syndromes 
recently established called Functional Impairment 
and Somatic Complaints (Lorr, Sonn, & Katz, 1967). 
Five to 11 items defined each syndrome measure. 
Data relative to the reliability and validity of the 
standardized portion of IMPS are presented else- 
where (Lorr & Klett, 1966; Lorr, Klett, McNair, & 
Lasky, 1962). 

The interviews were semistructured, 30 to 60 min. 
in length, and focused on information relevant to 
the questions in IMPS. Each interviewer was accom- 
panied by an equally experienced and trained ob- 
server. Ratings were completed independently by 
observer and interviewer immediately following the 
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interview. It was the combined ratings that were 
used in the subsequent analysis. The interviewing 
team also determined the patient's diagnosis using 
the American Psychiatric Association (1952) manual. 


Method of Analysis 


The first step was to ascertain the number of 
factors required to account for the correlations 
among the items. The two criteria used to judge 
completeness of factoring were (a) the number of 
factors with associated latent roots of 1.00 or greater, 
and (b) the number of Varimax-rotated factors, 
exclusive of doublets and singlets, judged to be 
meaningful. To this end two preliminary principal 
component analyses were made, one of the pooled 
male cases and one of the pooled female cases. The 
two 14-factor solutions yielded 12 interpretable Vari- 
max factors, of which 11 were clearly the same in 
both analyses, The analysis also revealed that the 
new Somatic Complaints syndrome hypothecated 
was represented weakly or was absent in several 
countries. For this reason the five somatic items 
were deleted for the hypothesis-testing portion of 
the study which was based on 85 variables. 

Next each sample's correlation matrix, based on 
the pooled ratings of both men and women, was 
factored by the method of principal components. 
Pooling seemed justified by prior research indicating 
negligible sex differences (Lorr & Klett, 1965), and 
by the reduction in the standard errors in correlation 
coefficients. The 12 factors extracted in each sample 
were rotated by a least-squares procedure to a 
simple structure hypothesis matrix (Horst, 1956) based 
on United States data. The simple structure matrix 
consisted of 85 rows and 12 columns representing the ' 
items and the factors hypothecated. Each item was 
assigned a zero weight except in the factor column 
y oe to define. The nonzero weights ranged 

The third set of analyses were based on all 90 
variables. Thirteen principal components were ex- 
tracted from each of the six matrices. The factors 
obtained were then transformed by Kaiser's normal 
Varimax. The use of 13 rather than 12 factors (as 
in the second analysis) was justified by the more 
stable structure attained in rotation. The "blind" 
analytical solution was intended as a check on the 
hypothesis-testing solution. The latter should provide 
a better fit to the data (being a correlated solution) 
and at the same time test the equivalence of, United 
States and cross-cultural structures. 


RESULTS 
Sample Characteristics 


The total sample consisted of 1,107 cases 
to which each country except Sweden con-. 
tributed between 200 and 204 cases. The 
Swedish sample consisted of 50 men and 50 
women. The median age of the 540 men was 
36 while the median age of the 567 women 
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TABLE 1 
CORRELATIONS OF SYMPTOMS WITH EXCITEMENT SYNDROME 
scil 
Country 
Symptom Item 
England | France | Germany Italy | Japan | Sweden 
Keyed 

Unrestrained behavior 7 59 wal 64 EL 49 39 

Hurried speech 9 .69 71 .68 44 67 .63 

High mood 12 51 61 A2 ES .62 AS 

Dramatizes 17 54 45 58 (.33) .66 .64 

Loud speech 20 .65 .62 .65 E .66 64 

Overactive 26 68 416 .62 59 65 61 

Excess speech 35 , 12 -70 60 18 66 «66 
T Dominating 37 64 .59 AT 72 .60 .65 

Intense feeling 76 51 36 Ad 76 E: .50 

Seductive 77 50 = 49 — 57 AS 

Not keyed 

Rambling speech 4 E 51 = 42 51 43 

Superior attitude 15 35 51 .36 

"Tension 10 37 35 

Irritability 25 42 

Slovenliness 30 .36 

Complaining 34 39 

Sleep difficulty 86 e 


Note.—Number in parentheses not significant at +.35. 


was 38. The average years of schooling re- 
ceived by patients ranged from 6.5 for Italians 
to 11.5 for the Japanese. The percentage of 
single (never married) was 63 for men and 
37 for women. No previous hospitalization 
had been experienced by 46% of the men and 
43% of the women. As might be expected 
nearly all of the French and Italian patients 


were Catholic and nearly all of the Swedish 
were Protestants. The English and the Ger- 
man sample was split with Protestants pre- 
dominating, Two-thirds of the Japanese were 
Buddhists. Approximately half of the sample 
came from very large cities. The occupation 
of most patients fell into clerical and sales, 
skilled and semiskilled. 


TABLE 2 
CORRELATIONS OF SYMPTOMS WITH ANXIOUS DEPRESSION SYNDROME 
| ] Country 
Symptom Item 3 
England France Germany Italy | Japan | Sweden 
i Keyed 
Blames self 14 .79 719 12 78 69 19 
Anxious (specific) 21 45 50 4 46 AT ju 
ae 24 A4 44 .51 42 AL zi 
Self-depreciating 27 71 73 63 68 62 ae 
Depressed 29 45 .54 53 40 59 ES 
Guilty 31 83 83 E 78 E 77 
Lacks insight —39 (—.26) — — (—.34) E E 
Suicidal 40 75 By rH T ie m 
Feels sinful 66 42 1 f y ? 4 
Hopeless 78 56 -60 -56 42 65 63 
Not keyed 
Tense 10 43 39 
Slow moving 16 E 
Obsessive — ' 41 37 45 
Lacks sex interest 81 E 


Note,—Numbers in parentheses not significant at +.35. 
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A standard American Psychiatric Associa- 
tion (1952) diagnosis was obtained for each 
case. The English, French, German, and 
Swedish samples were fairly consistent in their 
distribution of cases to the various diagnostic 
categories. The percentages of nonparanoid 
schizophrenics, paranoids (including paranoid 
schizophrenics), depressives, and manics were 
40, 30, 22, and 8, respectively. Irt contrast, 
more than half of the Italian sample were 
diagnosed depressives and about half of the 
Japanese received nonparanoid schizophrenic 
diagnoses. 


Hypothesis-Testing Solution 


Tables 1 to 12 present by country those 
symptoms and behaviors correlating £35 
with each syndrome in the hypothesis-testing 
solution. The items keyed for the syndromes 
(those given nonzero weights) are shown in 
the upper section of each table. The items not 
keyed but correlating above .35 with the 
factor are shown in the lower section of each 
table. Items 76 to 90 represent the experi- 
mental scales. 

The results are summarized briefly as the 
abbreviated symptom descriptions make it 


TABLE 3 
CORRELATIONS OF SYMPTOMS WITH FUNCTIONAL IMPAIRMENT SYNDROME 


ee Country 
Symptom Item 
England | France Germany Italy | Japan | Sweden 
PR Keyed 
acks sex interest 81 (.26) 50 — é 
Cannot work 84 49 51 .59 oe TH R 
Cannot concentrate 85 46 59 -59 .62 42 49 
Sleep difficulty 86 Al AL .56 .50 49 39 
pe T i E va m 50 61 51 A0 
'o interest in people al 53 37 È 
Not keyed s A^ ud 
Apprehensive 24 .39 
Lacks insight 39 —.49 
Suicidal 40 35 
Hopeless 78 37 


Note.—Numbers in parentheses not significant at +.35, 
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obvious what defines a factor. Nonkeyed items 
are considered if they correlated with a factor 
in at least three countries. 

Excitement. This pattern presented in Table 
l emerged similarly in each country. Ram- 
bling speech (4) and an attitude of superior- 
ity (15), although not in the key, also corre- 
lated with the syndrome. In the United States 
samples they also correlated with Excitement 
but at lower levels. 

Anxious Depression. The pattern presented 
in Table 2 corresponds closely to the United 
States key except for lack of insight which 
was expected to correlate negatively. 

Functional Impairment. This less well-es- 
tablished syndrome presented in Table 3 was 


defined fairly uniformly in each of the six 
samples as hypothesized. It had been previ- 
ously identified by Overall (1963) and by 
Lorr, Sonn, and Katz (1967). 

Obsessional-Phobic. Table 4 presents a pat- 
tern fairly well defined in four countries, 
weakly defined for France, and absent from 
Sweden. Ideas of change or depersonalization 
(67) was also weak in the United States data 
as it is here. 

Retardation and Apathy. With but a few 
exceptions this syndrome (Table 5) was con- 
firmed in each sample. Manneristic posturing 
(6) was the only nonkeyed item relating to 
Retardation in three samples. 


TABLE 4 
CoRRELATIONS OF SYMPTOMS WITH OBSESSIVE-PHOBIC SYNDROME 
Country 
Symptom Item 
England | France | Germany | Italy | Japan | Sweden 

Keyed 
Obsessive di 38 (.26) .62 19 see i 
Fearful (specific) 42 50 (31) .38 5 vm a 
Compulsive 43 = — E .80 E nc 
Ideas of change 67 51 = = = 

Not keyed 
Neologisms 49 37 
External control 63 38 


Note,—Numbers in parentheses not significant at +.35, 
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TABLE 5 
CORRELATIONS OF SYMPTOMS WITH RETARDATION SYNDROME 
Country 
Symptom Item F 
England | France Germany Italy | Japan | Sweden 
Keyed 
Slowed speech 1 .68 57 51 12 59 61 
Lacks goals 8 53 AT 55 (.32) .66 .59 
Fixed facies 13 69 68 -63 .62 .67 .65 
Slowed movements 16 414 65 48 72 54 56 
Memory deficit 19 (32) .39 (.29) 49 45 — 
Blocked speech 22 51 AA (.34) — AT ER] 
Apathetic 23 7 54 59 AT A 59 
Slovenly 30 42 59 (.33) AT (.30) .38 
Whispered speech 33 «67 .68 .50 12 57 .52 
Fails to answer 38 .61 67 .67 4 61 61 
Not keyed 
pokes 2 .36 A3 1 51 
elevant perce 4 
Lacks insight 39 61 
48 48 


Repet. movements 
n 


Note,—Numbers in parentheses not significant at 3-.35. 


Hostile Belligerence. The belligerence pat- 
tern shown in Table 6 was clearly confirmed 
in each sample. Ideas of persecution (60) was 
the only nonkeyed item represented. 

Disorientation. Although relatively few pa- 
tients were rated disoriented, the defining 
items (Table 7) were relatively objective and 
reliable. This may be the reason the factor 
appeared uniformly in each sample, 
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Grandiosity. 'The pattern of grandiose ex- 
pansiveness shown in Table 8 was relatively 
narrow but was well established. Here it was 
most weakly represented in Italy. 

Conceptual Disorganization. 'The items that 
define the pattern shown in Table 9 are asso- 
ciated with process-type disorganized schizo- 
phrenia. Thus it is understandable why the 
syndrome fails to appear in Italy and in 
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Sweden. The Italian sample consisted mainly 
of depressives while the Swedish sample was 
restricted to mild cases by the requirement 
that patients be drug free. 

Motor Disturbances. The syndrome, shown 
in Table 10, was confirmed in four samples, 
weak in Sweden, and absent from Japan. Since 
nonparanoid diagnoses were well represented 
in the Japanese sample it was difficult to ac- 
count for the absence of the syndrome. Speech 
blocking (Item 22) correlated better with 


Motor Disturbances than with Conceptual 
Disorganization where it was keyed. 

Perceptual Distortion. The syndrome shown 
in Table 11 is characterized by voices, visions, 
and other hallucinatory experiences. Since it 
was difficult to eliminate content, it is not 
surprising that Item 54 (voices that extol) 
correlated with the syndrome and with Grandi- 
osity. 

Paranoid Projection. Table 12 shows that 
the paranoid syndrome was well characterized 


TABLE 6 
CORRELATIONS or SYMPTOMS WITH HOSTILE BELLIGERENCE SYNDROME 
Country 
Symptom Item 
England | France Germany Italy | Japan Sweden 
Keyed 
Verbally hostile i 13 16 72 .67 .73 «17 
Contemptuous 11 .42 69 60 .60. A45 52 
Hostile attitude 18 yt «17 10 .50 .65 -64 
Irritable 25 64 71 53 49 54 64 
Blames others 28 58 St 61 62 67 70 
Resentful 32 .63 .61 .69 .63 15 28 
Complaining 34 68 23 -68 S1 .57 69 
Suspicious 36 m E 54 45 B :66 
Not keyed 

Ideas of persecution 60 AT .36 43 

Tense 10 46 

Dramatizing 17 40 

Slovenly 30 40 
Lacks insight 39 35 
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TABLE 7 
CORRELATIONS OF SYMPTOMS WITH DISORIENTATION SYNDROME 
Country 
Symptom Item 
England | France | Germany Italy | Japan | Sweden 
r 
Keyed 
Knows hospital 70 rst 69 93 91 — 60 
Knows state 71 49 12 83 94 A3 61 
Knows no one 72 55 68 62 16 E: = 
Knows season 73 64 76 .90 94 59 78 
Knows year 74 .66 77 86 81 62 80 
Knows age 75 AS 80 67 .80 62 75 
Not keyed 
Memory deficit 19 —.35 L 
Stereotyped speech 50 —42 
Ideas of change 67 35 


in each sample. Item 36, suspicion, although 
keyed for Hostile Belligerence also correlated 
moderately with the pattern. In the English, 
French, and German samples, four of the 
variables keyed for Perceptual Distortion also 
correlated with the paranoid syndrome. Sev- 
eral hypotheses can be offered to explain the 
extent of overlap between the two syndromes. 
First, it may be conjectured that the inter- 
viewers in some countries are more skillful or 
more thorough in seeking out hallucinations 
when paranoid ideation is present. Or, it is 
possible that because of tradition, raters in 


a 
a 


t-SCORES 


45 


some countries tend to assume the presence of 
hallucinations when paranoid delusions are 
noted. A third hypothesis is that the halluci- 
nating experiences are more frequently associ- 
ated with persecutory ideas in some countries 
than others. Further research with structured 
interviews may clarify the problem. 

The correlations among the primary factors 
of each solution were next examined. If these 
are quite large, they would suggest that the 
solution had been forced. However, the corre- 
lations were negligible or small with but few 
exceptions. Moderate correlations occurred at 
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TABLE 8 
CORRELATIONS OF SYMPTOMS WITH GRANDIOSITY SYNDROME 
Country 
Symptom Item n_a a M 
England | France Germany | Italy | Japan Sweden 
Keyed 
Superiority 15 50 40 AL 48 (.27) 35 
Voices extol 54 .68 .59 .50 s 43 51 
Unusual powers 64 56 .66 .61 54 «68 -66 
Great personality 65 .66 .62 .63 72 74 .39 
Divine mission 69 67 .68 4 18 63 .60 
Not keyed 
Visions 57 EU 35 
Elevated mood 12 37 
Specific fears 42 —.38 
oices order 56 53 
Amorous delusions 79 57 


Note.—Numbers in parentheses not significant at +.35. 


TABLE 9 
ConnELATIONS OF SYMPTOMS WITH CONCEPTUAL DISORGANIZATION SYNDROME 
Country 
Symptom Item 
England | France | Germany Italy | Japan | Sweden 

Keyed 
Irrelevant speech 2 56 Al 56 — 67 = 
Incoherent speech 3 .62 49 67 = -64 == 
Rambling speech 4 45 37 Al — A8 — 
Neologisms 49 49 .58 35 = 52 — 
Stereotyped speech 50 AS — 45 n (34) = 

Not keyed 
Startled glances 52 36 


Note.—Numbers in parentheses not significant at +.35. 


TABLE 10 
CORRELATIONS OF SYMPTOMS WITH MOTOR DISTURBANCES SYNDROME 
Country 
Symptom Item 
England | France | Germany Italy | Japan | Sweden 
Keyed 
Posturing 6 .57 44 64 .59 = 72 
Tense 10 (27) — (29) (30) — 39 
Giggling 46 48 62 50 [p — = 
Grimacing 47 74 -57 466 64 E 7 
Repet. movements 48 18 .53 67 E = 1 k 
Talking to self 51 .63 AT = 39 = en 
Startled glances 52 bd AT 46 41 42 — ^ 
Not k 
Speech blocking 22 Ad 45 38 36 
Heo E 44 4s 
ncoherent speech E | 
Slovenly 30 36 
Stereo. speech 50 54 
Seductive behavior 77 51 


uc c fcrt sube, eel De es | e a a 


Note.—Numbers in parentheses not significant at +.35. 
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approximately the same level obtained in the 
United States data. 


Varimax Solution 


The six Varimax solutions were examined 
with a view to finding matches across samples 
and with the United States key. The cutting 
point for a significant correlation was arbi- 
trarily set at + .35, a widely accepted value. 
On the whole, the simple structures achieved 


by the Varimax solution were not as clear as 
in the main solution. Although the syndromes 
tended to be correlated, the Varimax yielded 
independent factors. Consequently, items were 
likely to correlate significantly with several 
factors rather than one. However, six factors 
did have clean matches across the various 
samples. These factors were Excitement, Per- 
ceptual Distortion, Grandiosity, Anxious De- 
pression, Retardation, and Disorientation. 


TABLE 11 
CORRELATIONS OF SYMPTOMS WITH PERCEPTUAL DISTORTION SYNDROME 
Country 
Symptom Item 
England | France | Germany Italy | Japan | Sweden 
Keyed 
Hears voices 45 15 15 68 59 3 62 
Voices accuse 53 59 66 60 72 60 53 
Voices threaten 55 49 .60 .63 54 163 .69 
Voices order 56 ‘55 56 69 ‘53 ‘62 36 
Visions — 57 52 — 45 42 41 35 
Hallucinations (other) | — 58 44 39 49 37 3 63 
" i ab? keyed 
'oices extol 37 38 E 

Delusions m 44 38 4 x A. 

Startled glances 52 39 49 

People control 62 -50 43 

External control 63 44 45 
Ideas of reference 59 43 
Ideas of conspiracy 61 36 


: 
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TABLE 12 
CORRELATIONS OF SYMPTOMS WITH PARANOID PROJECTION SYNDROME 
Country 
Symptom Item 
England | France | Germany | Italy | Japan | Sweden 
Keyed 
Delusions 44 54 61 .65 .62 45 .56 
Ideas of reference 59 48 70 74 59 56 59 
Ideas of persecution 60 50 43 56 64 .63 51 
Ideas of conspiracy 61 .63 34 59 .61 .59 9 
People control 62 51 57 64 .63 43 39 
External control 63 38 36 49 53 E53 (27) 
Amorous delusions 79 34 — 37 — (.26) .50 
Homosexual delusions 80 42 34 -36 — (.29) 44 
Not keyed 
Suspicions 36 224 35 .61 .56 43 
Hears voices 45 .39 53 E 
Voices accuse 53 50 51 43 
Resentment 32 35 43 
Voices threaten 55 39 36 
Hallucinations (other) 58 40 40 
Ideas of change 67 49 .60 
Verbal hostility 5 36 
Tensions 10 -36 
Blames others 28 35 
Voices order 56 36 
Visions 57 43 
Great personality 65 Al 


Note,—Numbers in parentheses not significant at +.35, 


Four additional factors were represented in 
five countries. These factors were Hostile 
Belligerence, Paranoid Projection, Functional 
Impairment, and Motor Disturbance. Obses- 
sional-Phobic and Conceptual Disorganization 
patterns were represented in four samples. 
The Somatic Disturbance syndrome appeared 
in three countries, It could not have appeared 
in the hypothesis-testing analyses as the five 
defining items had been deleted. 

The Varimax findings may also be viewed 


from the standpoint of each country. All syn- 
dromes except Functional Impairment and 
Obsessional-Phobic appeared in the English 
data. The French sample yielded no Paranoid 
Projection syndrome; paranoid items corre- 
lated instead with Hostile Belligerence. The 
German sample failed to produce a Hostile 
Belligerence or a Somatic Disturbance syn- 
drome. The syndromes called Conceptual Dis- 
organization and Somatic Disturbance were 
missing from the Italian sample. The Japa- 


TABLE 13 
SYNDROME Raw Scores ror MALES 
Country 
Syndrome 
England France Germany Italy Japan Sweden | United States 
Excitement 18 10 23 10 12 15 20 
Hostile belligerence 20 18 30 22 15 18 31 
Paranoid projection 35 26 41 24 39 34 33 
Grandiosity 14 6 13 3 8 9 9 
Perceptual distortion 15 12 18 9 20 15 2 
ious depression 34 20 53 58 28 25 2 
Retardation 26 28 a E a sm at 
isorientation 0 0 
Motor disturbances 11 15 21 12 6 1 T 
Conceptual disorganization 8 5 14 8 4 8 
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TABLE 14 
Synprome Raw SCORES FOR FEMALES 
Country 
Syndrome 
England France Germany Italy Japan Sweden | United States 
Excitement 16 14 27 21 18 24 25 
Hostile belligerence 18 18 28 31 25 27 37 
Paranoid projection 27 31 55 19 38 31 34 
Grandiosity 8 5 12 3 7 5 8 
Perceptual distortion 16 15 28 9 20 17 14 
Anxious depression. 49 23 51 68 26 31 40 
Retardation 25 23 38 25 30 38 34 
Disorientation 0 0 0 0 1 1 1 
Motor disturbances 9 15 21 13 8 15 18 
Conceptual disorganization 6 7 14 6 7 9 12 


nese sample produced no Motor Disturbance 
factor while the Swedish sample showed little 
evidence of Obsessional-Phobic, Somatic Dis- 
turbance, or Conceptual Disorganization. No 
tables are presented because the results are 
quite similar to those presented earlier. 

The findings from the two solutions thus 
support the conclusion that the same syn- 
dromes exist in all seven cultures. However, 
there were minor differences between coun- 
tries as to the variables that characterize each 
syndrome. It seems likely that these differ- 
ences reflect mainly sample biases. 


Culture Syndrome Differences 


Since the syndromes were comparatively 
constant across the seven cultures it was pos- 
sible to examine between-country differences 
in score profile. Each case was scored on the 
10 standard syndromes on the basis of the 
United States derived keys and weights. The 
mean syndrome scores were computed for 
both men and women by country. Table 13 
illustrates the range of variation in men, The 
mean raw scores for each country were also 
converted into. standard scores (based on 
United States norms) in order to examine the 
profiles. 

When the samples were ranked with respect 
to mean score on each of the syndromes it 
became apparent that the German sample was 
the most severely disturbed and the United 
States sample next. The French and Swedish 
samples were least disturbed; mean profile 

elements were nearly all below the American 
norms. 


It was also possible to characterize the 
mean syndrome standard score profile of each 
sample in terms of one or two scores most 
elevated above the norms. The English males 
sample was then seen as primarily grandiose 
and the female sample as anxious depressed. 
While the French profiles were relatively flat 
they exhibited peaks on paranoid projection 
and hallucinations. The German males may 
best be characterized as anxious depressed and 
paranoid. The female German sample tended 
to be hallucinated paranoids. Both Italian 
samples peaked only on anxious-depression, a 
finding consistent with diagnoses previously 
described. The Japanese samples were also 
characterized by elevated scores on paranoid 
projection and perceptual distortion (halluci- 
nations). The Swedish sample could not be 
characterized in any particular way since the 
profile was comparatively flat. . 


DISCUSSION 


The findings from the structural analyses 
are relatively clear-cut. The same set of syn- 
dromes appear consistently across all samples 
with only minor differences. If somewhat 
broader samples could have been obtained it 
seems likely that every syndrome would have 
appeared in each sample. 

The question may be raised as to whether 
and to what extent the findings reflect pro- 
fessional training and knowledge of Kraepe- 
linian categories. It should be noted in re- 
sponse, that at least half of the syndromes 
are not recognized as such by psychiatrists. 
Hostile Belligerence, Grandiosity, Perceptual 
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Distortion, and Conceptual Disorganization 
are examples. A second and equally cogent 
answer is that the same syndromes have been 
isolated in ratings provided by psychiatric 
aides on ward behavior, and by patient rela- 
tives on patient home behavior. 

The differences between countries in mean 
syndrome scores should be considered, at 
best, as suggestive. It is for this reason that 
no significance tests were made. The samples 
simply are not representative of any coun- 
try’s hospitalized psychotic population; this 
was not a study goal. Although representative 
samples from each hospital’s intake were 
sought, a greater emphasis was placed on the 
inclusion of all sources of individual variation 
in psychopathology. A sample designed for a 
factor analysis need not be representative of 
a population but it should include the range 
of variation on all major sources of difference. 
For example, to define a height factor it is 
necessary to include tall and short people; a 
random sample is not required. Repeated fac- 
tor analysis of widely different United States 
samples supports this argument. Studies have 
been made of state hospital cases, veterans 
hospital cases, and university-sponsored hos- 
pital cases. All yielded the same structures 
provided the samples were stratified on the 
spectrum of psychotic behavior. 


REFERENCES 


AMERICAN PSYCHIATRIC Association. Diagnostic and 
statistical manual for mental disorders, Washing- 
ton, D. C.: Mental Hospital Service, 1952. 


543 


Gurerm, W. H, A factor-analytic study of schizo- 
phrenic symptoms. Journal of Consulting Psychol- 
ogy, 1952, 16, 308-312. 

Horst, P. A simple method of rotating a centroid 
factor matrix to a simple structure hypothesis, 
Journal of Experimental Education, 1956, 24, 231- 
237. 

Lorr, M. Jenkins, R. L., & O’Connor, J. P. Factors 
descriptive of psychopathology and behavior of 
hospitalized psychotics. Journal of Abnormal and 
Social Psychology, 1955, 50, 78-86. 

Long, M. & Krzrr, C. J. Constancy of psychotic 
syndromes in men and women, Journal of Con- 
sulting Psychology, 1965, 29, 309-313. 

Long M. & Krerr, C. J. Inpatient Multidimen- 
sional Psychiatric Scale. Manual. Palo Alto, Calif.: 
Consulting Psychologists Press, 1966. 

Lorr, M. Kerr, C. J, & Cave, R. Higher level 
psychotic syndromes. Journal of Abnormal Psy- 
chology, 1967, 72, 74-77. 

Lorr, M., KLETT, C. J., & McNam, D. M. Syndromes 
of psychosis. Oxford: Pergamon Press, 1962. 

Lorr, M., Krzrr, C. J., McNam, D. M., & Lasxy, J. 
J. Inpatient Multidimensional Psychiatric Scale, 
Manual. Palo Alto, Calif.: Consulting Psycholo- 
gists Press, 1962. 

Lorr, Mọ, McNar, D. M., Krxrr, C. J., & LASKY, 
J. J. Evidence of ten psychotic syndromes. Journal 
of Consulting Psychology, 1962, 26, 185-189. 

Lorr, M., Sonn, T. M., & Karz, M. M. Toward a 
definition of depression. Archives of General Psy- 
chiatry, 1967, 17, 183-186. 

Moore, T. V. The essential psychoses and their 
fundamental syndromes. Studies in Psychology and 
Psychiatry, 1933, 3, 1-128. 

Overatt, J. E. Dimensions of manifest depression. 
Journal of Psychiatric Research, 1963, 1, 239-245. 

Wirrensorn, J. R. Symptom patterns in a group of 
mental hospital patients. Journal of Consulting 
Psychology, 1951, 15, 290-302. 


(Received July 12, 1968) 


Journal of Abnormal Psychology 
1969, Vol. 74, No. 4, 544-552 


COGNITIVE FACTORS IN DIFFERENTIAL CONDITIONING 
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Twenty-two normals and 22 schizophrenics underwent differential conditioning 
of the GSR, using 8-sec. tones of differing frequencies as the CSs, an 8-sec. 
CS-UCS interval, and a UCS comprised of an RT task signaled by a low- 
intensity light. Both intertrial reports and postconditioning interviews were 
obtained, Twelve Ss in the normal group verbalized the conditional stimulus 
relations accurately, compared to 3 Ss in the schizophrenic group. The normal 
group showed significant GSR differentiation, though conditioning was limited 
to the group of accurately verbalizing Ss. No evidence for conditioning was 
obtained in the schizophrenic group. Normal Ss had faster RTs than schizo- 
phrenic Ss. The RT of accurately verbalizing normals was shorter than that of 
inaccurately verbalizing normals, and the degree of GSR differentiation was 


significantly correlated with RT. 


In recent investigations of cognitive proc- 
esses during differential classical conditioning, 
conditional galvanic skin response (GSR) dif- 
ferentiation was found to be limited to groups 
of Ss capable of accurately describing the 
conditional stimulus relations during either 
intertrial reports or postconditioning inter- 
views (Baer & Fuhrer, 1968; Fuhrer & Baer, 
1965, 1966). Groups of Ss unable to state the 
stimulus relations accurately did not show dif- 
ferential GSR conditioning. The Ss in these 
studies were intellectually capable individuals 
without known psychopathology. Conditioning 
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was of an aversive type characterized by the 
use of a noxious unconditioned stimulus 
(UCS) consisting of shock at S’s tolerance 
level. Thus, the observed concordance between 
cognitive and GSR differentiation appeared to 
represent a relationship between verbalized 
anticipation of impending shock and GSR 
indexes of conditional fear responses. 

One purpose of this investigation was to 
extend the study of cognitive processes in dif- 
ferential GSR conditioning by using a non- 
aversive UCS. The reaction time (RT) signal 
and its associated motor act appeared prom- 
ising for this purpose. For instance, Marquis 
and Porter (1939) have demonstrated that 
an RT task can serve as an effective UCS for 
conditioning the eye-blink response. More re- 
cently, Martin (1965) showed that the GSR 
can be differentially conditioned using a UCS 
comprised of an RT task and a potentially 
noxious 100-db. tone as the RT signal. 

The present study represented an effort to 
demonstrate that an RT task that is not 
supplemented by a noxious stimulus can serve 
as an effective UCS for differentially condi- 
tioning the GSR. The use of a long (8-sec.) 
interstimulus interval permitted the analysis 
of multiple conditional responses defined by 
the latency of the GSRs. It was also deter- 
mined whether there was a relationship be- 
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‘when an aversive UCS is used. In addition, 
"T was assessed in relation to conditional 
GSR differention and Ss' ability to verbalize 
"stimulus relations accurately. 
— "This study also sought to establish whether 
‘a group of schizophrenics (Group Z) would 
differ from a group of normals (Group N) in 
conditional GSR differentiation and in the 
ability to verbalize stimulus relations accu- 
| rately. There are a number of previously 
‘established characteristics of schizophrenics 
‘that supported the expectation that Ss in 
"Group Z would be less able to recognize 
the conditioned stimulus (CS)-unconditioned 
‘stimulus (UCS) relations. For example, schizo- 
—phrenics have been reported to have dif- 
ficulties in excluding irrelevant stimuli (Lang 
& Buss, 1965), integrating related stimuli 
"(Chapman & McGhie, 1962), and maintain- 
ing an attention set (Zahn, Rosenthal, & 
Shakow, 1963). It was therefore predicted 
that there would be fewer accurately ver- 
balizing Ss in Group Z than in Group N, 
‘and that there would be less GSR differen- 
— tiation in Group Z. The latter prediction was 
— based on the assumption that differential GSR 
conditioning and the ability to verbalize 
‘stimulus relations are related for schizo- 
-phrenics in a manner similar to that previ- 
ously observed for normal Ss. 


METHOD 
Subjects and Apparatus 


Group N was comprised of 22 women, paid 
volunteers, whose ages ranged between 19 and 48 
u 30.8), and whose educational attainment ranged 
“between 9 and 16 yr. (X = 12.7). Group Z was 
comprised of 22 women whose ages ranged between 
24 and 47 (X = 34.1), and whose educational attain- 
‘Ment ranged between 8 and 16 yr. (X = 12.5). The 
Ss in Group Z were all posthospitalized outpatients 
Who had been prescribed maintenance dosages of 
Phenothiazines. The approximate modal dosage was 
100 mg. daily of chlorpromazine. A subgroup of 
2 Ss in Group Z were administered rapid urine 
E tests for phenothiazines (Forrest, Forrest, & Mason, 
-1961) on the day of the experiment. These tests 
‘indicated that the patients were generally taking less 
‘than the prescribed dosage. Several types of schizo- 
Phrenia were represented in Group Z, 10 Ss being 
agnosed paranoid, 8 undifferentiated, and 4 affec- 
tive. All of the undifferentiated schizophrenics and 5 
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of the paranoids were considered to be chronic, while 
the rest were considered reactive. The time elapsed 
since the initial diagnosis of schizophrenia aver- 
aged 5.1 yr. The mean number of hospitalizations 
was 2.5, and the average length of hospitalization 
was 4.7 mo. 

Exosomatic basal skin resistance and GSRs were 
monitored by a constant voltage system (Montagu 
& Coles, 1966) supplying 4 v. and recorded on an 
Offner type R dynograph. The CSs were two 8-sec. 
tones of 450 and 1,400 cps that were substantially 
above threshold. A pilot group was used to equate 
psychophysically the intensities of the two tones. 
The UCS consisted of illumination of a 6-w. jeweled 
white light mounted on a panel facing S at eye level 
Írom a distance of 42 in. The RT response consisted 
of lifting the middle finger of the preferred hand 
Írom a microswitch. The switch was located on the 
armrest of S's chair, and required no more than 
the weight of the finger for its depression. Hunter 
timers controlled temporal intervals, and an Erie- 
Pacific digital timer recorded RT in milliseconds. The 
RT signal was presented coincident with tone offset 
and was terminated by release of the switch. 


Procedure 


The S was seated in a semidark, sound-attenuated 
room adjoining the instrumentation room. The active 
Ag-AgCl electrode was placed on the volar surface 
of the second phalange of the second finger of the 
nondominant hand, and the reference electrode was 
placed on the ventral surface of the forearm. Both 
electrodes were coated with a gel that is isotonic 
with sweat. After attachment of electrodes, the S 
was oriented to the RT task using highly exhortative 
instructions stressing improvement of. performance 
with repeated presentations. Three practice RT trials 
followed, tones omitted. After 2 min. of recording 
with S at rest, instructions for intertrial reports were 
presented. The S was told in effect to state her 
observations, thoughts, and feelings about events 
occurring in the experiment whenever the experi- 
menter said "report" over the intercommunication 
system. No reference was made to the tones. There- 
upon, 8 adaptation, 24 acquisition, and 8 extinction 
trials were administered, with intertrial reports inter- 
spersed between all trials. In each group, half of the 
Ss had the 450-cps tone paired with the RT task, 
and half the 1,400-cps tone. There were no test trials. 
Presentations of the paired tone (CS+) and unpaired 
tone (CS—) were regularly alternated in a single 
alternation pattern. The intertrial intervals averaged 
51 sec. (SD — 19 sec.) for Group N and 54 sec. 
(SD -—23 sec.) for Group Z. Following the last 
extinction trial, all Ss were interviewed with a stan- 
dard series of increasingly informative questions.* 
The reports and interviews were tape-recorded and 
transcribed for subsequent analysis. 


4Copies of the instructions for the RT task, in- 
structions for intertrial reports, the postconditioning 
interview, and the rating schedules for the latter two 
have been deposited with the National Auxiliary 


546 


jT 3 5 7 
ADAPTATION 


ACQUISITION 


Paur E. Barr AND Marcus J. FUHRER 


—CS+ 
---CS- 
© GROUP NA 
e GROUP NI 


[2088 EI TRE 
[I] 13 1S 17 19 
EXTINCTION 


TRIAL BLOCKS 


Fic. 1. Mean first-interval GSR magnitudes for accurately verbalizing normal Ss (Group 
NA, N= 12) and inaccurately verbalizing normal Ss (Group NI, N = 10). 


RESULTS 
Verbalizations 


Three raters independently content ana- 
lyzed the interviews for accuracy of Ss’ con- 
ceptualizations of the conditional stimulus 
contingencies. Raters worked without knowl- 
edge of Ss’ GSR conditioning. Two criteria 
were used to establish the accuracy of Ss’ 
statements of stimulus relations based on 
either a statement that the UCS followed only 
one type of tone or that the UCS appeared in 
a regularly alternating pattern. Classification 
of S as an accurate verbalizer for either inter- 
trial reports or postconditioning interviews 
required two ratings of accuracy not consid- 
ered questionable by either rater, or three 
ratings of accuracy if one or more raters con- 
sidered their rating questionable. For the 
interview ratings, verbalizations that were 
elicited by questions containing information 
about the differential stimulus contingencies 
or the alternating trial sequence were not used 
to classify Ss. 

In Group N, 7 Ss were rated as having 
'verbalized accurately during both intertrial 
Publications Service. Order Document No. 00526 from 
National Auxiliary Publications Service of the Amer- 
ican Society for Information Science, c/o CCM 
Information Sciences, Inc., 22 West 34th Street, New 
York, New York 10001. Remit in advance $1.00 for 


microfiche or $3.00 for photocopies and make checks 
payable to: Research and Microfilm Publications, Inc. 


reports and interviews. Three Ss verbalized 
accurately only during the interview, and 1 
only during intertrial reports. In Group Z, 
1 S verbalized accurately both during reports 
and the interview, and 2 Ss only during the 
interview. Final classification of an S as an 
accurate verbalizer was based on either the 
ratings of the intertrial reports or the inter- 
views. In Group N, 12 Ss were classified as 
accurate, and 10 as inaccurate. In Group Z, 
3 Ss were classified as accurate, and 19 as 
inaccurate (x? = 6.42, df = 1, p < .02, two- 
tailed). 


GSR Measures 


The 8-sec. interval beginning 1 sec, after 
tone onset. was divided into subintervals. A 
GSR originating 1-5 sec. after tone onset 
was scored as a first-interval response. A 
GSR originating 5-9 sec. after tone onset 
was scored as a second-interval response. A 
third-interval response was scored for a GSR 
originating during the interval 9-13 sec. 
following tone onset. Each GSR was mea- 
sured from its onset to its peak and the 
amplitude expressed as the square root of the 
conductance change. 


First Interval GSRs 


Mean first interval GSR magnitudes are 
depicted in Figure 1 for the group of 12 accu- 
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rately verbalizing normal Ss (Group NA) and 
the group of 10 inaccurately verbalizing 
normal Ss (Group NI). The first CS acqui- 
sition trial was omitted from consideration 
because of sensitization effects of the first RIT 
trial. Three factor analyses of variance 
(Groups NA versus NI, CS+ versus CS— 
and trial blocks comprised of pairs of suc- 
cessive CS-- and CS— trials) were calcu- 
lated separately for the three phases of the 
conditioning procedure. During adaptation, 
Groups NA and NI did not differ significantly 
when responses to CS+ and CS— tones 
were combined (F < 1, df = 1/20) nor was 
the Trials X Groups interaction significant 
(F = 1.5, df = 3/60). Differentiation of GSR 
magnitudes to CS+ versus CS— did not occur 
during adaptation for the combined groups 
(F< 1). 

During acquisition, significantly differenti- 
ated GSR magnitudes to CS+ and CS— were 
obtained for Groups NA and NI combined 
(F=5.8, df = 1/20, p < .05). The degree 
of conditioning differed significantly between 
the groups (F= 5.5, df = 1/20, p< .05), 
with conditional GSR differentiation found 
only in Group NA. Groups NA and NI com- 
bined did not differentiate significantly during 
extinction (F <1), nor did the two groups 
differ in this respect (F = 2.0, df= 1/20). 
The groups did not differ in combined reactiv- 
ity to CS+ and CS— during acquisition or 


72 


63 


QU 3 5 7 
ADAPTATION 


547 


extinction (F = 1.7, df —1/20; F< 1, re- 
spectively). It may be noted that differential 
GSR conditioning does not require incremental 
responding to CS+, but simply that there be 
differential responding to the CS+ and CS—. 

Mean first-interval GSR magnitudes for 
Groups NA and NI combined (Group N) and 
for Group Z are shown in Figure 2, Three 
factor analyses of variance (Groups N versus 
Z, CS+ versus CS—, and trial blocks com- 
prised of successive CS+ and CS— trials) 
were calculated separately for adaptation, 
acquisition, and extinction. During adapta- 
tion there was a nonsignificant tendency for 
greater reactivity in Group N (F=34, 
df = 1/42, p < .10). During acquisition trial 
blocks the degree of differentiation between 
CS-- and CS— was significantly greater for 
Group N than for Group Z (F = 58, df= 
1/42, p < .05). As shown in Figure 2, condi- 
tional differentiation of first-interval GSRs 
was not observed in Group Z. Because only 
three Ss could be classified as accurate ver- 
balizers, assessment of GSR differentiation 
within Group Z in terms of the accuracy of 
Ss verbalizations was not attempted. Acqui- 
sition reactivity levels to CS+ and CS 
combined did not differ between Groups N 
and Z (F = 1.3, df = 1/42). During extinc- 
tion, no differentiation was observed in either 
group, and there were no differences in re- 
activity levels. 


Lu 13 15 
ACQUISITION 
TRIAL BLOCKS 
Fic. 2. Mean first-interval GSR magnitudes for normal Ss (Group N, 
and schizophrenic Ss (Group Z, N=22). 
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TABLE 1 
MEAN SEcoxp-INTERVAL GSR MAGNITUDES 
Trial blocks 
4 
Group N peterem Acdailiton Eondon 
cs+ cs- cs+ cs- cs+ cs— 
NA 12 13 A2 42 .07 41 .05 
NI 10 14 07 E 2 4 a 
N(NA + NE 22 14 10 : 4 4 : 
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Second- and Third-Interval GSRs 


Second-interval magnitudes are presented 
in Table 1. The distribution of these data 
suggested the use of nonparametric analyses, 
Two-tailed tests of significance were used 
throughout. For each S, second-interval ampli- 
tudes were summed over trials separately for 
CS+ and CS— within each phase of the ex- 
periment. During adaptation, Groups NA 
and NI did not differ when responses to the 
CS+ and CS— tones were combined (U = 55, 
df = 12/10), nor did either Groups NA or NI 
demonstrate GSR differentiation (T — 13, 
df=9; T — 3, df = 5, respectively). During 
acquisition, Groups NA and NI combined 
showed a statistically nonsignificant tendency 
toward differentiation (T — 67.5, df — 21, 
p < .10). When Groups NA and NI were 
considered separately, GSR differentiation ap- 
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Fic. 3. Mean RT for the accurately verbalizing 
normal Ss (Group NA, N = 12), the inaccurately 
verbalizing normal Ss (Group NI, N = 10), and the 
schizophrenic Ss (Group Z, N = 22). 


proached statistical significance for Group 
NA (T= 15, df= 12, p= .06) but not for 
Group NI (T — 21, df — 9). When Groups 
NA and NI were compared directly, it was 
found that the two groups did not differ sig- 
nificantly in differentiation (U — 37, dí — 
12/10) or in reactivity to CS+ and CS— 
combined (U — 56, df — 12/10). During ex- 
tinction, Groups NA and NI combined did not 
differentiate (T — 54, df — 17). Significant 
differentiation was found in Group NA 
(T = 10, df= 11, p < .05) but not in Group 
NI (T— 5, df= 6). Direct comparison of 
the groups revealed greater differentiation for 
Group NA (U = 23, df = 12/10, p < .02). 
Analysis of second-interval GSRs for Group 
Z produced no statistically reliable results. 
The schizophrenic Ss did not differentiate 
during acquisition (T = 64, df = 20), nor did 
comparison of the differentiation of Groups N 
and Z differ significantly (U = 236, df= 
22/22). Furthermore, Groups N and Z did 
not differ in reactivity to CS+ and CS— 
combined (U = 214, df = 22/22). During ex- 
tinction, Group Z did not differentiate (T — 
82, df — 16), and Groups N and Z did not 
differ in either differentiation (U — 175, df — 
22/22) or reactivity (U = 236, df = 22/22). 
Mean third-interval GSR magitudes during 
extinction trials were Group NA, CS+ = .12, 
CS— = .12; Group NI, CS+ = .06, CS— = 
-10; Group Z, CS+ = .12, CS— = .10. There 
was no tendency for these results to be sta- 
tistically significant for any of the groups. 


Reaction Time 


The RT curves for Groups NA, NI, and Z 
are displayed in Figure 3. In each group, RT 


DIFFERENTIAL CONDITIONING OF THE GSR 


decreased from early to late trials. The groups 
did not differ in RT on the first trial, nor in 
extent of reduction of RT from early to late 
trials. Mean RT over the final four ac- 
quisition trials differed significantly between 
Groups N and Z (U = 98, df = 22/22, p< 
02) and between Groups NA and NI 
(U= 12, df = 12/10, p < .002). A similar 
comparison between Groups NI and Z was not 
significant (U = 69, df = 10/22). 

To determine a possible relationship be- 
tween conditional GSR differentiation and RT 
in Group N, rank-order correlations were 
performed. A GSR differentiation score was 
computed by subtracting the summed GSR 
amplitudes during the last four CS— trials 
from the summed GSR amplitudes during the 
last four CS+ trials. Between the first- 
interval GSR differentiation score and mean 
RT during the last four trials (r= —.38, 
df = 22, p < .05) and between the second- 
interval GSR differentiation score and mean 
RT (r= —48, df= 22, p< .05). Failure 
to obtain conditioning in Group Z precluded 
similar correlational analyses in that group. 

Several supplementary correlations were 
computed to ascertain whether RT was re- 
lated to GSR indexes other than the differen- 
tiation score. Considering the final four trials, 
mean RT was not significantly correlated with 
the first-interval GSR amplitudes to CS+ 
(r= 29, df = 22) nor with second-interval 
amplitudes to CS+ (r = .13, df = 22). The 
magnitude of third-interval GSRs accompany- 
ing RT was not significantly correlated 
with mean RT during the final four trials 
(r = —.22, df = 22). 


Discussion 


The present results indicate that GSRs of 
normal Ss can be differentially conditioned 
using an RT task as the UCS. During the 
acquisition phase, statistically significant dif- 
ferentiation was obtained for first-interval 
GSRs, and a tendency toward differentiation 
was observed for second-intervals GSRs. Dur- 
ing extinction, differentiation of both first- 
and second-interval responses was abolished, 
and there was no evidence for third-interval 
differentiation. The present procedure may be 
particularly attractive in studying GSR condi- 
tioning of Ss such as children and the elderly 
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for whom the use of a noxious UCS is 
undesirable, 

Certain characteristics of the present pro- 
cedure may have facilitated conditioning. 
Instructions for the RT task were highly 
exhortative, presumably enhancing the Ss’ at- 
tention to experimental events and their 
motivation to perform the RT task. Since the 
two conditional stimuli were presented in a 
single alternation pattern, conditional differen- 
tiation could be based on the alternation of 
the tones as well as on the differences in fre- 
quency between them. The insertion of inter- 
trial reports may have facilitated conditioning 
and increased the number of accurate verbal- 
izers. Such effects had been previously ob- 
served using shock as the UCS (Fuhrer & 
Baer, 1966). 

Both direct and indirect comparisons of GSR 
differentiation between normal Ss who could 
accurately verbalize conditional stimulus rela- 
tions (Group NA) and those who could not 
(Group NI) showed reliable conditioning in 
Group NA but not in Group NI. During the 
acquisition phase there was both first- and 
second-interval GSR differentiation for Group 
NA. During the extinction phase, second- 
interval differentiation by Group NA was not 
abolished, although first-interval differentiation 
was. These findings are generally in accord 
with those obtained when a shock UCS was 
used (Baer & Fuhrer, 1968; Fuhrer & Baer, 
1965, 1966). The concordance between condi- 
tional GSR differentiation and accurate 
verbalizations of stimulus relations evidently 
supersedes effects attributable to the type of 
UCS, aversive or nonaversive. 

The schizophrenic group did not show dif- 
ferential GSR conditioning in any of the 
response intervals during either acquisition or 
extinction trials, Our results using an RT task 
as the UCS appear consistent with those ob- 
tained previously using an aversive UCS. 
Peters and Murphree (1954) report some evi- 
dence that schizophrenics show less differen- 
tial conditioning of the GSR than normals 
using extended interstimulus intervals and 
either shock or a claxon as the UCS. The 
more recent studies of the simple GSR condi- 
tioning of schizophrenics and normals have 
not yielded consistent findings. Pishkin and 
Hershiser (1963) failed to find simple GSR 
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conditioning in a schizophrenic group, while 
Howe (1958) found that the GSR condition- 
ing of schizophrenics and normals did not 
differ. 

The conditionability of schizophrenics may 
also reflect the clinical population being 
sampled, though this was not evident in the 
present sample. Individual differences in first- 
and second-interval GSR differentiation were 
scrutinized within the schizophrenic group, but 
there seemed to be no systematic differences 
between Ss classified as chronic versus reactive 
or paranoid versus nonparanoid. 

As predicted, it was found that the fre- 
quency of inaccurate verbalizers was associ- 
ated with the presence of psychopathology. 
Of the 22 schizophrenics, 19 could not de- 
scribe the conditional stimulus relations dur- 
ing either intertrial reports or the interview. 
In the normal group of 22 Ss, only 10 failed 
to verbalize accurately. The failure of the 
schizophrenics to verbalize accurately did not 
appear attributable to a marked attentional 
deficit or to their inability to communicate 
coherently. The patients were able to com- 
municate well enough to be able to maintain 
themselves in the community under outpatient 
care. Pathognomonic types of bizarre or au- 
tistic verbalizations were rare. Finally, con- 
tent analysis of verbal reports elicited between 
adaptation trials indicated no difference be- 
tween patients and normals in the number of 
references made to the tones and the impend- 
ing RT task. This finding suggests that the 
patients were capable of verbalizing attention 
to experimental events. 

The generally convergent results provided 
by the schizophrenic and normal groups have 
implications for understanding the differential 
conditioning of the human adult as well as 
for interpreting the deficient conditionability 
of schizophrenics. Consistent with the results 
of our previous studies, an absence of condi- 
tional GSR differentiation was associated with 
inability to recognize the differential stimulus 
relations. Differential GSR conditioning was 
only manifested by accurately verbalizing 
normal Ss. It was not possible to assess the 
conditioning of accurately verbalizing schizo- 
phrenics since there was only three of these 
Ss and their GSR differentiation scores varied 

markedly. A characterization of the classical 
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conditioning process as a relatively primitive, 
automatic form of learning seems inadequate 
to account for this pattern of results. The 
differential classical conditioning of autonomic 
responses in the human adult appears to be 
closely related to a conceptual process re- 
flected by the ability to verbalize stimulus 
relations accurately. When such a conceptu- 
alization fails to occur, conditional differentia- 
tion is not observed. Of course, it cannot be 
asserted that this conceptual process fails to 
occur for the same reasons for inaccurately 
verbalizing schizophrenics and normals. It 
does seem consistent with this viewpoint that 
the conditional GSR differentiation of the 
schizophrenic group was found to be markedly 
deficient compared to the normal group. Im- 
pairments of conceptual activity are a well- 
documented characteristic of schizophrenics 
and seem to have been reflected in a notice- 
ably higher incidence of inaccurate verbalizers 
within this group. 

The heuristic value of considering parallels 
between classical conditioning and RT pro- 
cedures has been pointed out previously (e.g., 
Behar & Adams, 1966; Woodworth, 1958). 
The present study provided the unusual op- 
portunity to combine these procedures, per- 
mitting a joint comparison among indexes of 
conditional GSR differentiation, ability to 
recognize stimulus relations, and RT. A par- 
ticularly salient finding in the normal group 
was that accuracy of verbalization and GSR 
differentiation were both mirrored in RT. 
That is, the RT of accurate verbalizers was 
significantly faster than inaccurate verbalizers, 
and differential GSR conditioning scores cor- 
related significantly with the RT. Comparison 
of the normal and schizophrenic groups 
yielded a similar pattern of findings. The RT 
of the schizophrenic group was significantly 
longer than that of the normal group, and the 
schizophrenics were characterized by inaccu- 
rate verbalization of stimulus relations and an 
absence of conditional GSR differentiation. 

_ This pattern of results suggests that recog- 
nition of the CS-UCS relations by the accu- 
rately verbalizing normal Ss permitted these 
individuals to use the CS4- tone as a ready 
signal for making the RT response. In accord 
with previous studies reviewed by Woodworth 
and Schlosberg (1954), availability of a ready 
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signal was reflected in shorter RTs than shown 
by either inaccurately verbalizing normals or 
schizophrenics. Perception of the CS+ tone 
as a ready signal also allowed the accurately 
verbalizing normal Ss to adopt a preparatory 
set for making the RT response. Such a set 
probably embodies a complex pattern of 
physiological changes including increased 
muscle tension (e.g., Davis, 1940) as well as 
the differentiated GSR activity revealed in 
the present study. 

It has been suggested that fundamentally 
different processes underlie the conditioning of 
first-, second-, and third-interval GSRs. For 
example, Gale and Stern (1967) interpret the 
differentiation of first-interval GSRs as condi- 
tioning of the orienting response to onset of 
the CSs, Second-interval GSRs are viewed as 
anticipatory responses representing prepara- 
tion for delivery of the UCS. If responses in 
these intervals reflect different underlying 
mechanisms, GSR activity within each should 
bear a different functional relationship with 
other variables, In the present study, however, 
both first- and second-interval GSR differen- 
tiation for the normal group was similarly 
related to the accuracy of Ss’ verbalizations. 
Furthermore, the schizophrenic group showed 
no GSR differentiation in any of the intervals, 
and both the first- and second-interval GSR 
differentiation for the normal group was simi- 
larly correlated with RT. These results do not 
justify a sharp distinction between the proc- 
esses underlying GSR activity in the various 
intervals. 

There is a remaining question concerning 
possible effects of the phenothiazine medica- 
tion upon results obtained for the schizo- 
phrenic group. It is difficult to assess the pos- 
sibility that these agents affected the relevant 
Cognitive functioning of Ss in this group. 
Chapman and Knowles (1964) have shown 
that phenothiazine treatment has the dual 
effects of improving conceptual activity and 
decreasing alertness in these patients. Other 
studies indicate that phenothiazine treatment 
improves intelligence test performance in cata- 
tonic patients (Gilgash, 1961), but not in 
newly admitted schizophrenics (Judson & 
MacCasland, 1960). It is unlikely that drug 
effects were responsible for the depressed RTs 
of the schizophrenics. Neither Heilizer 
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(1959) nor Sibilio, Andrew, Dart, Moore, and 
Stehman (1959) found significant alterations 
of schizophrenics’ RTs due to phenothiazine 
treatment. Finally, there is no available evi- 
dence that these drugs affect the conditional 
GSR differentiation of schizophrenics. While 
there is an indication that phenothiazines af- 
fect GSR indexes such as the frequency of 
nonspecific GSRs in schizophrenics (Bernstein, 
1967), Crider and Tursky (1967) have shown 
in normal Ss that the frequency of nonspecific 
skin potential responses bears no relationship 
to their conditional differentiation. 
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FREQUENCY OF PSYCHOTHERAPEUTIC SESSION AS A 
FACTOR AFFECTING OUTCOME: 
ANALYSIS OF CLINICAL RATINGS AND TEST RESULTS' 


CHRISTOPH M. HEINICKE ? 
Reiss-Davis Child Study Center, Los Angeles, California 


Two groups of boys referred for learning disturbances were seen either in 
once-a-week or in four-times-a-week psychoanalytic psychotherapy. Children 
seen more frequently showed greater ego flexibility by the end of treatment, 
as well as a more adequate level of ego integration during the follow-up period. 
Analysis of test results indicated that these same children improved at a faster 
rate in tests of reading, spelling, and vocabulary, and responded more flexibly 


to the Rorschach cards. 


The importance of the frequency with 
which a patient is seen in psychotherapy has 
long been recognized, but few systematic 
studies are available. Imber, Frank, Nash, 
Stone, and Gliedman (1957) found that adult 
patients evaluated after 6 mo. showed 
greater gains in social effectiveness if seen 
once a week as opposed to every other week. 
By contrast, Lorr, McNair, Michaux, and 
Raskin (1962) found that the variation of 
every other week, versus once a week, versus 
twice a week, made little difference in the 
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Katz, and Stella Gunther for their devoted assistance. 

Requests for reprints should be sent to Chris- 
toph M. Heinicke, Reiss-Davis Child Study Center, 
ee Pico Boulevard, Los Angeles, California 


status of the patient 4 and 8 mo. after 
treatment was initiated. For patients in 
treatment at least a year, however, those seen 
on a more frequent basis “describe them- 
selves as more outspoken, assertive, indepen- 
dent, and determined to protect their own 
interests.” A follow-up of a part of this initial 
sample (McNair, Lorr, Young, Roth, & Boyd, 
1964) found significant relations between the 
number of interviews and 4 of the 13 outcome 
criteria. 

The present study compared two groups of 
boys in terms of treatment and outcome 
variables. One group was seen once a week 
and the other group four times a week. Two 
psychoanalytic child therapists with similar 
training and experience treated the children. 
All the mothers were seen on a once-a-week 
basis, and, where appropriate, the fathers 
also had contact with a therapist. Using cer- 
tain broad diagnostic considerations, cases 
were assigned to matched samples on a case- 
to-case basis. All treatments were terminated 
at the request of the parents and with the 
consent of the therapist. Information on the 
process variables was derived from the record 
dictated by either the child’s or parent’s 
therapist immediately after the sessions, but 
is not discussed in this paper. Information on 
the outcome was derived from multiple as- 
sessments at the beginning, end, and 1 and 2 
yr. after the end of treatment. 

The total data available on each child for 
each assessment point were initially organized 
in terms of the Diagnostic Profile (Freud, 
1962). Further analysis of the outcome data 


553 


554 


involved clinical ratings of the Profile ma- 
terial as well as test scores. It is the purpose 
of this paper to show that the results of this 
analysis are consistent with previously formu- 
lated clinical hypotheses (Heinicke, 1965a). 
The most general of these hypotheses was that 
the children seen four times as opposed to 
once a week make greater progress during the 
follow-up period in the level of ego integra- 
tion, flexibility, and adaptation. It was an- 
ticipated that this differential growth would 
be reflected not only in clinical ratings of 
these attributes but also in IQ scores, Ror- 
schach responses, and the rate of improvement 
in academic subjects. 


METHOD 
Methods of Assessing the Children 


At the beginning and end of treatment, the follow- 
ing data were available: interviews conducted by 
therapists with the parents and teachers, the sessions 
of therapist and child, one or two interviews with a 
child psychiatrist, and the results of extensive test- 
ing with the child (Revised Stanford-Binet, Form L, 
the Wide Range Achievement Test, the Rorschach, 
parts of the TAT and Michigan Picture Test, and 
the Draw-a-Person Test). 

This same information was again available at the 
first and second follow-ups, but the child was not 
seen again by the diagnostic psychiatrist, and the 
second follow-up included no testing. Scores on read- 
ing tests were available from the school records. 


Subjects 


Data analysis, including group comparison, was 
carried out on two samples of four boys each who 
(a) ranged in age 6 yr. 8 mo.-10 yr. 5 mo.; (b) were 
referred primarily for a learning disturbance associ- 
ated with psychological difficulties; (c) were threat- 
ened with being or had already been held back in 
school; (d) were unable to profit from supplemental 
educational help; (e) achieved a Stanford-Binet IQ 
of 90 or better; (f) came from intact business or 
professional families; and (g) were in treatment 
from 14 to 23 yr. Case-to-case matching insured that 
the two groups did not differ in regard to the above 
characteristics. They also did not differ at the begin- 


ning of treatment on any of the ratings which differ- 
entiated them after treatment, 


Ratings Based on the Diagnostic Profile 


The Diagnostic Profile provides a common frame- 
work and language in narrative form, for making 
inferences from multiple assessments of the child 
and his family. A full example is given in Heinicke 
(19652). 

The forty-five 10-point ratings used to describe the 
children reflect a number of important theoretical 
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distinctions suggested by the Profile Outline, as well 
as the nature of the problem being studied. The 
examples of the ratings cited below are central to the 
results which emerged from the data analysis. In 
addition to much more specific definitions, character- 
istics of Points 10, 5, and 1 for each rating were 
specified descriptively.* 


1. Child's level of ego integration—To what ex- 
tent is the child's functioning characterized by unity? 
To what extent do important areas of functioning 
not contribute to his adaptation and how effective is 
that adaptation? 

2. Extent to which the child's defenses are bal- 
anced—To what extent is the defensive organization 
characterized by the use of many of the important 
mechanisms or the excessive use of single ones? 
(Freud, 1962). 

3. Child's capacity to express a variety of affects— 
To what extent are a variety and balanced use of 
affects available to the child? 

4. Capacity to form peer relationships—What is 
the number and quality of the peer relationships 
which the child can initiate and sustain? 

5. Extent of development from dependency to 
emotional self-reliance—Where along the line from 
the newborn’s utter dependence on maternal care to 
the young adult’s self-reliance should the child be 
placed? (Freud, 1963). 


Reliability of the ratings. The reliability of the 
ratings by the therapist was checked in three ways. 
The first was to correlate the therapist’s ratings 
with those done on the same Profiles by another 
clinician. The median Q-type coefficient was .85; the 
N for each computation was 45 or the number of 
ratings. A second check was to correlate the ratings 
based on the raw clinical data. The median coeffi- 
cient between the therapist and psychiatrist was 
.75. A third check was to estimate the reliability of 
the factor score, 


Analysis of Test Scores 


All test results were initially analyzed clinically 
(Heinicke, 1965a). Individual Rorschach score summ- 
aries were later derived by a person not familiar 
with the frequency of treatment each child experi- 
enced. 

The grade level scores in reading, spelling, and 
arithmetic derived from the Wide Range Achieve- 
ment Test were correlated with the grade place- 
ments made by the teachers, The median correla- 
tion was .70 (p< 05).4 


*'The complete list of the definitions of the 45 
ratings and details of the subsequent analysis may be 
obtained by writing to the author. 

*Rate of change was used in all further data 
analysis. For the beginning of treatment the gain 
made up to that point was divided by the number 
of months the child had been in school. Given the 
low association between the rate of improvement 
before treatment and that during the first year after 
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RESULTS 
Differences on Factor Scores 


" There were no significant group differences 
jn group mean ratings at the beginning of 
treatment, but three statistically significant 
differences indicating greater ego flexibility in 
‘the children seen four times a week emerged 
by the end of treatment; 39 of the 45 dimen- 
sions showed a statistically significant differ- 
ence by the first or second follow-up points. 
“Since many ratings were intercorrelated, a 
principal components factor analysis (with 1s 
in the diagonal and with rotation of all factors 
"with an eigenvalue greater than 1) was car- 
‘tied out on the table of intercorrelation of 
the 45 ratings computed for each of the four 
‘assessment points. 


The Four Factors 


Of the factors isolated, four were obtained 
‘for each assessment: (a) level of ego inte- 
gration, (5) extent of ego flexibility, (c) ca- 
“pacity for forming peer relationships, and 
(d) extent of self-reliance. Those ratings as- 
sociated with a factor to the extent of .625 or 
. better at three of the four assessments points 
— further defined the factor. 

From the first factor, level of ego integra- 
tion, which at the various assessment points 
accounted for 50-69% of the variance, the 
following rating items remained after the 
nonsignificant contributions had been elimi- 
mated: level of ego integration, capacity 
to sublimate, absence of primary process, 

— effectiveness of the defenses, capacity to toler- 
ate anxiety, level of self-esteem as a function 
_ Of the child's achievement, ability to assert 
4 himself, extent of academic ambition, ability 
to persist in a task, realistic concern about 

academic progress, absence of denial or avoid- 
— ance of the academic problem, progress made 
in libidinal development, predominance of 
Progressive over regressive forces, and the 
“congruence between various developmental 
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E The factor of ego flexibility accounted for 
2-17% of the variance. Four ratings loaded 


on the factor to the extent of .625 or better 


p 
‘treatment, as well as the high reliability of the test 
Scores, it was not felt necessary to compute true 
Bain scores (Lord, 1956). 
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TABLE 1 
Sum or FACTOR SCORES FoR CHILDREN IN ONCE AND 
Fovz-TrwEs-A-WEEK THERAPY 
Point of treatment 
Factor 
First Second 
End follow- follow- 
up up 
Level of ego 
integration 
Once a week 47|—160 | —248 
Four times a xx 
week —47| 160 248 
Extent of ego 
flexibility 
Once a week -23|— id - fe 
* * 
Four times a 2.3 5.3 73 
week 
Capacity 
for peer 
relations 
Once a week NE das — 63 
Four times a P 
week = 8 13 6.3 
Extent of self- 
reliance 
Once a week — 4|— 13 —41 
Four times a 
week ES 13 14 


* Difference significant at p < .05. 
** Difference significant at p < .01. 


on at least three of the assessment occasions: 
capacity to express a variety of affects, extent 
of balance in the defense organization, nega- 
tively with the extent of repression of aggres- 
sion, and nondefensive use of humor. 

The factor of the capacity to form peer 
relationships accounted for 5-10% of the 
variance and included: capacity to form peer 
relationships, development from egocentricity 
to companionship, and level of self-esteem as 
related to the body image. 

The fourth factor, self-reliance, was again 
named on the basis of several ratings and 
accounted for 4-7% of the variance at each 
of the assessment points. However, only 
one rating, development from wetting and 
soiling to autonomous bladder and bowel 
control, was loaded on this factor at three of 
the assessment points. 

Group differences on four factors. The 
above procedure made it possible to derive 
a score for each child for each factor at each 
assessment point. Differences in group trends 
could thus be analyzed. Using the ratings of 
the total sample of eight children, standard 
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TABLE 2 
p VALUES DERIVED FROM ANALYSIS OF VARIANCE OF 
Four Factors 
Ego Ego Peer Self- 
integra-| flexi- | relation- 

Srg ieira Bert resonance 
Frequency ns .05 ns ns 
Frequency X Assess- 

ment Point .005 .005 01 ns 


scores (M — 0, SD — 1) were computed sepa- 
rately for each rating at each assessment 
point. This procedure allowed summing the 
scores of a given child across the various 
ratings within a factor. Thus a “factor score" * 
was derived, This procedure also results in the 
group mean for all eight children being zero, 
and since the total group was split into two 
(once a week vs. four times a week), the 
two subgroups necessarily had scores that 
together summed to zero. Table 1 gives the 
sum of the factor scores for each of the two 
groups for each assessment point. 

To assess the statistical significance of the 
difference in the trends for the children seen 
once versus four times a week, an analysis 
of variance (trend analysis, Edwards, 1950) 
was carried out on each factor. The values 
resulting from this analysis are summarized 
in Table 2. Most important, it shows that 
there is a statistically significant difference 
between the two  frequency-of-treatment 
groups on three of the factors when the 
assessment point is taken into account 
(Frequency X Assessment Point). This in 
turn warranted the application of £ tests to 
these three factors. In Table 1 it can be seen 
that the children seen four times a week 
showed, by the second follow-up, a more ade- 
quate level of ego integration, ego flexibility, 
and capacity to form peer relationships. Fur- 
thermore, this same group was already more 
flexible in its ego functioning by the time 
of the first follow-up. 

The possible effect of the previous level of 
the rating on subsequent assessment was 
taken into account by computing the follow- 
ing difference scores: beginning to end of 


5'This factor score is an approximation since no 
differential weightings were used. The composite 
score is similar to a test whose items each have unit 
weight. 
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treatment, end to first follow-up, and first to 
second follow-up. Using a ¢ for difference 
scores, the two frequency groups could then 
be contrasted for each factor. The children 
seen four times a week, when compared with 
the once a week children, showed significant 
gains in (a) extent of ego flexibility, from 
the beginning to the end of treatment; (5) 
level of ego integration, from the end to the 
first follow-up; and (c) capacity for peer 
relations, from the first to the second 
follow-up. The values, respectively, were .02, 
.001, and .02.° This analysis, then, again indi- 
cates significant group differences for three 
of the factors, and also pinpoints the time in- 
terval when the children seen more frequently 
made particularly significant gains. 

Reliability of “factor scores.” The nature 
of the data analysis allowed application of 
Lindquist’s formula (Lindquist, 1953, For- 
mula No. 152, p. 361) to estimate the reli- 
ability of the “factor” or composite score 
(see Table 3). Reliability here means the 
intraitem agreement after partialing out indi- 
vidual differences. 


Ego Integration, Total IQ, IQ Change, and 
Environmental Impact 


The association of the factor scores with 
variables independent of the clusters was also 
studied. The ego integration factor scores cor- 
related with the Stanford-Binet IQ scores 
at the beginning, end, and first follow-up 
point to the extent of .96, .91, and .66 
(all ~’s < .01). Moreover, the increase in IQ 
from the beginning to the first follow-up cor- 
related to the extent of .83 (p < .01) with the 
level of ego integration attained at the first 
follow-up point." 

The ego integration scores at the beginning 
of treatment also correlated significantly 
(P < .05) with the rate of achievement in 
reading, arithmetic, and spelling, and nega- 
tively with ratings of extent to which the 


6 Since the regression of the factor scores was not 
the same for the two groups, analysis of covariance 
was not appropriate. It was nevertheless carried 
out and the results are consistent with those reported 
above. 

^ The absolute level of IQ at beginning of treat- 
ment did not correlate significantly with the incre- 
ment in IQ from beginning to follow-up point. 
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"child's environmental influences impede his 
potential for development.* Significant associ- 
tions were not found for other assessment 
points. 


Group Differences in Rate of Academic 
Achievement 


The rate of improvement in the various 
subjects for the children seen once and four 
times a week is given in Table 4; rate of im- 
provement for reading has been graphed in 
Figure 1. By contrast to the other two sub- 
jects, there were no statistically significant 
group differences for the subject of arithmetic 
(the Mann-Whitney U test was used for all 
the analyses of test scores). 

Particularly striking in these results is the 
reversal of group trends in reading: the once- 
a-week children showed a greater improvement 
during the first year of treatment, but then 
their growth decelerated, especially right after 
treatment. 


Group Differences in Total IQ and 
Vocabulary Subtest 


The median total IQ scores for the two 
groups are given in Table 5. The children 
seen once a week by comparison with those 
seen more frequently showed a significant 
drop at the end of treatment ($ = 029) but 
in the year after returned to the level seen 
at the beginning. 

While the median IQ scores were thus not 
different for the two groups after treatment, 
their performance on the vocabulary subtest 
did differ. Comparison of the increments from 


the beginning to the end of treatment and 


from the end to the first follow-up favored 
the children seen four times a week (the 
respective p values were .029 and .057). 


Group Differences on Rorschach Indexes of 
Ego Flexibility 

As part of the effort to develop measures 
of outcome independent of the clinical judg- 


5 To derive an indication of environmental impact 
On the child is clearly difficult. The number of 
Quantitative and qualitative distinctions is vast. The 
Taters were able to differentiate the extent to which 
the environmental influences impede the child's po- 
tential for development. Only those features of the 
environment were considered which do in fact make 
an impact. 
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TABLE 3 


Rewrasriity or “Factor Scores” AT Each POINT 
IN TREATMENT 


Point of treatment 


ein Lud First | Second 

ning End Follow: fallen 
Ego integration 903 .931 944 924 
Ego flexibility "352 | .764 | .845 | .802 
Peer relationships | .873 .708 .828 167 


ment of either tester or therapist, the two 
groups were also compared in terms of quanti- 
tative Rorschach indexes reflecting differences 
in (a) the degree of constrictive control and 
(b) emotional responsiveness. 

The following were considered in relation 
to the degree of constrictive control: the 
number of determinants used, the children's 
percentage of form responses to the Rorschach 
(F%), the percentage of good form responses 
(F4-), and the percentage of animal responses 
(A+ Ad%; Klopfer, Ainsworth, & Klopfer, 
1954). It was found that the children seen 
four times a week used a greater number of 
different determinants at first follow-up; a 
median number of 6 as opposed to 4 (p = 03). 
Their lower median F% (31 vs. 46%) is con- 
sistent with this finding, and since their 
median F+% was 90% and thus close to the 
93% expected for their age, the below aver- 
age F% and greater use of a variety of de- 
terminants is more likely an indicant of 
spontaneity rather than a break in reality 
appreciation. The latter conclusion is sup- 
ported by the fact that these children showed 
an increase from the end of treatment to the 
first follow-up point in the number of popular 
responses ($ = .01). 

There were other indications that the once- 
a-week children were more constricted and 
had a more stereotyped approach to the 
world, They showed a median F+% of 100 
at first follow-up, and an A% that was not 
only significantly greater (p = .06) than that 
of the children seen four times a week 
(61 vs. 53%), but also considerably higher 
than the norm for this age (46%). By con- 
trast, the A% of the children seen four times 
a week decreased from end to follow-up point 
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TABLE 4 


OF IMPROVEMENT IN READING, SPELLING, AND ARITHMETIC FOR CHILDREN SEEN 
Ay ONCE A WEEK AND Four TIMES A WEEK 


Li First year Second yea: 
Acadenile subject before Er vies of treatment after treatment | after treatment 
Readii 
ONE a week 06 rt .08 .02 pe na z 
Four times a week .05 04 .09 A1 Pi 
Spellin; 
Foree, a week 04 .06 .05 on — 
Aes times a week 04 .03 .06 10 — 
Arithmetic 
Once a week .06 .05 40 07 — 
Four times a week .06 .08 .06 .07 — 


and a greater number of different content 
categories were used (p = .06). 

The percentage of responses to the last 
three Rorschach cards has been interpreted 
to indicate “the general responsiveness to 
emotional stimuli from the environment 
whether this is expressed in overt reactivity 
or not [Klopfer et al, 1954].” This per- 
centage was significantly lower (p = .02) for 
the once-a-week as opposed to the four-times- 
a-week children at follow-up (29 vs. 44%). 


Discussion 


It is possible that factors other than those 
associated with frequency of treatment ac- 
counted for the striking differences in out- 
come. Important changes did occur in the 
impact of the family on the child, but these 
were no greater for one or the other group. 
The quality of teaching and the availability 
of supplemental educational experiences were 
also essentially equivalent for both groups. 
The total number of sessions was Positively 


TABLE 5 


MEDIAN TOTAL IQ VALUES FOR THE ONcE-A-WEEK 
AND FOUR-TIMES-A-WEEK CHILDREN 


Point of assessment 


Group Beginni is 
eginning E Year after 
of treat- nd Of | 
ment treatment pem 
Once a week 110 100 109 
Four times a week 106 108 109 


associated with the various outcome indexes, 
but not significantly so in any instance. We 
would expect that if larger samples were 
available, the above variables would be 
sources of variation in their own right. 

The therapist's attitude toward the vari- 
ous frequencies of treatment could also have 
accounted for the differences in outcome. 
Examination of the recordings of his feelings 
about each session revealed no differential 
interest in or differential expectations as to 
the ultimate outcome of the treatment. More- 
over, if the therapist’s attitude toward the 
efficacy of the different frequencies was the 
crucial factor, why did the once-a-week group 
initially show greater gains, and why did the 
two groups not differ significantly on a greater 
number of indexes at the end of treatment? 

Examination of the therapeutic process of 
the two treatment situations suggested instead 
that frequency of session was associated with 
a cluster of process variables which were in 
turn associated with differential outcome. 
Striking differences in both the child’s ma- 
terial and the therapist’s ability to use that 
material have been found and will be reported 
in the future. 

The quality and the pattern of changes in 
the outcome indexes themselves suggest that 
a differential personality reorganization took 
place as a function of the different treatment 
Processes. It is hypothesized that the nature 
of the treatment of the once-a-week children 
was such as to allow for a more immediate 
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ONCE -A-WEEK - TREATMENT 


Ud stt 


FOUR -TIMES -A-WEEK -TREATMENT 


120 
( 094 
RATE OF 
IMPROVE- .080— — — — — — 
Y MENT „063 
Gain in 
Grade 040 
Place- 
ment 
Berasa 
Divide 
by linalar IAAD 
of Months 
in Interval 
i 
1 
| 160 
i 
120 
] 
080— = = = HH 
049 
(IG. 1045 
f 
.000 
The Months Duri 
BEFORE Treat- FIRST 


ment. 


During 
‘SECOND YEAR 
After Treat- 
ment. 


Di 
YEAR AFTER 
Treatment. 


Duri 

Last Year 
Of Treat- 
ment. 


TIME INTERVAL 


Fic. 1. Rate of improvement in reading as measured by the Wide Range 
Achievement Test. 


improvement in the level of ego integration, 
and this in turn permitted more immediate 
advances in the academic learning situation. 
By contrast, the more intensive frequency 
facilitated the regressive repetition of all the 
ramifications of intrapsychic and environmen- 
tal conflicts, and this would initially not favor 
improvement in the symptom area. But by 
the end of treatment it was seen that this 
More extensive and qualitatively different 
Process allowed for a degree of flexibility 
that in combination with an adequate level 
Of ego integration would insure continued 
Progressive adaptation in the follow-up period 
(see Heinicke, 1965a, for elaboration). 
. The combination of outcome indexes used 
in this study not only facilitated the link to 
the process variables but has other advan- 
tages. The special knowledge of the therapist 
Was made use of but his subjective bias was 
i checked both by the assessments of other 
clinicians and by the results of the analysis 


of relevant test scores. A reliable procedure 
for summarizing the behavioral observations 
of different clinicians has been developed 
through the use of the Diagnostic Profile and 
the ratings based on it. Factor analysis of 
these ratings resulted in meaningful factors. 
Scores based on one of these, namely, ego 
integration, were positively associated with 
both the absolute level and the increase in 
the Stanford IQ scores. This, in turn, sug- 
gests the importance of motivational factors 
in accounting for variations in IQ. Studies of 
normal populations have demonstrated a rela- 
tionship between measures of IQ and ratings 
of personality not unlike those included in 
the ego integration cluster (Bayley & Schaefer, 
1964; Sontag, Baker, & Nelson, 1958). 
While further analysis of the present data 
may lead to different results and interpreta- 
tions, the following general conclusions seem 
warranted at this time: (a) Children whose 
learning disturbance is a function of internal 
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and external psychological conflict are in 
many instances not likely to benefit from 
additional educational help; (b) psychoana- 
lytic psychotherapy with this type of child 
and his family does facilitate the child’s pro- 
gressive development, including his use of 
educational opportunities; (c) the child's 
continuous development following the termi- 
nation of treatment is greatly enhanced if he 
has experienced an intensive form of psycho- 
therapy; and (d) studies of treatment which 
do not assess the child after termination are 
seriously incomplete; one of the salient 
Characteristics of the children seen once a 
week was their continued dependence on the 
intervention and guidance of the adult to 
maintain their defensive organization. 
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“Tn a recent exchange of criticisms Davison 
969) and Folkins, Evans, Opton, and Laza- 
(1969) found each other to be guilty of 
ploying a faulty relaxation treatment group 
the investigation of the comparative effects 
‘of systematic desensitization (Wolpe, 1958), 
laxation therapy (Jacobson, 1938), and 
“extinction” or “cognitive rehearsal” (i.e., 
imagining disturbing situations without re- 
laxation training). Davison’s contention is 
“that the relaxation condition employed by 
-Folkins, Lawson, Opton, and Lazarus (1968) 
1 vitiated by the latters’ instructions to Ss 
j “relaxation . . . can be used to your 
vantage to handle the somewhat stressful 
p film you are about to see. Just remain 
as relaxed as you can [p. 103]." This pro- 
cedure of instructing Ss to relax during the 
presentation of actual aversive stimulus ma- 
terial was seen by Davison to change the na- 
ture of the “treatment” condition such that 
it more closely represented "in vivo desen- 
sitization” than relaxation alone. Davison 
(1969) feels this is true because “Folkins et 
s ‘relaxation group’ seems to have experi- 
enced the apparently crucial pairing of anx- 
lety-competing relaxation with stressful stim- 
lli [p. 87]." 
- The retort by Folkins et al. contended that 
-is Davison, himself, who has employed a 
faulty relaxation group in his study of syste- 
Matic desensitization as a “counter condition- 
"ing? process (Davison, 1968). This, they 
"reason, is true because Ss undergoing relaxa- 
tion training in Davison’s study knew at the 
d 1The author thanks Janet T. Spence of the Univer- 
‘sity of Texas for her suggestions and comments in 
e preparation of this paper. 
3 Requests for reprints should be sent to the author 
who is now at the Department of Psychology, Uni- 
ty of Kentucky, Lexington, Kentucky 40506. 


A NOTE ON RELAXATION TREATMENT GROUPS IN 
STUDIES OF SYSTEMATIC DESENSITIZATION 


WILLIAM T. McREYNOLDS 1 


University of Texas 


Recent criticisms by Davison and Folkins et al. of each others relaxation 
training condition in the investigation of systematic desensitization are reviewed. 
It is concluded that a “pure” relaxation treatment condition void of their 
criticisms does not represent a real treatment or treatment analogue. 


time of their treatment the purpose of the 
treatment (to reduce fear of snakes) and, 
thus, “had ample opportunity and motivation 
to think about snakes during the relaxation 
training [pairing anxiety-competing relaxa- 
tion with the disturbing stimulus] and hence 
to contaminate the relaxation treatment with 
elements of desensitization [Folkins et al., 
1969, p. 88].” Further, Folkins et al. argue 
that their relaxation training procedures 
avoided this “flaw” since Ss in their study 
knew only that they were to see a stressful 
film but were unaware of the content of that 
film until the actual showing. Folkins et al. 
concluded that desensitization could have very 
little opportunity to occur, and even though 
their Ss were explicitly told to make use of 
relaxation training during the stressful situa- 
tion while Davison's were not, the procedures 
in the latter study implicitly encouraged Ss 
to do so. 

What both Davison and Folkins et al. 
appear to be advocating for the study of the 
separate effects of relaxation and conditioning 
in systematic desensitization is a "pure" re- 
laxation treatment condition (ie., one free 
of desensitization effects) in which Ss are not 
aware of the target behavior (e.g. reaction to 
a film of a fatal accident, fear of snakes). 
The present author seriously questions such a 
treatment condition. Unless Ss know the focus 
of treatment and the effect treatment is ex- 

ected to have on them, the nonspecific placebo 
effects of their expectations and aspirations 
for change will not contribute to the final out- 
come. This point is especially crucial in com- 
parisons between Ss receiving relaxation treat- 
ment and Ss undergoing systematic desensi- 
tization (or any other form of treatment) 
since, as Folkins et al. (1969) pointed out, 
Ss in desensitization will, by virtue of the 
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nature of the pre- and posttreatment tests 
and the treatment procedures (e.g. anxiety 
hierarchy), have a good idea of the goal and 
purported effects of treatment. The omission 
of these expectation and other placebo effects 
will have an undefinable and possibly serious, 
detrimental effect on the effectiveness of treat- 
ment (Frank, 1959, 1961; Imber, Frank, 
Nash, Stone, & Gliedman, 1957). Without 
the knowledge of the goal and expected result 
of treatment, the experience of relaxation can- 
not be expected to have more relevance to 
Ss’ “symptomatic” behavior than any other 
pleasant experience. 

Davison further contended that for a re- 
laxation condition to be free of flaw Ss must 
not be instructed to attempt to maintain re- 
laxation during crucial disturbing situations. 
Stated differently, Davison’s contention is 
that there must be no “application” instruc- 
tions if the relaxation treatment is to be “re- 
laxation alone.” Folkins et al. (1969) noted 
that the inclusion of such instructions makes 
the relaxation training more similar to the 
clinical application of treatment by relaxation. 
The present author would go one step further 
and insist that instructions to use relaxation 
in the face of disturbing situations are a 
necessary and important part of relaxation 
treatment. Jacobson (1938) stated: “The aim 
is to train the patients to use his own initia- 
tive. He learns to localize tensions when they 
occur during nervous irritability and excite- 
ment and to relax them away [p. 40].” This 
aspect of relaxation training is often over- 
looked by researchers and may prove to be a 
crucial variable in relaxation treatments? 
(Zeisset, 1968). 


: 2W. T. McReynolds, Systematic desensitization, 
insight-oriented psychotherapy, and relaxation ther- 
apy in a psychiatric population. Unpublished doctoral 
dissertation in preparation, University of Texas, 1969. 
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In the light of the preceding discussion it 
appears that the weaknesses Davison (1969) 
and Folkins et al. (1969) find in each others 
relaxation condition are in fact their real 
strengths, and vice versa. It may be that the 
resolution of their discrepant results will be 
found in other, less subtle procedural differ- 
ences than the ones pointed out by the au- 
thors themselves. 
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State University of New York at Buffalo 


AND 


HARVEY PESKIN ? 
San Francisco State College 


Since the role of student has been traditionally reserved for the male in 
Chinese society, it was hypothesized that China-born, female college students 
would feel generally alienated from and in defiance of norms of the parent 
culture. Results on the California Psychological Inventory (CPI) with 43 
male and 43 female China-born college students supported this view, with 
females scoring lower on Socialization and Good Impression. The Chinese 
model of the feminine role (reserve, patience, modesty, etc.) was most re- 
jected by the subsample of females on student visas, and most accepted by 
the Laturalized females (including permanent residents), as reflected on the 
CPI Femininity Scale. Regardless of sex, naturalized and resident students 
clearly outranked visa students on important aspects of psychological health. 


People migrating to another society can be 
expected to carry over the culturally ingrained 
emotional meanings of essential social roles. 
When two cultures are as different as the 
Chinese and the American, new role mean- 
ings often come into conflict with traditional 
ones, with evident implications for personality 
adjustment under the new circumstances. 
Moreover, such discrepancies may be greater 
for one sex than the other, and thus pose in- 
teresting questions relating to sex roles. This 
study is concerned with the differential ad- 
justment of China-born male and female stu- 
dents in America. 

In the role of student, the sexes doubtless 
face differential role strain in Chinese culture. 
Since the sixth century, males only, rich or 
Poor, were allowed to take the imperial exami- 
Nations, based on the Confuscian teachings 
(Hsu, 1955). In 1905, the imperial examina- 
tions were abolished before the decline of the 
Manchu dynasty and the establishment of 
the people's republic.-But the traditional re- 
Spect for the male’s scholarship is still very 


1This study is based on a paper presented at the 
Western Psychological Association Convention, San 
Francisco, 1967. 

?Also Research Associate, Institute of Human 
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lege, 1600 Holloway Avenue, San Francisco, Cali- 
fornia 94132. 


much in evidence, with the college degree re- 
placing the imperial ones. Education for the 
woman was seen to endanger feminine virtue. 
Her status in life was to be a devoted helper 
to the future mother-in-law and a bearer of 
many grandchildren for her, preferably male 
ones. It was only in the last century, with the 
aid of missionaries, that the emancipation of 
Chinese women began, and a number of 
Christian schools and colleges were estab- 
lished. As late as 1910, these schools alone 
provided women with the same educational 
opportunities as men (Liu, 1963). Although 
their educational ambitions have enlarged, 
Chinese girls may still find that their emerg- 
ing role is not fully accepted by all segments 
of Chinese society (DeVos & Abbott, 1966; 
Wright, 1964). 

From this brief analysis, it is reasonable to 
suppose that females in college will feel alien- 
ated from and rebel against traditional expec- 
tations of what is appropriate for Chinese 
females. 

A host of sociopsychological factors may 
either mitigate or aggravate such sex differ- 
ences. Certainly, length of residence in Amer- 
ica would be expected to facilitate the in- 
corporation of the new culture. Over and 
above residence, the Chinese’s eligibility for 
naturalization, under liberalized immigration 
policy, has conferred the constitutional and 
psychological security of full membership in 
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TABLE 1 
NATURALIZED AND Vis MALES AND FEMALES ON THE CALIFORNIA 
Me tes BiG PSYCHOLOGICAL INVENTORY 
Males Females 
i Naturalized 
EP kerais (23) NPIAS w=) (= 21) 

M M M M 
Dominance 47.7 50.9 46.8 51.0 
Capacity for Status 44.3 49.6 42.3 ^ 
Sociability 45.9 49.0 41.2 s 
Social Presence 44.6 51.0* 44.5 ay 
Self-Acceptance 48.9 328. 46.4 js T 
Sense of Well-Being 39.1 46.5 35.5 e. 
Responsibility 45.2 48.8 41,6 48. 
Socialization 50.3 52.9 46.2 46.8 
Self-Control 46.4 49.1 46.3 43.1 
Tolerance 40.0 48.3** 40.1 44.9 
Good Impression 47.3 49.6 44.6 42.1 
Communality 45.3 5n 45.0 48.3 
Achievement via Conformance 45.2 Ei Cga 43.5 46.7 
Achievement via Independence 45.9 54.1* 47.2 51.1 
Intellectual Efficiency 40.1 46.1 39.5 43.1 
Psychological Mindedness 49.2 50.0 46.4 52.2 
Flexibility 44.6 51.0* 49.3 556... 
Femininity 55.7 55.1 50.0 58.6 

* 
EH $35 
Mp cg 


American society. Thus, it may be expected 
that the naturalized Chinese (or those eligible 
for citizenship) will be noticeably more in 
tune with significant American norms and 
thus with themselves, in terms of well-being 
and self-esteem, than a disenfranchised Chi- 
nese group. For this study, the latter will be 
represented by Chinese holding student visas. 


METHOD 
Subjects 


The Chinese Ss were students, living and attend- 
ing colleges in the San Francisco Bay area. They 
were all born in China, and the sample consisted of 
43 males and 43 females. Of these Ss, 20 males and 
21 females were naturalized American citizens or 
permanent residents eligible for citizenship. (Seven 
of these 41 Ss fell into the latter; these 7 have only 
recently been able to satisfy the residency require- 
ment for citizenship and presumably plan to file for 
citizenship in the near future,) These 41 Ss are re- 
ferred to as the naturalized male and female groups 
in this study. Twenty-three males and 22 females 
were on student-visa status. The mean ages of these 
four groups of male and female Ss were 22.3, 21.2, 
24.0, and 20.8 yr., respectively. The naturalized Ss 
have lived in the United States for 11.5 mean yr., 
and the visa Ss, 3.5 yr. Most of the Ss were born on 
the China mainland (65%); the remainder came 


from Hong Kong (30%) and Taiwan (5%). Gen- 
erally, the Chinese from the China mainland mi- 
grated before the establishment of the Communist 
government. 


Procedure 


Most of the Ss were obtained from psychology 
courses at San Francisco State College. Since the 
students are required to take two courses in psy- 
chology to meet the requirements of the college, the 
present sample represents a wide spectrum of cur- 
riculum majors. A few students at other colleges 
were also given the questionnaires. The question- 
naires were filled out anonymously by Ss and re- 
turned by mail. Sixty-five percent of the question- 
naires were returned. It is hoped that later studies 
will obtain a better rate of return than has been 
possible in this voluntary study. 

All of the Ss were given a battery of questionnaires 
which included a Personal Data Form (Fong, 1965) 
and the California Psychological Inventory (CPI, 
Gough, 1957). The present report considered only 
the CPI whose relevance for cross-cultural research 
has been repeatedly demonstrated (e.g., Gough, 1960, 
1966; Gough & Sandu, 1964). 


RESULTS 


The mean scores of the visa and naturalized 
males and females on the CPI scales are pre- 
sented in Table 1. A two-way analysis of vari- 
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ance computed by the method of least squares 
for unequal cell frequencies (Winer, 1962) 
was performed on each scale with main effects 
of sex and citizen status. Results are pre- 
sented in Table 2. 

The effect of sex yielded significant differ- 
ences on two scales, Socialization and Good 
Impression, with the males scoring higher on 
both. 

The effect of citizen status (ie. visa or 
naturalized) yielded significant differences on 
10 of the 18 CPI scales. The naturalized Ss 
obtained higher scores than the visa Ss on all 
of them: Social Presence, Sense of Well-being, 
Responsibility, Tolerance, Communality, 
Achievement via Conformance, Achievement 
via Independence, Intellectual Efficiency, 
Flexibility, and Femininity. Since the last 
named scale owed its significance entirely to 
the wide gap between the female groups (see 
Table 1), it is more accurate to speak ofa 
yield of 9 significant scales on the main effect 
of citizen status. The Femininity scale was 
the only one which showed a significant inter- 
action between sex and citizen status. The 
naturalized females scored significantly higher 
on Femininity than the visa females. 


DISCUSSION 
Males and Females 


It was assumed that the female students 
would experience greater conflict with and 
rebellion against social norms and role expec- 
tations than the male students. This predic- 
tion is supported by the lower scores of fe- 
males on Socialization and Good Impression, 
the two sex-discriminating scales of this study. 
Studies (e.g., Gough, 1965; Stein, Gough, & 
Sarbin, 1966) have shown that the Socializa- 
tion scale deals basically with role-taking dis- 
Position and relates to deviant or rule-break- 
ing behavior. The low scores of the Chinese 
females on Good Impression indicate that 
they either reject or are less aware of the 
socially desirable response to enhance self- 
esteem in their social groups. The present 
view would, of course, presume that such 
Scores reflect the same deliberate disavowal of 
cultural role demands that had been operating 
in their decision to attend college, and possi- 
bly to pursue a career. 
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TABLE 2 
ANALYSIS OF VARIANCE oF CPI SCALES 
Eon Sex Citizen A 
le (S Se SXC |Within 
Social Presence 
MS 9.67 | 733.33 6.30 |107.94 
F 10 6.79* .06 
Bopp he 
S 100.77 | 561.69 50.34 | 83.32 
F 121 6.74* .60 
Socialization 
MS 555.05 | 50.93 21.82 | 87.18 
F 6.37* .58 25 
Tolerance 
MS 53.08 |920.18 68.59 |109.51 
F 48 840** -63 
Sense of Well- 
being 
MS 470.77 | 841.79 26.01 |149.71 
F 345 5.62* AT 
Good Impression 
MS 526.74 .23 | 122.95 | 75.04 
7.02* .00 1.64 
Communality 
MS 94.57 |582.11 83.59 | 98.13 
F .96 5.93* .85 
Achievement via 
Conformance 
MS 220.74 | 497.91 58.63 | 77.75 
F 2.84 6. 75 
Achievement via 
Independence 
S 1047 |777.50 95.52 |104.42 
F 40 7.45** 92 
Intellectual 
Efficiency 
MS 63.34 | 499.26 30.11 |107.95 
F 59 4.63* .28 
Flexibility 
MS 296.42 | 601.17 23.37 |121.49 
F 244 4.95* 19 
Femininity 
MS 35.15 |342.51 |460.11 | 85.33 
F Al 4.01* 5.39* 


Note.—df = 1 for sex, citizen status, and S X C; df = 82 
for within. 
*)«.05 
**p <01. 


Visa and Naturalized Status 


The psychological consequences of the sec- 
ond variable, naturalization, have not, as far 
as the authors know, been reported before. 
Yet its effects seem to be large indeed. The 
naturalized Ss presented a picture of being 
more self-confident (higher Social Presence), 
freer from self-doubt and disillusionment 
(higher Sense of Well-being), more responsi- 
ble and dependable (higher Responsibility), 
more accepting and tolerant of others (higher 
Tolerance), more flexible and adaptive in 
their thinking and social behavior (higher 
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Flexibility), more productive in their intellec- 
tual functioning (higher Achievement via Con- 
formance, higher Achievement via Indepen- 
dence, higher Intellectual Efficiency), and 
more in touch with the implicit and under- 
lying values of the American scene (higher 
Communality). The visa Chinese, on the 
other hand, seemed to experience a degree of 
discomfort severe enough to disorder their 
functioning in these psychological and social 
areas. 

Understandably, immigration status and 
length of American residence were confounded 
in this study. Whether length of residence, ir- 
respective of citizenship (or eligibility), con- 
tributed to the higher CPI scores of the 
naturalized group can be gleaned from within- 
groups correlations. For all of these nine sig- 
nificant CPI scales, correlations within the 
naturalized group were positive, but only 
significantly so on the Responsibility scale (? 
« .10), that is, those residing longer in the 
United States scored higher on this measure. 
It is reasonable to suggest, therefore, that 
citizenship, or the prospect of citizenship, 
encourages a motivational state in which per- 
sonal commitment to the new society is 
achieved by the full granting of its legal guar- 
antees, relatively independent of the actual 
time of residence. 

It is instructive to inspect the pattern of 
sex differences among the nine scales that 
showed a significant main effect for citizen 
status. (The Femininity scale is not consid- 
ered here.) On eight of the nine scales, sig- 
nificant £s (p < .10) were obtained between 
the visa and naturalized groups within one or 
the other sex, but not within both sexes (Ta- 
ble 1). On seven of the eight scales, it was 
the difference between the male groups of visa 
and naturalized students which was signifi- 
cant; only on one scale did the female groups 
differ. Evidently, the contribution of the male 
groups to the main effect of citizen status out- 
weighed that of the female groups. The 
sharper separation of the male groups may re- 
flect the greater social disruption experienced 
by Chinese men on student-visa status. Being 
officially consigned to the limbo of “student- 
hood” while in America, without the legal 
right to plan a professional career here and 
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with the considerably fewer employment op- 
portunities in Free China, may constitute a 
blow to one's identity as a man, especially to 
those in their midtwenties. 
Femininity 

The Femininity scale is of particular in- 
terest on two counts. First, its cross-cultural 
validity has been perhaps the best established 
of the CPI scales (Gough, 1966). Second, in 
this study, it is the only scale on which the 
Chinese males and females exceeded the 
American norms. This finding accurately mir- 
rors the Chinese cultural emphasis on gentle- 
ness, modesty, patience, reserve, and social 
sensitivity. In the present study, it was the 
naturalized female students who scored the 
highest and the visa female students who 
scored lowest, with the male groups falling in 
between. That the visa female students, in 
terms of the overall Chinese average, should 
be least feminine is not surprising in view of 
the unusual self-reliance and independence 
which the Chinese woman must show to make 
her way into higher education. Considered to- 
gether with their low scores on Socialization 
and Good Impression, the visa females, as a 
special subgroup, would then appear to follow 
a style of the “masculine protest,” so well 
known in Western characterology (cf., Yun- 
tang’s, 1935, protest against “those masculine 
women prize winners at college.") But a sur- 
prisingly different style seems to characterize 
the naturalized females. Whereas their equally 
low scores on Socialization and Good Impres- 
sion indicate a similar social estrangement, 
their high Femininity scores point to a com- 
pensatory compliance with feminine role de- 
mands. Since almost all live in America with 
both parents (20 of the 21 girls compared to 
7 of the 22 visa girls), parental influence is 
more assured, not merely for role modelling, 
but as a force to be appeased. We might say 
that the naturalized females are in an ines- 
capable dilemma, from which the visa females 
are at least spared: that of making—or pre- 
tending—peace between their educational am- 
bitions in the permissive atmosphere of Amer- 
ican coeducation and the antifeminist values 
of Chinese parents with whom they continue 
to live in America. 
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This dilemma may diminish where older 
siblings had already helped to accommodate 
the family to western values. The probable 
influence of older siblings on acquiring the 
feminine role is suggested by the diminishing 
Femininity mean scores of the naturalized 
female firstborns (65.5), second borns (58.0), 
and later borns (53.3). Firstborns exceed the 
latter groups, p < .05. (For the visa females, 
all birth-order groups are essentially equiva- 
lent.) Interestingly, no similar birth-order 
differences for the naturalized females occur 
on the Socialization and Good Impression 
scales, where mean scores are essentially 
equivalent across birth order, We are perhaps 
left to conclude that the loyalty of naturalized 
firstborns to Old World norms of femininity is 
more apparent than real, that is, more in the 
nature of role playing than an authentic ac- 
ceptance of the Chinese status quo for women. 
It may also follow that these Ss’ parents 
(their own citizenship status notwithstanding) 
are seen by their daughters as carriers of the 
ethnic culture who must be mollified. Perhaps 
particularly toward sex roles, where parental 
attitudes are likely to have been settled for 
good and all, a display of obedience by the 
first offspring may both disguise and betray 
the alienation between the generations. 
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PREDICTION OF REACTION TO PLACEBO* 


MICHAEL M. NASH ax» FRED M. ZIMRING? 
University of Chicago 


Expectations of drug effect and openness to experience were employed as 
predictors of placebo response. Ninety-nine Ss (mean age 82) received either 
drug or placebo for 8 days, Before drug therapy S's short-term memory was 
tested, expectations about the pill determined, and openness to experience 
ascertained. After therapy memory was tested and S questioned about change 
experienced. An analysis of variance revealed that the two experimental drugs 
had no noticeable effect. A similar analysis of expectations yielded significant 
F ratios. Openness had no predictive value. Expectations correlated sig- 
nificantly with the subjective measure of placebo reaction (experienced change, 
p < 001) as well as with a behavioral criterion (improved short-term memory, 


p < 03). 


Within the last decade there has been 
increasing interest in drug research with 
human Ss. A vast range of drugs has been 
tested including compounds as different in 
their behavioral effects as tranquilizers and 
hallucinogens. Personality research has also 
been intense, especially the study of individual 
differences. 

Despite this activity, a particular topic 
which overlaps the two fields has been con- 
sistently difficult to explore. Uhr (1960) 
pointed out that one of the rarest of objective 
drug experiments is the study of individual 
personality differences that correlate with a 
person’s response to drug therapy. Six years 
later the problem was the subject of a special 
workshop on Prediction of Response to 
Pharmacotherapy (Wittenborn & May, 1966). 
In the published report of the proceedings, 
however, the only chapter which dealt specif- 
ically with the actual prediction of drug- 
induced changes was concerned with avoid- 
ance behavior in rats. Most discussions of 
individual differences and drug effect have 
been limited to post facto attempts at psycho- 
dynamic analysis (Kubie, 1960). The few 
studies which separated the influence of drug 
from personality have used mentally ill popu- 
lations and tranquilizing drugs (Kurland, 
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1960; Wittenborn & May, 1966). Closely 
related to the problem of the influence of 
personality on drug action is the phenomena 
of an experienced or observed drug effect 
when the substance ingested is actually inert. 
Such substances, called placebos, are agents 
which although pharmacologically impotent 
are employed with the definite suggestion that 
they will be helpful. Any changes which occur 
after ingesting a placebo (“placebo effect") 
are interpreted as being primarily psychologi- 
cal in origin. 

The manner in which suggestion can inter- 
act with drugs, especially in producing the 
placebo effect has been of great theoretical 
interest (Katz, 1967). The approach most 
psychopharmacologists would prefer would be 
to predict who will react to a placebo in a 
given drug study in order to control for that 
effect. That is, if it would be possible to 
assess the degree to which an S possessing à 
particular trait would react to a placebo, 
then this effect could be taken account of 
when assessing the effect of the actual drug 
being tested. Making predictions of this 
nature, however, on the basis of previously 
measured personality characteristics has thus 
far met with almost no success (Peterson, 
1966). 

The present study was an investigation of 
the relationship between two carefully selected 
personality characteristics and susceptibility 
to two drugs and to a placebo; the drugs 
and the placebo presented to Ss as having 
the effect of improving short-term memory. 
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The immediate goal of the study was to pre- 
dict which Ss would react to the placebo. 

The two personality characteristics selected 
as independent variables were: (a) S's expec- 
tations (EXP) toward the drug he was to 
receive and (5) S's openness (OPN) to new 
experience. Susceptibility, the dependent vari- 
able, was defined as the degree to which S's 
performance improved on three tasks empha- 
sizing short-term memory (STM) and the 
amount of positive experienced change (ECH) 
he reported. The drugs used were diphenyl- 
hydantoin (DPH), procaineamide (PRO), 
and a placebo (PLA). 

The first of these personality variables, S's 
EXP, is viewed as an important component 
of S's attitude. Several investigators (Frank, 
1961; Katz, 1967; Kubie, 1960) have felt 
that S's attitude is perhaps the most central 
factor in any therapy program in which the 
concern is changing S's behavior. Optimum 
change appears to require a positive, trusting 
attitude toward the agent of change. In drug 
therapy these agents are the Es as well as 
the drugs. In the present study, S was told 
that there was a drug which might help him 
remember better, was assured it would not 
harm him, and was asked to volunteer. The 
degree to which S trusted the Es and actually 
expected to be helped was called S's EXP 
and was construed as being important to the 
placebo response. 

Psychotherapy research by Rogers and 
others (Rogers & Dymond, 1954) has shown 
that a client's OPN to new experience is 
related to positive change while in therapy. 
Tn the present study, OPN is taken to mean 
the ability to be aware of, and to be articulate 
about, events occurring within oneself. It 
seemed likely that persons who rate high on 
OPN would show reaction to placebo. Such 
Open Ss, sensitive to any variance within 
themselves, were thought to be likely to be- 
lieve that the small random changes in their 
daily physical and emotional feelings were 
drug induced. 

The drugs used in the study belong to a 
class of anticonvulsants which act to raise 
the threshold for neuron firing. It is suspected 
(Gordon, 1967) that a decreased threshold for 
neuron firing with increasing age interfers 
with STM. 
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Both DPH and PRO have been shown to 
improve the immediate memory of certain 
strain of aged rats (Gordon, Tobin, Doty, & 
Nash, 1968). The placebo reactor study was 
designed as part of a larger research project 
comparing the effects of the two drugs to that 
of the placebo. When it was determined that 
DPH and PRO had not affected Ss differently 
from PLA on any dimension, both drugs were 
then viewed as placebos and the entire sample 
handled as three groups given placebos, Thus 
the results reported occurred within both 
drug-as-placebo groups and the actual placebo 


group. 
METHOD 
Subjects 


A potential population of 200 Ss was drawn from 
the residents of a Jewish home for the aged in 
Chicago, Illinois. In a series of meetings Ss were told 
that a new pill had been developed which might 
improve their memory for recent events. They were 
assured the pill would not harm them and were 
offered a small sum of money if they would volun- 
teer to take the pill for a week and answer some 
questions, One hundred and one residents joined the 
program, 2 withdrew before receiving the drugs, 
leaving a final sample of 99. Three volunteers had 
to be refused because of other medication they were 
receiving. The Ss ranged in age 62-94 with a mean 
age of 82. Twenty-two men and 77 women 
participated. 


Procedure 


The sample was divided on a random basis into 
three groups and 34 Ss assigned DPH, 32 PRO, and 
33 PLA. A double blind design was used. In this 
type of design, drug identity is coded and is unknown 
to the nurses, interviewers, Ss, and Es. In addition 
to this procedure, all outcome variables were scored 
and analyzed without knowledge of the corresponding 
predictive measures. Experienced nurses administered 
the pills and kept close watch for side effects. Four 
trained interviewers tested a specific group of Ss 
through the entire study. 

Each S was told by his interviewer that the pur- 
pose of the study was to see if the drug would 
affect their memory and tested to establish his initial 
level of STM ability and to obtain EXP and OPN 
measures. This interview, designated Time 1 (Ty), 
was conducted 1 wk. prior to S receiving one of the 
drugs. The second interview (Ta) took place about 
3 hr. after S had received his first drug administra- 
tion. This interview was included to test the reliabil- 
ity of some instruments and to check for possible 
acute drug effect. After S received either DPH, PRO, 
or PLA each morning for 8 days, he was tested a 
third and final time (Ts). Data from Ts included 
measures of STM change and experienced change 
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TABLE 1 


CORRELATIONS OF PREDICTIVE VARIABLES 
WITH RESPONSE TO PLACEBO 


Hypothesis R 
OPN T; with experienced change T; .097 
EXP T; with experienced change Ts KTL aaa 
EXP T; with short-term-memory 


improvement T; to T; 


* p «.03. 
> <.001, 


(ECH). Following the conclusion of the testing, data 
from Ts, Ts, and Ts were scored and the drug code 
revealed. Each group was then compared and found 
to be equivalent in age, sex, and at Ti to be equal 
in initial level of EXP, OPN, and STM. 


Measures 


OPN was measured at T; by asking S for stories 
for Cards 1 and 2 of the Thematic Apperception 
Test (TAT) as well as for a third card especially 
designed for geriatric populations called Park Bench 
(an old man and an old woman sitting together on 
a bench in a park). Each story was scored for OPN 
by using an affect-cognition scale developed by 
Rachman (1965) based on work by Zimring (1962, 
1963). From 1 to 4 points was given for a response 
depending on the degree to which affect was merely 
mentioned, attributed to a character, or developed 
within a conceptual framework. Subscores for the 
three cards were then totaled and the sum designated 
as OPN to experience. 

EXP was evaluated at T; by use of a questionnaire 
asking about S's belief in the Es and in the drug 
and also asking about the ways and amount S hoped 
the drug would help him. Eight questions were each 
scored on a 3-point scale, 2 points being given if S 
indicated that he had positive feelings and expecta- 
tions, 1 point if a neutral or indifferent answer was 
given, and O points if S was pessimistic or cynical 
in his reply. The S's scores on all eight questions 
were then summed and recorded as his EXP. For 
example, the median score of 7 or 8 on this variable 
depicted an S who was either noncommittal or 
unconcerned about what he felt the drug's effects 
might be. 

In addition, at this time the interviewers were 
asked to fill out a status sheet which required them 
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to make judgments (on a 5-point scale) about 12 
personality characteristics of S. For example, S was 
rated on how cooperative, suspicious, and anxious he 
was during an interview. 

ECH was measured at Ts by a set of eight ques- 
tions intended to force S into assessing the particular 
ways in which he experienced the drug as having 
helped (for example, did he think his memory had 
improved?) as well as eliciting his more global feel- 
ing about his experience (would he recommend the 
pill to his friends?). From 0 to 3 points were given 
for a response depending on the degree to which it 
indicated a positive reaction to pharmacotherapy and 
the scores were totaled across the eight questions. 
The median score of about 8 on this variable was 
typical of an individual who reported no feeling of 
change, either positive or negative, as a result of 
medication. The experiences reported, however, 
ranged from very enthusiastic to totally critical. 

Predrug STM ability was obtained from S at Ti. 
This was done by combining his scores on modified 
versions of Digit Span, Logical Memory and Mental 
Control from the memory scale developed and 
standardized by Wechsler (1945). The same three 
tests were given at Ts. Improvement was defined as 
fewer errors made on the tests at Ts than at Tı. The 
improvement seen in STM scores from T, to Ts for 
those Ss with high EXP was not due to the effects 
of initial level of STM ability. This level was the 
same for both high and low EXP groups. 


RESULTS 


An analysis of variance of the drugs (see 
Table 3) revealed that DPH and PRO did 
not differ significantly from PLA on any of 
a dozen outcome variables including ECH and 
STM. Since a drug effect had not been dem- 
onstrated, all three groups were combined and 
treated as a placebo group to simplify inter- 
pretation of the results. 


Hypothesis 1 


The greater S’s OPN prior to receiving 4 
drug (Ti), the more positive is his ECH 
at Ts. 

Table 1 reveals that this hypothesis was not 
supported by the data. Predrug OPN was not 


TABLE 2 
DISTRIBUTION OF PREDICTIVE VARIABLES 
Lower third 
Variable at, | Pepe | Midale saira | Upperencd | Rangeot | a, skew 
OPN 52% 28% 16% 1-13 4 ee e 
EXP 34% 33% 33%, 115 7 2 


Note.—T = time; OPN = openness; EXP = expectations. 
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TABLE 3 
ANALYSIS OF VARIANCE: DRUG ALONE 
Outcome measure Diphenylhydantoin | Procaineamide | Placebo df F ? 
Short-term—memory change 
Raw score 12 09 4.24 2/90 14 24 
Scaled score 30 28 .50 2/90 48 .83 
Experienced change 
Raw score* 9.57 9.03 8.0 2/90 1.58 21 
Scaled score^ 445 31 —0 2/90 1.06 35 
* One-way analysis by expectations alone, F = 6.54, p = .01. 
b One-way analysis by expectations alone, F = .800, p = .006. 
a good indicator of posttreatment ECH. Al- Hypothesis 3 


though the correlation was in the predicted 
direction it was far from significant. Test- 
retest (T,-Tz) reliability on the OPN mea- 
sure was .65, a reflection of both the perform- 
ance variability inherent in a geriatric popula- 
tion and the nature of the TAT itself. How- 
ever, the most probable reason for failure to 
find the predicted relationship is that the 
distribution of scores on this measure fell dis- 
proportionately toward the low end of the 
scale as seen in Table 2. Because S’s T; OPN 
score was not known until after Ts, the Es 
were unable to control for the skewed dis- 
tribution. There were so few Ss who rated 
high on OPN that either the measure used is 
unsuitable for aged Ss, or such Ss differ 
markedly as a group on this variable from 
the college students on whom the instrument 
was developed, and who gave an approxi- 
mately normal distribution of scores. No con- 
clusions as to the predictive value of the 
OPN variable for placebo reactors can be 
made at this time. 


Hypothesis 2 


The higher the EXP S has at T;, the more 
Positive is his ECH at Ts. 

This hypothesis was clearly supported by 
the data (see Table 1). Those people with 
the greatest pretherapy expectations reported 
experiencing the most positive change. The 
EXP scores, when split into high, moderate, 
and low EXP groups and subjected to an 
analysis of variance yielded significant F 
ratios regardless of whether S had received 
DPH, PRO, or PLA (Table 3). This indicates 
that it was high EXP and not drug action 
Which brought improvement in some Ss. 


The greater the EXP S has prior to taking 
the drug, the greater is his improvement on 
STM tasks. 

This hypothesis was also clearly supported 
by the data as shown in Table 1. Those indi- 
viduals with the highest expectations were 
indeed the strongest placebo reactors on an 
objective measure of change (STM). The 
correlation is lower than for ECH but it 
should be noted that the STM change repre- 
sents a specific, observable change in ability 
in reaction to the placebo, rather than a 
report of subjective experience. 

No significant relationships between any of 
the experimental variables (OPN, EXP, 
STM, ECH) and age, sex, or initial level of 
STM ability were found. Table 4, the inter- 
correlations between the four experimental 
variables, indicates that the only significant 
relationships between EXP and ECH and 
EXP and STM change. The predictive varia- 
bles, OPN and EXP, were essentially inde- 
pendent as were the two outcome measures 
ECH and STM. In addition, data from the 
placebo reactors (those Ss with high EXP) 
were analyzed for an acquiescence response set. 


TABLE 4 
INTERCORRELATIONS OF EXPERIMENTAL VARIABLES 
Variable OPN | EXP | ECH 
EXPT 180 
EXP D 
ECH T .097 EH 
STM T; 417 .231* —.044 


=openness; EXP =expectations : ECH =ex- 
eel $ T =time; STM =short-term-memory change, 
*p <05. 


*p c 0l. 
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TABLE 5 


Supyects SHowmnc CHANGE, Tres 1-3, on EXPERI- 
ENCED CHANGE AND SHORT-TERM MEMORY 


Both ECH | No change | Both ECH 


EXP and STM or split and STM 

positive scores negative 
High (N = 34) 22 11 1 
Moderate (N = 31) 13 14 4 
Low (N = 34) 5 14 15 


Note.—EXP = expectations; ECH = experienced change; 
STM = short-term memory. 


If mere acquiescence had been demonstrated, 
the Es could have expected to find that “yes” 
saying on both EXP and ECH had resulted in 
a significant correlation (high EXP and posi- 
tive ECH). Although this would explain the 
finding on Hypothesis 2, there would not be 
any reason to expect that acquiescence had 
resulted in improved STM for this group. 
Table 5 shows clearly that more Ss with high 
EXP had positive STM as well as ECH scores 
than Ss with only moderate or low EXP. 
This consistency in behavioral as well as sub- 
jective improvement among the high EXP 
group eliminates acquiesence as a reasonable 
explanation of the placebo response. 


Discussion 
Although the Es’ as well as the Ss’ expecta- 


tions were for the most part confirmed, sev- 
eral limitations of the study should be noted. 


TABLE 6 


CORRELATIONS OF PERSONALITY VARIABLES 
WITH EXPECTATIONS 


Variables Rat Time 1 
PRU 
ipontaneous i 
Friendly s 
Rapport | .341** 
Cooperative 351** 
Happy .390** 
Motivated .515** 
Total positive A45 
Negative 
Impulsive 047 
Suspicious —.081 
Need for support —.109 
Careful 105 
Anxious —.220* 
Confused —.392** 
Total negative —.288** 
* 
4$ Eon. 
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The measures of EXP and ECH were devel- 
oped for this study and should be validated 
by using them with other Ss receiving other 
drugs. Caution must always be exhibited in 
generalizing from one sample to another: in 
particular the age of the Ss in this study 
may limit the usefulness of the findings. Given 
the fact, however, that the aged person is 
almost twice as likely to require drugs as 
would a person in his 20s, the use of geriatric 
populations in drug research (as compared, 
for example, to college students) is advisable 
at least in terms of implications for clinical 
practice. 

A possible explanation for improvement in 
STM seen in the placebo (high EXP) group 
is that high EXP meant that the aged S was 
sufficiently mobilized to take better advan- 
tage of the practice effects involved in having 
his memory tested three times. This very 
tentative view should be subjected to further 
investigation. It is interesting to note that 
although all 99 Ss volunteered for the study 
a full third of them (those with low EXP) 
had very little belief that the pill would help 
them and apparently were willing to take the 
drug for other reasons than its intended 
purpose. 

The Es had hoped to uncover additional 
personality variables besides EXP and OPN 
which might be useful in identifying the 
placebo reactor and gaining some insight into 
their dynamics. Table 6 is a list of other 
variables, clinical assessments of S by the 
interviewer at T;, which correlated with EXP. 
Hopefully some of these variables will have 
predictive value in future studies of the 
placebo response. If this psychological effect 
can be understood it would be possible to 
accurately assess the pharmacological value of 
the therapeutic agent. 
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TALKING TIME AS AN INDEX OF INTRAFAMILIAL 
SIMILARITY IN NORMAL AND ABNORMAL FAMILIES 


WILLIAM D. WINTER? anp ANTONIO J. FERREIRA ? 
San Jose State College 


In a recent study, Becker and McArdle found that members of normal families 
correlated more highly with each other in the average action or talking length 
of their extemporaneous speech than did members of clinic families. In the 
present research, 127 family triads (with varying diagnoses of the tested child) 
were asked to make up group Thematic Apperception Test stories, and the 
total number of seconds of speech of each family member was correlated with 
that of the other two family members. In contradiction to earlier findings, 
these results indicate that in this interactional situation members of abnormal 
families resemble each other more than do members of normal families. This 
is particularly true of the similarity between father and child, which was much 


greater in abnormal families. 


In a recent study, Becker and McArdle 
(1967) asked individual family members to 
speak extemporaneously on some aspect of 
family life for 5 min. They found that normal 
family members resemble each other in the 
average duration of action length of their 
spontaneous speech more than do members of 
clinic families, presumably indicating greater 
identification in normals. 

"The authors had available data on the total 
number of seconds of speech, or talking time 
(TT), emitted by each of three family mem- 
bers who were conjointly making up three 
Thematic Apperception Test (TAT) stories 
(Ferreira, Winter, & Poindexter, 1966). The 
purpose of the present study was to determine 
whether normal and abnormal child families 
differ in the TT correlations among the triad 
tested. 

METHOD 
Subjects and Procedure 


The families tested were 127 triads of father, 
mother, and child. These families were divided into 
four groups based on the diagnosis of the child: 
(a) 50 normals, (b) 44 emotionally maladjusted, 
(c) 16 schizophrenic, and (d) 17 delinquents. The 
children were at least 94 yr. old, and the family 
members were white, literate, native-born Americans 
in good health, who had been living together as 
a family for more than 2 yr. Each family was pre- 


1 Requests for reprints should be sent to William 
D. Winter, Counseling Center, San Jose State College, 
San Jose, California 95114. 

2The authors express their appreciation to Ken 
Merritt for his help in analyzing these data. 


sented with a set of three TAT cards and instructed 
to make up a story they could all agree upon, which 
would link the three cards together in a specified 
order, including in their story a description of what 
had led up to the situation, what the characters 
were doing and feeling, etc. (Winter, Ferreira, & 
Olson, 1965). This procedure took place three times, 
with three different sets of cards. The basic score 
used in the present study was TT emitted by each 
family member while all three stories were being 
made up by the family. The TT for each family 
member was intercorrelated with the TT for the 
other two members, and the correlations among the 
groups were compared. 


RESULTS AND DISCUSSION 


Table 1 shows the TT intercorrelations 
among the family triad for the normal and 
total abnormal groups as well as for various 
subsamples of the data, broken down by sex 
of child and specific abnormal diagnosis, both 
of which strongly affect the correlations. AS 


can be seen from Table 1, for the sample ‘ 


taken as a whole the TT scores of the normal 
family members do not intercorrelate with 
each other, whereas the scores of all three 
dyads in abnormal families are significantly 
related. It would appear that for this non- 
lexical measure, at least, members of abnormal 
child families resemble each other, whereas 
members of normal families do not, a finding 
at variance with that of Becker and McArdle. 
This reversal of findings seems most evident 
in the father-child correlations, where the dif- 
ference between the normal and abnormal 
groups is significant (Z = 2.4, p < .02). The 
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TABLE 1 


Torat TALKING TIME, IN SECONDS, CORRELATIONS 
AMONG FAMILY MEMBERS, FATHER, 


MotueEr, CHILD 
Father- | Father- | Mother- 
Source mother | child | child | 
Male + female 
Normals 25 09 As, 50 
Abnormals Bee — 4% — 126" 77 
Emotionally malad- 
justed "P 53 34t 4A 
Schizophrenic "37 —04 48 — 16 
Delinquent 27 sou oh 15 3090017 
Male child 
Normals 07 E AT 34 
Abnormals "32" lge un 55 
Emotionally malad- 
justed 38 37 43" 28 
Schizophrenic =19 24 PETI 
Delinquent 39 ‘60* — 17 16 
Female child 
Normals 64 —.13 06 16 
Abnormals AT .50* 04 22 
Emotionally malad- 
justed L53% TOM 30 16 
Schizophrenic 332 teisi Ato: 5 
Delinquent _ — -— 


*p <.05. 
**$ <.01, two-tailed. 


authors used somewhat different measures of 
speech length with different samples engaged 
in different activities. However, it is important 
to realize that the conclusion of Becker and 
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McArdle that normal family members resem- 
ble each other in nonlexical speech more than 
do abnormal family members, thus implying 
greater identification with each other in nor- 
mal families, may be limited to only certain 
kinds of situations, such as private, not inter- 
actional speech. From the present authors' 
data, equally plausible hypotheses could be 
formulated that in problem-solving interac- 
tional settings normal family members are 
free to talk or not talk as much as they 
please, while in abnormal families one or more 
members are more constrained to take their 
speech cues from another, or else they fall 
into a more rigid pattern of similar produc- 
tivity based on habit, particularly for father 
and child. In any case, these hypotheses await 
further clarification. 


REFERENCES 


Becxer, J., & McAnprs, J. Nonlexical similarities as 
an index of intrafamilial identifications. Journal of 
Abnormal Psychology, 1967, 72, 408-414. 

FERRERA, A. J, Winter, W. D., & Pornpexter, E. 
Some interactional variables in normal and ab- 
normal families. Family Process, 1966, 5, 60-75. 

Winter, W. D., FERREIRA, A. J., & Orson, J. L. 
Story sequence analysis of family TAT's. Journal 
of Projective Techniques, 1965, 29, 392-397. 


(Received June 6, 1968) 


Journal of Abnormal Psychology 
1969, Vol. 74, No. 5, 576-582 


ELECTROENCEPHALOGRAM AND RAPID EYE MOVEMENTS 


DURING FREE IMAGERY AND DREAM RECALL 


JOSEPH REYHER + anp HOWARD MORISHIGE 


Michigan State University 


EOG, EEG, EMG, and heart rate of 10 male Ss were monitored under a 
visual imagery condition (free imagery) and a dream recall condition. Only 
the dream recall condition was associated with alpha desynchronization. The 
imagery of the dream recall condition was characterized by more primary 
process, more kinetic imagery, more frequent episodes of imagery, longer 
duration of images, and greater heart rate. Both conditions were associated 
with an equal increase in REMs, which were more characteristic of the 
waking state than those observed during nocturnal dreams. The differences 
between the conditions are discussed in terms of the nature of the tasks in- 
volved and the more blatant evidence of primary process regulation of the 
imagery in the dream recall condition. 


Spontaneous visual imagery (free imagery) 
has been shown to have powerful uncovering 
properties in psychotherapy (Reyher, 1963, 
1968). Resistances are acutely experienced by 
the client and primary process regulation of 
imagery progressively produces more blatant 
representation (derivatives and symbols) of 
repressed material As the uncovering pro- 
ceeds, anxiety and/or psychosomatic symp- 
toms are produced along with even greater 
resistance. The client realizes from his own 
experience that something very disturbing 
inside of him is operating outside of his 
awareness. The goal of psychotherapy is the 
discovery of this pathogenic process. Eventu- 
ally, if the client has sufficient ego strength, 
derivatives and visual symbols of repressed 
conflicts become directly depictive. No in- 
terpretations are required. This process is 
called “emergent” uncovering to distinguish it 
from the more traditional “interpretive” un- 
covering (Reyher, 1968, 1969). 

Laboratory investigations into the psycho- 
dynamic properties of visual imagery are con- 
sistent with clinical experience. Burns (1967) 
found that the frequency of galvanic skin 
responses (GSR) was proportional to the 
amount of primary process intrusions in visual 
imagery. In another investigation Reyher and 
Smeltzer (1968) found that visual images in 
response to emotionally laden stimulus words 

1 Requests for reprints should be sent to Joseph 


Reyher, Department of Psychology, Michigan State 
University, East Lansing, Michigan 48823. 


had greater uncovering properties than verbal 
associations, Visual imagery was associated 
with more frequent GSRs, more primary 
process, more direct representation of drives, 
and less effective defense than was verbal 
association. 

The purpose of the present research was to 
monitor Ss while they were engaged in free 
imagery and dream recall to clarify the rela- 
tionship between visual imagery, rapid eye 
movements (REMs), and the electroenceph- 
alogram (EEG). The authors were interested 
in whether the REMs of free imagery and 
recalled dreams resembled the REMs of 
waking imagery or the REMs of nocturnal 
dreaming. Dement and Kleitman (1957b) 
have hypothesized that REMs represent the 
dreamer’s scanning of his visual images while 
Amadeo and Shagass (1963) view REMs as 
a nonspecific neural mechanism which is 4 
concomitant of attentive activity. According 
to Shimazono, Ando, Sakamoto, Tanaka, 
Eguchi, and Nakamura (1965), REMs occur- 
ring during nocturnal dreams are generally 
greater in amplitude than waking REMs and 
take less time for deflection. Aserinsky and 
Kleitman (1955) noted that REMs during 
nocturnal dreams show relatively jerky mo- 
tions with short arcs (generally .2 sec.) and 
tend to occur in clusters. 

On the basis of the current literature, the 
authors did not expect the EEG to differenti- 
ate between conditions, but it was included 
as a way of providing information about S’s 


576 


Eve Movements, FREE IMAGERY, AND DREAM RECALL 


state of awareness. Since the EEG findings 
turned out to be of considerable interest, 
salient aspects of the literature relating EEG 
characteristics to imagery are reviewed. 

In 1934 Adrian and Matthews confirmed 
Berger's original findings that visual stimula- 
tion desynchronizes the alpha rhythm but that 
visual imagery does not. In contrast, Jasper 
and Cruickshank (1937) reported that the 
alpha rhythm is desynchronized during the 
formation of visual images in the flight of 
after images. They described a number of 
factors causing alpha desynchronization but 
concluded that the “attention value,” “arousal 
value,” or “reaction value” of the stimulus 
situation is the critical psychological vari- 
able. The alpha rhythm is controlled by an 
integrating subcortical pacemaker in the di- 
encephalon which is a primary center of 
integration for nervous processes involving 
the autonomic nervous system. 

Travis (1937) also found that the alpha 
rhythm was not desynchronized during visual 
stimulation, but he observed that the ampli- 
tude of the alpha waves were reduced for 
both visual images of light and objects, with 
the association appearing to be weaker for 
the latter. However, Golla, Hutton, and 
Walter (1943) distinguished among three 
groups of normal Ss in terms of alpha activity 
and reported that visual imagery was the 
most characteristic mental activity of those 
Ss with no alpha during eyes-closed rest. In 
1954, Short and Walter contended that the 
desynchronization of the alpha rhythm in the 
waking state provides an objective index of 
the use of visual imagery in problem solving, 
but Barratt (1956) questioned this contention 
and showed that the amount of alpha de- 
synchronization was unrelated to the degree 
of visual imagery that was involved. He con- 
cluded that alpha desynchronization is a 
function of attention. 

Costello and McGregor (1957) improved 
upon Short and Walter’s procedures and con- 
cluded that alpha desynchronization is the 
result of at least two factors: (a) the vivid- 
hess of the visual image and (5) the extent 
to which thought associations, or what may 
collectively be called the “higher processes,” 
are involved. They assumed that these higher 
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processes suppress visual imagery. In support 
of this contention, they referred to the blatant 
derivatives that are produced by a release in 
ego control, as noted by psychoanalysts and 
“Rorschach workers.” They felt that the 
investigation of imagery and EEG changes is 
a worthy and fruitful endeavor in personality 
research. 

More recent investigations do not remove 
the inconsistencies reported in the earlier 
literature. Foulkes and Vogel (1965) noted 
several stages of sleep onset that were charac- 
terized by vivid visual fantasies and alpha or 
theta waves; however, Stoyva and Kamiya 
(1968) reported that low alpha activity was 
associated with visual imagery in normal Ss 
and in Ss under the influence of LSD. 


METHOD 
Subjects 


Eleven male college students ranging 18-29 yr. 
volunteered as Ss and all but one were readily able to 
experience visual images. The one exception was 
discontinued. 


Scoring 

REM. Voluntary eye movements in the waking 
state with eyes closed are easily detectable; however, 
involuntary eye movements are obscured by artifacts 
caused by muscle movements, brain waves, ocular 
tremors, and blinks. Eye tremors and blinks, for the 
most part, occur vertically and can be detected on 
the vertical electrooculograms (EOG). To reduce the 
effects of these artifacts, only eye movements moni- 
tored on the horizontal EOG were used to detect. the 
occurrence of REMs. 

Eye movements were recorded and measured by 
using procedures described by Dement (1955) and 
by Dement and Kleitman (1957a, 1957b). Conjugate 
horizontal eye movements were monitored by at- 
taching a pair of Beckman biopotential electrodes at 
a point approximately 1 cm. lateral to the external 
canthus of each eye. Vertical eye movements and 
blinks were identifiable by means of supra- and infra- 
orbital biopotential electrodes on the left and right 
eyes, respectively. Bipolar potentials were recorded 
for both horizontal and vertical electrode pairs using 
an electrode attached to the mastoid area of the left 
ear as a ground. 

According to Aserinsky and Kleitman (1955), the 
EOG contains slow (SEM) and rapid (REM) move- 
ments which are distinguished mainly in terms of 
duration and latency. Only REMs were considered 
in the present study, and difficulties in counting very 
small base-line oscillations and borderline SEMs were 
avoided by adopting the following criteria: (a) hori- 
zontal EOG deflection was equal to or greater in 
amplitude than the corresponding deflection on the 
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vertical EOG; (b) amplitude of the horizontal EOG 
was equal to or greater than 4 mm. (10 uv.); and 
(c) duration from base to peak of .5 sec. or less. 
Waking REMs of 75 uv. (30 mm.) (Caldwell & 
Domino, 1967a) or greater in amplitude and occur- 
ring in clusters were considered to be comparable to 
REMs occurring during nocturnal sleep. 

EEG. The EEG was recorded bipolarly from the 
occipital area. Two silver cup electrodes filled with 
Bentonite electrode paste were attached to the skin 
by applying USP collodion. The positioning of the 
occipital EEG was done according to the Interna- 
tional 10-20 Electrode System (Jasper, 1958). The 
S's earlobes wired in parallel served as a common 
reference electrode. 

Due to the limitations involved in scoring attenu- 
ated alpha rhythm activity by visual inspection, 
occipital alpha was defined as the presence of not 
less than 3 consecutive waves of 8-13/sec frequency 
and 2 mm. (5 uv.) or greater in amplitude. Percent- 
age of time occipital alpha was defined as the 
duration of the total observed occipital alpha activ- 
ity in seconds multiplied by 100 and divided by the 
length in seconds of the total duration of the episodes 
or period examined. 

Heart rate. For interest, finger pulse was included 
io obtain an index of heart rate. The finger-pulse 
sensoring device, provided by Ax and described in 
Ax, Beckett, Cohen, Frohman, Tourney, and Gottlieb 
(1962), was applied with elastic tape to the fore- 
finger of S's nonpreferred hand. 

The EEG, EOG, and finger pulse were monitored 
on the Grass No. 5 polygraph. Electrode resistances 
were checked prior to running S, and any electrode 
giving a reading above 10 K ohms was reapplied. 
The standard EEG constant speed of 10 mm/sec was 
used and the gain set so that a 20-mm. deflection 
was equivalent to 50 uv. 


Procedure 


Training session. Each S was given a training 
session to provide him with experience in free 
imagery and to accustom him to the laboratory. He 
also was asked to abstain from all sedatives, stimu- 
lants, and alcoholic beverages at least 24 hr. prior 
to his experimenta] appointment which was scheduled 
for the following week. 

Experimental session. The S was seated in a 
reclining chair in a dimly illuminated, soundproof 
room adjoining the equipment room where E was 
situated. The S was told that the purpose of the 
investigation was to determine the relationship be- 
tween visual imagery, brain waves, and the electrical 
conductivity of the skin. No mention was made of 
eye movements. The E then attached the electrodes 
to S and asked him to close his eyes and keep as 
still as possible. 

Free imagery condition, After an additional 5-min. 
period of adaptation to the laboratory situation, a 
trial run of 1-min. eyes-closed resting period (Base 
Line 1) was recorded to establish a base-line measure 
for later comparisons with visual imagery. The free 
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imagery condition began with a request for S to 
describe pictures that came into his mind's eye. He 
was asked to signal E (by pressing a switch with 
his preferred hand) the onset and termination of 
each epísode of images. The switch, powered by a 
3-v. dc battery, activated a signal light in the record- 
ing room. To avoid contaminating the imagery, S 
was asked to describe in detail what he saw after 
each imagery episode. The descriptions were recorded 
on a Uher 400 remote-controlled tape recorder with 
a built-in two-way intercom system. The free 
imagery session lasted for 45 min. 

Owing to the long duration of the free imagery 
condition, there was some doubt concerning the 
meaningfulness of Base Line 1 throughout the entire 
period. Such processes as adaptation, fatigue, drowsi- 
ness, or increasing apprehension about the nature of 
the imagery could substantially change the percentage 
of alpha. Accordingly, the alpha percentage for each 
visual episode during the free imagery condition was 
compared with the alpha percentage in an equal 
period of no imagery just prior to the activation 
of the signal indicating the onset of visual imagery 
(Alternate Base Line 1). 

Dream recall condition. On completion of the free 
imagery phase of the experiment, S was allowed to 
relax for 5 min. A second 1-min., eyes-closed resting 
period was monitored to establish a base line 
(Base Line 2) for this condition. After the trial run, 
S was instructed to close his eyes and visualize scenes 
from a single previous “vivid” nocturnal dream. As 
in the free imagery phase of the experiment, S was 
asked to signal the onset and termination of each 
visual imagery episode, and he was asked to describe 
what he saw after the entire dream was completed. 
The polygraph record of this phase lasted until the 
completion of S's “dream.” 


RESULTS 
Visual Imagery 


The total number of visual imagery epi- 
sodes during the 45-min. free imagery condi- 
tion of the experiment ranged 28-116, with 
an overall mean of 48.1, SD = 27.6, and a 
mean rate of 1.1 visual episodes per minute. 
The mean duration of each visual episode 
was 11.1 sec. In comparison, the mean dura- 
tion of the dream recall condition was 1.4 
min. and the total number of visual imagery 
episodes ranged 1-6, with an overall mean of 
3.6, SD = 1.7, and a mean of 2.6 visual epi- 
sodes per minute. The mean duration of each 
visual episode was 30.4 sec. The only question 
of significance in these data was the rate of 
visual episodes, and a ¢ test showed that the 
two experimental conditions differed signifi- 
cantly (.05 level) in this regard. It also was 
obvious that the imagery of the dream recall 
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condition was more kinetic; that is, there 
was more movement in comparison to the free 
imagery condition wherein the images tended 
to be static. It must be emphasized, however, 
that there were some instances of highly 
kinetic imagery during the free imagery 
condition which is congruent with clinical 
experience with free imagery. 

All of the Ss frequently showed a transient 
desynchronization of the alpha rhythm just 
prior (1 sec.) to signaling the onset of visual 
imagery. This transient initial desynchroniza- 
tion of alpha ranged between 46% and 100% 
with a mean of 60.1% for the free imagery 
condition, and between 1.7% and 100% 
with a mean of 58.2% for the dream recall 
condition. 

As expected, most of the Ss did not ex- 
perience primary process regulated imagery 
and/or the uncovering that often occurs with 
the clinical use of free imagery. For the most 
part, their imagery could be most aptly char- 
acterized as the visual reproduction of recent 
experience, everyday objects, and routine 
events of little obvious psychodynamic im- 
port, although it is known from clinical ex- 
perience that such images often are really 
remote derivatives of unconscious strivings. 
This is apparent because over a session(s) 
images gradually become less symbolic and 
more representational, with the symbolic sig- 
nificance of previously “nonsymbolic” images 
becoming apparent. Regardless of the fact 
that this was an experiment with only 45 min. 
of free imagery experience, one S showed 
blatant primary process regulation of his 
imagery. In one episode he reported that he 
was peeking through the window of an old 
shop and an old man came in, pointed a finger 
at him and laughed, whereupon to S’s con- 
sternation, the old man’s finger turned into 
a penis. Of no little consequence was a con- 
comitant increase in heart rate and a de- 
synchronization of the alpha rhythm from 
44% to 12%. If this S were a client in psy- 
chotherapy, he would have been asked to re- 
image the scene and the imagery would 
gradually become more depictive of a re- 
Pressed striving. Anxiety and/or psychoso- 
matic symptoms would have been experienced 
simultaneously with an increase in resistance 
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TABLE 1 


Means or REMs AND OCCIPITAL ALPHA FOR 
EXPERIMENTAL AND BAsE-LINE CONDITIONS 


Episode X | sp t 
REM (mean REM/min) 

Base Line 1 1.0 9 
Free imagery 29| 3.1 |2.08* 
Base Line 2 16| 31 
Recalled dream $3| 54 |1.99* 
Free imagery 29| 34 
Recalled dream $3| 54| 33 


Occipital alpha (mean % alpha) 


BaseLine 1 33.0 | 23.2 

Free imagery 33.7 | 23.5 | .11 
Base Line 2 45.0 | 25.5 
Recalled dream 32.9 | 24.6 | 4.10** 
Free imagery 33.7 | 23.5 
Recalled dream 32.9 | 246 | .28 


Note.—N = 10. 
* > <S .05, one-tailed. 
** p S .005, two-tailed. 


against the procedure. If the authors are cor- 
rect, the old man eventually would have 
turned into S's father and at that moment he 
would have realized his sexual interest in his 
father's genitals. 


REM 


Table 1 shows the mean REMs/min under 
both experimental conditions, the results of 
which were analyzed separately by comparing 
each with its own base line. Both conditions 
were associated with significantly (.05 level) 
more REMS than their respective base lines; 
however, they did not differ from each other. 
Since the percentage of alpha of Base Line 1 
and its alternate were almost identical (33.0 
and 324), only the results of the former 
are reported. 

REMs characteristic of nocturnal sleep 
were absent in both experimental conditions. 
The obtained REMs were less than the 75 
nv. criterion, were more gradual in slope, and 
did not occur in clusters. The REMs accom- 
panying nocturnal dreams seem to have a 
component uniquely associated with the neural 
determinants of sleep, perhaps like those 
specified by Jouvet (1962). The recalled 
dreams seemed more like hypnogogic imagery 
than nocturnal dreams, according to the de- 
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TABLE 2 


PERCENTAGE INCREASE AND DECREASE IN HEART 
RATE FOR EXPERIMENTAL CONDITIONS 


Episode x SD t 
Within-experimental conditions 
Free imagery 
Decrease 192 | 12.0 
Increase 24.7 | 11.2 | 1.0 
Recalled dream 
Decrease 11.6 | 17.6 
Increase 39.3 | 29.0 | 2.6* 
Between-experimental conditions 
Decrease 
Free imagery 192 | 120 
Recalled dream 11.6 | 17.6 | 1.3 
Increase 
Free imagery 247 | 112 
Recalled dream 39.3 | 29.0 | 1.6 


Note,—N = 10, 
*p $.05. 


scriptions of Foulkes and Vogel (1965) and 
Roffwarg, Dement, Muzio, and Fisher (1962). 
The reimaging of a nocturnal dream had a 
mean duration of 2 min. in contrast to means 
of 9 and 11.2 min. reported by Dement and 
Kleitman (1957a) and Caldwell and Domino 
(1967b), respectively. There was also an 
absence of visual continuity of images during 
the recalled dream. 


EEG 


The results of Table 1 indicate that alpha 
activity during free imagery did not decrease; 
however, there was alpha desynchronization 
during the visual imagery episodes in the 
dream recall condition relative to Base Line 2 
(.005 level). This result is due to a gross 
difference between the two base lines, not 
because there was a difference between the 
experimental conditions. 


Heart Rate 


"The onset of each visual episode was exam- 
ined for changes in heart rate. Table 2 shows 
that there was a significant increase in heart 
rate for the dream recall condition but not 
for the free imagery condition. Although the 
remaining comparisons were uniformly non- 
significant, they were in the right direction 
and further suggest that the dream recall 
condition had more central nervous system 
(CNS) activating properties or, in psycho- 
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dynamic terms, more  anxiety-producing 
properties. 
DISCUSSION 


The desynchronization of the alpha rhythm 
during the dream recall condition, relative to 
its base line, was surprising in view of the 
lack of alpha desynchronization for the free 
imagery condition, relative to its base line. 
Inspection of Table 1 reveals that the relative 
desynchronization of the dream recall condi- 
tion is solely due to the increase of alpha 
for Base Line 2. Since the percentage of alpha 
for the two experimental conditions was the 
same, it might be argued that there is an 
upper limit of alpha activity during visual 
imagery. This hypothesis is vitiated, however, 
by the wide range of percentage of alpha (2.8 
to 74.6) in the free imagery condition, which 
was comparable to that of Base Line 1 and 
its alternate. Another hypothesis is that the 
percentage of alpha in the free imagery condi- 
tion was too low to be desynchronized by 
visual imagery; however, the obtained per- 
centage of alpha in this condition is con- 
sistent with the percentage of alpha in an 
eyes-closed resting period often cited in the 
general EEG literature. Moreover, according 
to Ostow (1950), the direction of change is 
unrelated to the base-line percentage of alpha. 
There should have been some desynchroniza- 
tion regardless, but there was not. 

The so-called law of initial values (LIV); 
which specifies a positive correlation between 
the initial level of a function and the amount 
of change from that level, is pertinent to Base 
Line 2 and the dream recall condition. For 
these data there should be relatively greater 
desynchronization of alpha for Ss with high 
initial alpha (Base Line 2) resulting in à 
sharply reduced mean percentage of alpha for 
the dream recall condition. An inspection of 
the scores of individual Ss between Base 
Line 2 and the dream recall condition reveals 
that only one S did not show a decrease in 
alpha, and the correlation. (Spearman) be- 
tween initial level and amount of change was 
only .22. The LIV does not appear to be an 
influencing variable. 

There are two viable hypotheses yet to be 
considered. The first would attribute the rela- 
tive desynchronization of the dream recall 
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condition to the differential demands made by 
the two experimental conditions. In the free 
imagery condition, S is not asked to do any- 
thing other than press a button at the onset 
of a visual image, whereas in the dream recall 
condition he is asked to do something specific. 
He must make the effort to recall a dream. 
It is this effort which creates a pattern of 
CNS activation of which reduced alpha is one 
aspect. The second hypothesis takes into ac- 
count the activating properties of primary 
process thinking which was much greater for 
the dream recall condition. It was the anxiety 
implicit in primary process thinking that 
caused CNS activation and alpha desynchro- 
nization. Related investigations in the authors' 
laboratory using the GSR (Burns, 1967; 
Reyher & Smeltzer, 1968) show that primary 
process regulated visual imagery is related to 
CNS activation of the GSR. Other studies, 
yet incomplete, are revealing the same rela- 
tionship, and further research is being planned 
to separate and evaluate these two variables. 

Taken together, these hypotheses (task and 
primary process sources of CNS activation) 
might account for the contradictory findings 
in the research literature concerning the rela- 
tionship between alpha and visual imagery. 
Unlike the present investigation, previous in- 
vestigations did not monitor all S's visual 
imagery; either S was asked to describe his 
mental state because of some change in the 
EEG and/or in REMs, or he was asked to 
visualize something in particular. In the first 
case, only visual imagery which had activating 
properties would have been detected, and in 
the latter case, it is the request per se to 
form a specific visual image which itself is 
activating, not the image. Because of these 
biasing circumstances, it is understandable 
why the research literature is confusing. 

The data also have implications with re- 
Spect to the hypothesis that REMs are related 
to S’s scanning of his images. If this were 
true, there should have been more REMs 
during visual imagery in each experimental 
condition relative to its base line, which was 
the case, Also, there should have been an 
increase in REMs for the experimental condi- 
tion associated with more of any one of the 
following characteristics: (a) higher rate of 
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visual episodes, () longer duration of visual 
episodes, and (c) kinetic imagery. Since the 
dream recall condition had a greater associa- 
tion with all three of these characteristics in 
comparison with the free imagery condition, 
it should also be associated with more REMs, 
which was not the case. These results are more 
consistent with the hypothesis advanced by 
Amadeo and Shagass (1963); that is, REMs 
represent a nonspecific neural mechanism 
which is a concomitant of attentive activity. 
Accordingly, Ss in the present research paid 
equal attention to the two experimental con- 
ditions, even though they differ in terms of 
CNS activation as suggested by the data on 
heart rate. Is there perhaps an upper limit 
to attention? Attention is a state of awareness 
in which the individual is tuned to respond 
adaptively to sensory input. Attention can be 
reduced below this level by fatigue, drugs, or 
loss of interest in the sensory stimuli, but it 
cannot be raised above this level. 

The data of the S who saw the old man in 
the window are pertinent to the hypothesis 
advanced by Antrobus and Singer (1965) 
which attributes an increase in REMs to the 
suppression of fantasies. He saw a snake 5.5 
sec. after seeing the old man's finger turn into 
a penis, indicating that anxiety associated with 
the old man and his penis finger was excessive 
and that repressive defenses were stimulated 
to produce a less blatant derivative. This 
heightened level of defense is maintained over 
the following images: flag pole with soldiers 
marching, a huge missile going by, and a 
girl's face with no features; but no increase 
in REMs was noted. Until more data like 
this are collected, which is common in emer- 
gent uncovering, it is concluded very tenta- 
tively that REMs are not associated with an 
increase in defense. Antrobus and Singer 
asked their Ss to suppress, and it may be the 
effort involved in this task that is associated 
with REMs, not the presence or absence of 
visual imagery. 

Free imagery has not only proved itself to 
be a powerful uncovering technique in psycho- 
therapy, but it is showing promise as a re- 
search tool as well. Its value as a research 
tool inheres in the response-producing psycho- 
dynamic processes that are set in motion and 
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rendered visible in the form of derivatives and 
symbols of repressed drives and conflicts which 
gradually become less symbolic and more 
depictive. The objectivity of the data can be 
preserved if interpretations are not given to 
the client. 
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The Ss were 20 adult females, hospitalized for psychiatric disorders, with 7 
displaying initial negative verbal habit (ie. using bad-impression verbs with 
self-referent pronouns) and 13 displaying initial positive verbal habit (ie. 
using good-impression verbs with self-referent pronouns). The Ss were matched 
for diagnostic category, age, education, and number of hospitalizations, and 
divided into two groups. Experimental Ss reinforced for displaying a verbal 
habit opposite to their initial habit significantly increased their tendency to 
display the reinforced habit. When the initial verbal habit was then reinforced, 
Ss significantly increased their tendency to display this habit. Control Ss never 
received reinforcement, and showed no change in initial verbal habit. 


Some recent research has focused on the 


influence of previously learned verbal habits 


on verbal conditioning. Dixon (1965) studied 


the role of the impression value of verbs in 
"the conditioning of the pronouns “I” and 


“we.” He employed a version of Taffel’s 
(1955) sentence-construction task, which re- 
quires S to choose one of six pronouns as the 
first word of a sentence whose second word is 
a past-tense verb. Dixon found that even in 
the operant period the pronouns I and we 
were used more frequently with good-impres- 
sion verbs than with bad-impression verbs, 
indicating that such verbal habits are ac- 
quired prior to experimental reinforcement. 
Moreover, reinforcement of I and we in- 
creased not only the frequency of these pro- 
Nouns, but also the use of good-impression 
Verbs with these pronouns. Clance and Dixon 
(1965) found that the influence of verbs on 
Pronoun choice extends to other-referent, as 
well as self-referent, pronouns. Both studies 
indicated that prior associations with verbs 
are greater determinants of pronoun choice 
than is currently administered reinforcement. 

The present study concerns the extent to 
Which such previously acquired verbal habits 
can be modified. The distinction should be 
Made between positive verbal habit (PVH) 
and negative verbal habit (NVH). The former 
may be defined as the tendency to associate 
Bood-impression verbs with self-referent pro- 
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nouns; the latter, as the tendency to associate 
bad-impression verbs with such pronouns. 
Since it is possible that PVH and NVH are 
differentially affected by reinforcement opera- 
tions, it was felt desirable to study both hab- 
its. Normal Ss, however, habitually display 
only PVH (Clance & Dixon, 1965; Dixon, 
1965). The Ss for the current study were 
selected from a population of psychiatric pa- 
tients since, unlike the normal populations 
previously studied, it included individuals who 
exhibited NVH, as well as those who showed 
PVH (unpublished pilot study by the senior 
author). 

The hypothesis tested was that Ss rein- 
forced for displaying a verbal habit (either 
positive or negative) opposite to their initial 
verbal habit (either positive or negative) in- 
crease their tendency to display the rein- 
forced habit. Further, when the reinforce- 
ment contingency is shifted so that the origi- 
nal verbal habit is reinforced, Ss should in- 
crease their tendency to display the original 
habit. 


METHOD 


Subjects. Twenty females hospitalized for psychi- 
atric disorders were matched on diagnostic category, 
age, education, and number of hospitalizations, and 
divided into experimental and control groups. The 
mean chronological age of both the experimental 
and control groups was 40.2 yr. The mean num- 
ber of years of formal education for the experi- 
mental and control groups was 10.7 and 10.4, re- 
spectively. The mean number of previous hospitaliza- 
tions for the experimental group was 5.1, and for 
the controls, 5.5. On the operant trial of the experi- 
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mental task, five experimental Ss displayed PVH and 
five displayed NVH; for the controls, these figures 
were eight and two, respectively. 

Task. In a modified version of the Taffel (1955) 
task, the pronoun I was presented with a selection 
of 30 past-tense verbs that had been scaled previ- 
ously for impression value (Dixon & Dixon, 1964) 
and employed by Dixon (1965). Ten verbs connoted 
a good impression; 10, a neutral impression; and 10, 
a bad impression. Each of the 150 index cards con- 
tained the pronoun I and 2 verbs centered beneath it. 
Verbs in each of the three impression-value classes 
appeared two-thirds of the time, and were paired 
equally often with verbs from the other two cate- 
gories, The 30 verbs were so paired that the same 2 
verbs did not appear together more than once. 

Procedure. After being told that she was to par- 
ticipate in a personality study, S was instructed to 
construct a sentence for each card, using I as the 
first word and one of the two verbs as the second. 
Each set of 15 cards constituted a trial, with the 
first trial establishing the base level of response. 
During Trial 1, E made no response to S. During 
Trials 2 through 6 with the experimental group, Æ 
reinforced those responses opposite to the base level 
response tendency; that is, those Ss displaying PVH 
during the operant trial were reinforced for display- 
ing NVH, and vice versa. During Trials 7 through 
10, E reversed the reinforcement contingency and 
supported the original verbal habit of the experi- 
mental Ss. The E reinforced desired responses by 
smiling, nodding his head, and uttering one of the 
following statements: *You are doing a good job"; 
"That's fine"; “That was a good sentence"; “Very 
good." Control Ss received no reinforcement during 
the 10 trials, 

Following the experiment, each S answered an 
awareness questionnaire to determine if she could 


state the reinforcement contingencies (see Dixon, 
1965). 


RESULTS 


The principal results are shown in Figure 1, 
which presents the mean numbers of initial 
and opposite verbal habit responses as a func- 
tion of trials for experimental and control Ss. 

McNemar’s (1955) test for significance of 
change was used to evaluate the data. In line 
with the hypothesis, the experimental Ss sig- 
nificantly changed from their initial verbal 
habit to the opposite habit when the latter 
was reinforced (y? = 8.10, p < .01), and then 
showed a statistically significant return to 
their initial verbal habit when the reinforce- 
ment contingency was reversed (x? = 7.01, ? 
<.01). 

The data for the experimental group were 

analyzed separately for PVH and NVH Ss by 
means of the binomial test (Siegel, 1956). All 
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Ss in both subgroups displayed the predicted 
change to the opposite verbal habit when it 
was reinforced (p < .03, in both cases). Dur- 
ing the reversal phase, all the original PVH 


Ss returned to their initial verbal habit (p< 4 


.03), and all but one of the original NVH Ss 
did so. This latter finding is statistically non- 
significant, given the smallness of the sample 
size (n= 5). 

Control Ss showed no significant change in 
verbal habit throughout the 10 trials. 
. None of the Ss indicated knowledge of the 
reinforcement contingencies on the awareness 
questionnaire, 


DISCUSSION 


The results indicate that in a highly struc- 
tured situation, the habitual pattern of 
associating a specific class of verbal impres- 
sion values with self-referent pronouns may be 
changed and even reversed through the appli- 
cation of reinforcement. This finding holds 
true irrespective of the nature of the original 
verbal habit (i.e., positive or negative). These 
results are relevant to Ellis’ (1958) hypothe- 
sis that changes in verbal behavior are essen- 
tial in producing changes in emotional reac- 
tions. He contends that one may substantially 
change one's feelings by changing the internal- 
ized verbalized behavior by which one initially 
created the feelings. Tf, as Ellis believes, emo- 
tional disturbance essentially arises when indi- 
viduals repeat to themselves negative, self- 
defeating statements, then the substitution of 
more positive self-referent statements should 
be correlated with a cessation of the disturb- 
ance. 

The efficacy of reinforcement in modifying 
verbal behavior of psychiatric patients in this 
study stands in contrast to the findings of 
Dixon (1965) and Clance and Dixon (1965), 
in which prior verbal habits appeared to be 
greater determinants of verbal response than 
were current reinforcement contingencies. The 
apparent contradiction may be accounted for 
in terms of two variables: (a) the nature of 
the reinforcers and (b) S characteristics. 
three studies used social reinforcement. How- 
ever, the Dixon studies employed only the 
spoken word “good,” whereas the present 
study used one of a sequence of verbal rein- 


forcers, accompanied by a smile and head nod. 
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Wickes (1956), comparing a sequence of 
verbal reinforcers with a sequence of gestural 
reinforcers, found that both increased the fre- 
quency of M responses on the Rorschach test, 
with the nonverbal reinforcement being some- 
what more effective. It is likely that the com- 
bination of verbal and gestural reinforcers 
used in the present study were more effective 
than the single word good used by Dixon. In 
addition, Ss might be less likely to satiate on 


1 Ss (combined data for PVH 


a variety of reinforcers than on one consist- 
ently presented. 

With regard to differences in S character- 
istics, the Ss in the present study were insti- 
tutionalized, whereas Dixon’s were not. The 
finding that social reinforcement was appar- 
ently more effective with the former Ss is in 
line with reports of Zigler (1963) and others 
that institutionalization enhances the effec- 
tiveness of social reinforcement. 
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SYSTEMATIC DESENSITIZATION VERSUS IMPLOSIVE 


THERAPY * 


CURTIS L. BARRETT? 
Child Psychiatry Research Center 
University of Louisville School of Medicine 


Systematic desensitization (SDT) and implosive therapies (IT) were compared 
for their effectiveness and efficiency in reducing snake phobic behavior in other- 
wise normal adult human beings. SDT and IT Ss differed significantly from 
control Ss in posttreatment avoidance of a snake and in change of reported 
discomfort. SDT and IT, however, did not differ in effectiveness. These results 
held at a 6-mo. follow-up. IT was more efficient in that treatment was com- 


pleted in 45% of the time required for SDT. Results were qualified by the 


finding that SDT had a consistent an 


d continuing effect across Ss and across 


time whereas IT effect was more variable. Occurrence of unusual disturbance 
of Ss during SDT and of IT Ss between sessions was discussed. 


Two current learning-theory-based thera- 
pies, applied to reduction of phobic behaviors, 
call for directly opposed operations. These 
are Wolpe’s (1958) systematic desensitization 
therapy (SDT) and Stampfl’s (1961) implo- 
sive therapy (IT). In Wolpe’s well-known 
approach, relaxation is used in reciprocal 
inhibition of small doses of anxiety that may 
be aroused by a hierarchical arrangement of 
stimuli. Since relaxation is assumed to inhibit 
only small amounts of anxiety, a gradual pro- 
gression through the hieriarchy is used. Con- 
siderable care is taken to see that Ss experi- 
ence little or no anxiety as they imagine 
elements of feared stimulus events. 

Stampfl’s approach is the opposite of 
Wolpe's. He assumes that extinction of a 
negative emotional response to objectively 
neutral stimuli will proceed most rapidly when 


1This paper is based on a dissertation submitted 
io the Department of Psychology, University of 
Louisville in partial fulfillment of the requirements 
for the PhD degree. The research was supported in 
part by California Department of Mental Hygiene, 
Patton State Hospital Research Project PSH-141. 
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Coates, Ronald S. Lindner, and Lovick C. Miller, 
who served as therapy session raters, to Jack Fox 
and Walter W. Surwillo for their guidance and valued 
comments, and to Willard A. Mainord, dissertation 
advisor, who also served as the clinical psychologist 
for the study. 

2 Requests for reprints should be sent to the author, 
Child Psychiatry Research Center, 608 South Jackson 
Street, Louisville, Kentucky 40202. 


Ss are exposed to a conditioned stimulus (CS) 
without reinforcement. Use of a gradual ap- 
proach, for Stampfl, invites reinstatement of 
the response to the CS in accordance with the 
conservation of anxiety hypothesis (Solomon, 
Kamin, & Wynne, 1953). Therefore, as 
Stampfl and Lewis stated, IT therapists at- 
tempt to “represent, reinstate, or symbolically 
reproduce the stimuli (cues) to which the 
anxiety response has been conditioned, in the 
absence of primary reinforcement [p. 499]." 
This is accomplished by instructing S to 
imagine scenes suggested by the therapist 
until some sign of reduction in the anxiety 
elicited by the scene occurs. Then scenes that 
produce even more anxiety are suggested. This 
process is continued until the scenes suggested 
by the therapist no longer elicit anxiety. 

A practitioner's choice of either SDT or IT 
on theoretical grounds would be premature 
since both therapies must await future demon- 
stration of the adequacy of their underlying 
theories. Choice on the basis of effectiveness 
also would be difficult since both SDT and IT, 
when studied under well-controlled conditions, 
have been shown to be effective. Lang, 
Lazovik, and Reynolds (1965), for example, 
found SDT to be more effective than control 
or *pseudotherapy" procedures in reducing 
snake phobic behavior. Paul (1966) showed 
SDT to be more effective than insight therapy 
or control-placebo procedures in reducing per- 
formance anxiety. Similarly, IT has been 
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shown to reduce rat phobic behavior more 
than control procedures (Kirchener & Hogan, 
1967) and to be effective in the treatment of 
hospitalized schizophrenics (Hogan, 1966) 
and neurotic outpatients (Levis & Carrera, 
1967), But there has been no direct compari- 
son of SDT and IT such as that conducted 
by Paul (1966) to compare SDT and insight 
therapy. 

The present study was conducted (a) to 
provide a direct comparison of the effective- 
ness of SDT and IT and (5) to compare the 
efficiency of SDT and IT in reducing the same 
disordered behavior. Fear of harmless snakes, 
as it occurs in otherwise normal adult human 
beings is the disordered behavior subjected to 
study. Efficiency of a therapy, for purposes 
of this study, is defined as time required for 
treatment. Thus, of two equally ecective 
therapies, the one that takes less time is 
regarded as more efficient. 


METHOD 


Thirty-six Ss, 8 males and 28 females, were care- 
fully selected according to the following criteria: 
(a) demonstrated fear of a harmless snake, (b) ab- 
sence of severe emotional disorder, (c) verbalized 
reason to overcome fear of harmless snakes, and 
(d) commitment to participate in up to 15 hr. of 
therapy. 


Fear Measures 


Several measures of fear were administered by 
E» who participated in no other aspect of the 
experiment, 

Avoidance test (AT). The a priori criterion mea- 
sure for fear reduction, and therefore, for therapy 
effectiveness, was S’s passing of an AT used by 
Lang et al. (1965). Passing required that S touch or 
hold a 6-ft. black snake. 

Avoidance test change score (ATCS). The AT 
yielded a percentage change statistic (ATCS) (Lang 
et al, 1965) based on S's approach behavior. Scores 
ranged fron: 1 for holding the snake; 2, touching it; 
3, standing next to the cage; 18, Standing 15 ft. away ; 
to 19, refusing to enter the room. The ATCS equaled 
S's pre-AT score minus his post-AT score divided 
by his pre-AT score. 

Fear survey schedule (FSS III). Wolpe and Lang's 
(1964) fear survey schedule was used to assess S's 
reported reaction to a variety of objects and situa- 
tions using a 5-point scale (not at all, 1; to very 
much, 5). The sum of S's responses (FSS III, total 
score) indicated his generalized disturbance. Item 63 
of the FSS III refers to harmless snakes. 

Fear thermometer (FT). Walk's (1956) fear ther- 
mometer permitted S to rate degree of fear felt in 
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the test situation using a thermometer-shaped device 
with 10 divisions. 

S-R inventory of anxiousness (SRIA). The Endler, 
Hunt, and Rosenstein (1962) SRIA permitted S to 
respond to 14 items describing his response to being 
asked to pick up a live, harmless snake. The sum of 
S's responses on a 5-point scale, to 14 items, served 
as an index of anxiety that S experienced when 
confronted with the snake. 


Procedure 


Persons who were afraid of harmless snakes were 
sought in the Louisville area írom graduate and 
undergraduate classes, nursing schools, and from the 
public via newspaper publicity. Those whose fear 
appeared to be genuine received pretreatment testing 
(AT, FSS III, FT, and SRIA) which was conducted 
as follows. E; introduced S to Es who asked S to 
come to a room in which, 15 ft. from the door, 
there was a caged, live 6-ft. black snake. The S was 
asked successively to approach, to touch, and to hold 
the snake. Afterwards, S completed the other fear 
measures and was interviewed by Es in order to 
tape-record S's qualitative statements of the experi- 
ence. The Ss who íailed AT, that is, those who 
neither held nor touched the snake, were next given 
the Minnesota Multiphasic Personality Inventory 
(MMPI). Those whose MMPI profile appeared to 
be normal were referred to an experienced clinical 
psychologist (CP). If CP found no evidence of 
present or past emotional disorder other than snake 
phobia, S was accepted and assigned randomly to one 
of three groups: systematic desensitization therapy, 
implosive therapy, or control group. 

Control group (CG). The Ss assigned to CG were 
told that they would be contacted concerning the 
next step in their “therapy.” Next, CG Ss received 
a letter that arranged for posttreatment testing after 
8 wk. The CG Ss were contacted similarly to arrange 
for a follow-up testing after 6 mo. After follow-up 
testing, Es explained the experimental procedures and 
the function of control groups. Then he offered Ss 
treatment for the phobia at the University of Louis- 
ville Psychology Clinic. 

SDT. The Ss assigned to SDT received 4 training 
sessions and up to 11 SDT sessions. In the first 
training session, Ss heard a tape recording which 
briefly explained SDT theory and which indicated 
that the investigators, as scientists, would be pleased 
no matter what the results of the experiment. The 
remainder of Session 1 and the next 3 sessions were 
used to construct an individual hierarchy for S by 
Lang and Lazovik’s method (1963) and to teach Ss 
to relax. Relaxation instructions were based on Paul 
(1966, pp, 118-120). The SDT proper was begun in 
Session 5 and essentially was conducted as described 
by Paul (1966, pp. 120-122). The criterion for 
termination of SDT in fewer than 11 treatment 
sessions was presentation of the hierarchy twice 
without S signalling disturbance. The SDT Ss re- 
ceived the post- and follow-up testing that CG Ss 
received. 
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TABLE 1 
Summary OF RESULTS AT POSTTREATMENT TESTING 
— 

FSS III 

i AT ATCS SRIA ee tal 
roup & se) Item 63 change | Total score change 
M SD M M SD M SD 

Control 1 A3 35 92 | 1.78 66 4.16 | 19.63 | 2.92 | 5.32 
SDT 11 .92 AT 425 | 247 | 2.33 31.67 | 29.32 | 12.33 | 9.57 
IT 10 .88 .30 433 | 243 | 2.42 28.33 | 16.23 | 15.58 | 10.48 


Note.—SDT = systematic desensitization therapy; IT — implosive therapy; AT = avoidance test; ATCS = avoidance 
test change score; FT = fear thermometer; FSS III = fear survey schedule;and SRIA = S-R inventory of anxiousness. 
a Passing the avoidance test means S either touched or held the snake. 


IT. The Ss assigned to IT received two 50-min. 
clinical interviews followed by up to two sessions 
of IT which were of unspecified duration. In the 
first interview, IT Ss heard a recorded explanation 
of IT that was identical to the SDT rationale tape 
except for the portion describing IT. Interviews I 
and II were also used to explore S's fear of snakes. 
Near the end of the second interview S practiced 
neutral imagery such as walking into a store. Inter- 
views III and IV were conducted as described else- 
where by Stampfl (1961), Stampfl and Levis (1967), 
and Barrett (1967). Essentially, a scene such as con- 
fronting a snake while climbing a tree, which initially 
was thought to arouse maximum anxiety in S, was 
presented until S showed some diminution of anx- 
iety. Then a new scene (e.g, being enveloped in a 
python’s coils) was presented until it too ceased to 
elicit anxiety. Scenes were tailored to the experience 
of each S but scenes depicting helplessness, bodily 
harm, envelopment, and being devoured by huge 
snakes were commonly used. The only departure 
from standard IT procedure was exclusion of psycho- 
dynamic material. The termination criterion for IT 
was the same as that for SDT: presenting S’s most 
frightening scene twice without his indicating dis- 
turbance (“fear,” “anxiety”). The IT Ss received the 
post- and follow-up testing received by SDT and 
CG Ss. 

Therapy ratings. All SDT and IT sessions were 
tape-recorded and then rated by two clinical psy- 
chologists and one experimental psychologist (all 
PhDs, one ABPP) who were trained in SDT and IT 
for the project, The rating procedure required identi- 
fication of randomly selected 5-min. excerpts as SDT 
or IT and indication of degree of adherence to 
SDT or IT operations. A rating of 1 indicated no 
adherence to operations (therapy unacceptable); 7 
indicated that operations were followed exactly. Ac- 
ceptance of an excerpt by two of three raters ad- 
mitted the results of the therapy to the study. To 
offset any bias of the excerpts, raters were provided 
with three randomly selected complete interviews for 
each SDT and IT S (72 recordings). Raters were 
asked to sample these in order to satisfy themselves 
that the excerpts were representative of actual 
therapies, 


RESULTS 


Results of this study necessarily hinge on 
the demonstration that Ss received the therapy 
that was appropriate for their group. Raters 
were able to identify all SDT excerpts as 
SDT and all IT excerpts as IT. Addition- 
ally, the mean rating for SDT sessions 
(X= 6.33, SD=.66) and IT sessions 
(X=6.10, SD —.91) did not differ (f= 
1.15, df = 70, .10 < p < .20). All three raters 
agreed on a rating of 5 or higher in 91.6% of 
SDT cases and in 83.3% of IT cases. Thus, 
there was evidence to indicate that rater 
agreement was quite high, the therapies were 
correctly identified by raters, and the degree 
of adherence to operations did not differ for 
SDT and IT. 

Posttreatment findings are summarized in 
Table 1, Fisher’s exact probability test indi- 
cated that the frequency of passes of AT for 
SDT and IT differed from that for CG 
( < 01), but SDT and IT did not differ. 
Analysis of variance followed by orthogonal 
comparison (Winer, 1962, p. 66) showed 
similar results for ATCS and for simple 
change scores (pre- minus post-) that were 
computed for the remaining measures. That 
is, SDT and IT differed from CG (? < .01) 
but not from each other. 

Table 2 shows the results for 6-mo. 
follow-up. The same procedures of analysis 
that were used for postresults, performed on 
frequency of passes on the AT and change 
on FT; on FSS III, total scores; on FSS III, 
Item 63; and on SRIA, showed that SDT and 
IT differed from CG (p< .05). However, 
SDT and IT did not differ in fear reduction 
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TABLE 2 

Summary or RESULTS AT 6-MoNTH FoLLow-Ur 

FSSIII 
m ATCS FT change m a SRIA DE 

eds Item 63 change ‘otal score change 
Group Sb 

M SD M SD M SD M SD M SD 
Control 3 .22 57 1.79 1.96 25 87 5.16 | 23,59 6.33 | 6.89 
SDT 12 97 .06 3.75 | 3.34 | 2.25 | 1.29 | 33.42 | 27.17 | 14.00 | 8.48 
IT 9 .79 39 5.00 | 199 | 2.83 | 1.27 | 35.42 | 29.66 | 17.58 | 9.61 


Note.—See Table 1 for explanations of abbreviations. 


a Passing the avoidance test means S either touched or held the snake, 


assessed by these measures. Follow-up ATCS 
data were assessed by £ tests since the hy- 
pothesis of homogeneity of variance for these 
data was rejected (Cochran’s C > .61). Fol- 
low-up ATCS for CG differed from that for 
SDT (t= 4.68, df = 22, p<.01) and IT 
(t = 2.85, df = 22, p < .01) but SDT and IT 
did not differ (¢= 1.63, df= 22, .10 < p < 
.20). 

"Table 3 shows mean training and treatment 
time for SDT and IT. Considering treatment 
time only, IT was completed in 41% of the 
time required for SDT. Comparing treatment 
plus training time, IT was completed in 4596 
of the time required for SDT. 

The original experimental design of this 
study, as shown in the preceding paragraphs, 
called for evaluation of mean changes on a 
number of fear measures. However, Bergin 
(1963, 1966) questioned use of only mean 
change to assess outcome of therapy and has 
suggested that differential effects of control 
and treatment procedures may be shown in 
variances changes, where differences in mean 
changes are small. This has been labeled the 
"deterioration effect" to indicate that in 
therapy some patients get better and some 
get worse. By contrast, control procedures in 


TABLE 3 


MEAN TRAINING AND TREATMENT TIME, Hours, 
FOR SUBJECTS IN TREATMENT GROUPS 


SDT anv IT 
Group Training Treatment Total 
SDT 3.3 5.8 D 
IT 17 24 4.1 


Note.—SDT = systematic desensitization therapy; IT 
= implosive therapy. 


studies cited by Bergin resulted in little 
change in means or in variance. Treatments 
also could be compared for effect on variance. 
That is, two treatments could yield similar 
mean changes, yet one treatment could have 
a consistent effect, common to all Ss, that 
would show in a significant variance reduc- 
tion. To assess such effects in this study, 
additional analyses were suggested. 

The criterion measure, AT, yielded a “pass- 
fail” score and could not be assessed for 
variance change. Therefore, AT scores used 
for ATCS were selected for analyses sum- 
marized in Table 4. For CG Ss there was no 
significant change in AT score variance at 
pre-, post- or follow-up testing. For SDT Ss, 
however, AT score variance showed continu- 
ing reduction from pre- to post- (F = 6.11, 
df = 11/11, p< .01) and from post- to 
follow-up testing (F= 13.95, df= 11/11, 
$< .01). The variance change pattern for 
IT differed from that for both CG and 
SDT. There was significant reduction in 
variance from pre- to posttesting (F = 6.43, 
df = 11/11, $ < .01) and significant increase 
in variance from post- to follow-up testing 
(F = 12.30, df = 11/11, p < .01). The pre- 
and post-AT score variance did not differ 
(F = 1.91, df = 11/11, p > .05). 


Discussion 


In this direct comparison of SDT and IT, 
the therapies were found to be equally effec- 
tive in reducing snake phobic behavior of 
otherwise normal adult human beings. Both 
overt avoidance and verbal reports of fear 
or anxiety were reduced by learning-theory- 
derived operations which were directly op- 
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posed. The therapies did differ in their ef- 
ficiency since IT was completed in 45% of 
the time required by SDT. Control proce- 
dures resulted in less reduction in phobic 
behavior. Thus, IT would appear to have been 
the more efficient of two equally effective 
methods. 

Results of additional analyses, based on 
Bergin’s argument, indicate some necessary 
qualification of the present findings. These 
data indicate that the effect of SDT on AT 
was consistent across Ss and that it continued 
until the last point of testing. The effect of 
IT, as assessed by AT, was less consistent. 
Some Ss improved markedly and maintained 
their improvement. Others improved on this 
measure but, at follow-up, showed avoidance 
behavior that was little different from their 
pretreatment behavior. Finding a consistent 
and continuing effect of SDT was consonant 
both with the findings of Paul (1966, 1967) 
who used similar individual SDT, and with 
results of SDT conducted in groups (Paul, 
1968; Paul & Shannon, 1966). However, it is 
difficult to account for differential effects of 
IT across Ss, Controls were provided for dis- 
order, technique, and therapist variables in 
this study. The only departure from standard 
IT procedure was exclusion of scenes specif- 
ically suggested by psychodynamic literature, 
for example, Oedipal scenes. But there is as 
yet only tentative clinical evidence to indicate 
that explicit use of psychodynamic themes en- 
hances IT. Further, IT has been shown to be 
effective where no scenes based on psycho- 
dynamic themes were used (Kirchener & 
Hogan, 1967). 

As has been shown in Tables 1 and 2, the 
major findings of this study concerning effec- 
tiveness and efficiency of SDT and IT are 
consistent with results of previous research 
(especially Kirchener & Hogan, 1967; Lang 
et al., 1965). However, some reactions of Ss 
to the present treatments have not been re- 
ported widely in the behavior therapy litera- 
ture. For example, during SDT, one S showed 
considerable distress (tears, wringing of hands, 
tensing) and reported that “wavy lines” that 
she visualized between scenes had turned into 
moving, frightening snakes. Similarly, an SDT 
S began sobbing during visualization of a 
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TABLE 4 


MEANS AND VARIANCES OF AVOIDANCE TEST Scores 
AT Pre-, Post-, AND FOLLOW-UP TESTING 


CG SDT IT 
Testing 
M | Variance} M | Variance} M | Variance 
Pre 10.08| 18.81 |8.42 17.90 |8.16| 14.33 
Post 9.00| 27.71 |1.75 2.93 |1.91 2.23 
Follow-up | 7.90| 27.54 |1.25 .231 |3.16| 27.42 


b Note.—CG = control group; SDT = systematic desensitiza- 
tion therapy; IT = implosive therapy. 


scene in which she previously was comforta- 
ble. She reported that the scene, which in- 
cluded a snake in a closed cage, had developed 
suddenly and unexpectedly into a frightening 
one in which the snake came toward her. 
These may be examples of the same phe- 
nomenon reported by Weitzman (1967) as 
“flow of visual imagery” and by Weinberg and 
Zaslove (1963) as “resistance.” However, the 
degree of disturbance that Ss in this study 
experienced has not been reported elsewhere. 

Similar reactions to scenes during IT, of 
course, were expected. As required by IT, 
instructions for S to continue imagining fright- 
ening scenes were given when Ss showed dis- 
turbance and, after a time, S ceased to show 
disturbance, However, two Ss in the IT group 
showed a disturbance between IT sessions. 
One S, a college student, reported that after 
therapy she consistently visualized snakes 
when she shut her eyes, that her sleep was 
disturbed, that she could not attend lectures, 
and that she was unwilling to participate in 
the second session of IT. She was persuaded 
to continue in IT, and, at post- and follow-up 
testing, held the snake. A similar report of 
being very upset was given by a nurse who 
said that her work was disrupted frequently 
by her crying when she thought of snakes. 
She too participated in the second IT session 
and was successful at post- and follow-up 
testing. Similar between-treatment effects of 
IT have been reported by Roger Kotila.” For 
example, one of Kotila’s Ss began compulsively 
to check her bed for snakes every night. Re- 
sults such as these underline the criticisms of 
pehavior therapy (Breger & McGaugh, 1965; ' 
Weitzman, 1967) by indicating that the 


3 Roger Kotila, personal communication, May 21, 
1968. 
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operation of instructing S to imagine scenes 
does not lead necessarily to reliable responses. 

Findings of the present study support the 
view that IT is an effective, efficient behavior 
modification technique for use with phobias. 
Further, the results make more explicit evi- 
dence for SDT as an effective technique that 
has a consistent and continuing effect across 
Ss and across time. However, disturbances 
reported by both SDT and IT Ss during 
imagery, again raise the serious question of 
what Ss actually do following instructions to 
“imagine a scene.” 
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SPAN OF APPREHENSION IN ACUTE SCHIZOPHRENICS 


JOHN M. NEALE, CURTIS W. McINTYRE, ROBERT FOX, anp RUE L. CROMWELL 
Vanderbilt University 


The span of apprehension for two schizophrenic groups (good premorbid 
paranoid and poor premorbid nonparanoid) was compared with the span for a 


normal control group 


(hospital aides) by a forced-choice letter-recognition 


method analogous to signal-detection methodology. This procedure, in conjunc- 
tion with a quantitative model of the span, provides an indirect but relevant 
measure insensitive to variance imposed by nonperceptual systems. The span size 
was the same for all three groups when visual “noise” was absent. In the 
presence of noise, span size for both schizophrenic groups was about 50% smaller 
than that of the control group. It is argued that the reduction in span represents 
a true deficit in attention, uncontaminated by extraneous variables. 


A considerable portion of the literature on 
schizophrenia (Bleuler, 1951; Silverman, 
1964; Venables, 1964) suggests that a central 
characteristic of the schizophrenic process in- 
volves a deficit in attention. Although the hy- 
pothesis is certainly reasonable, it is difficult 
to evaluate empirically (e.g., Neale & Crom- 
well, 1969). One problem is posed by vaga- 
ries in the definition of attention. The difficul- 
ties are further compounded by the special 
psychophysical measurement problems posed 
by schizophrenics themselves. Because of their 
labile motivational systems and often retarded 
response systems, it is essential that care be 
taken so that abberations in these systems do 
not interfere with the measurement of atten- 
tion. 

In this article an experiment is described 
that attempts to circumvent these difficulties 
by employing an explicit model of one aspect 
of attention and by employing a measurement 
technique that is relatively insensitive to per- 
turbations imposed by extraneous systems. 
Specifically, the span of apprehension in schiz- 
ophrenics was measured. The span of appre- 
hension is a measure of the amount of infor- 
mation a person can simultaneously attend. 
Span was measured by a method derived from 
signal-detection procedures. The span of schiz- 
ophrenics was found to be less than that of 
control Ss, and we argue that this represents 
a demonstration of a deficit in attention that 


1 Requests for reprints should be sent to Curtis W. 
McIntyre, Department of Psychology, Vanderbilt 
University, Nashville, Tennessee 37203. 


is uncontaminated by extraneous nonpercep- 
tual variables. 

The span of apprehension is a measure of 
considerable relevance to any theory of at- 
tention, In the earliest experiments on the 
span, stimuli such as letters of the alphabet 
were exposed so briefly that eye movements 
could not occur (i.e., tachistoscopic presenta- 
tion). The Ss were requested merely to report 
the letters they had seen. Typically, Ss cor- 
rectly reported four or five letters even though 
many more were presented initially. But it has 
now been demonstrated that the limitation to 
four or five letters is one imposed by memory 
rather than by the perceptual capacity of the 
Ss (Sperling, 1960). Sperling showed that the 
span of apprehension is a positive function of 
the amount of information in the display by 
using a reporting method that removed much 
of the burden from S's immediate memory. 

Current theoretical conceptions of the span 
of apprehension involve the idea that all of 
the information in the display is available for 
processing for several hundred milliseconds 
aíter the display terminates. A hypothetical 
central scanning device is assumed to examine 
the elements in the display serially. If the 
scanner attends to an element, that element 

on to a subsequent information-proc- 
essing stage, but if the excitation from an 
element dissipates before the scanning device 
can attend to it, the information associated 
with the excitation is lost. 

Quite recently Estes (1965) has developed 
an explicit quantitative model of the span of 
apprehension that specifies an indirect method 
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TABLE 1 
DESCRIPTION OF SUBJECT GROUPS 
Group Age Education 

Normals 

M 29.83 11.23 

SD 8.64 2.79 
Good paranoids 

M 33.24 8.93 

SD 5.95 1.65 
Poor nonparanoids 

M 29.07 9.00 

SD 8.90 2.06 


for estimating the span. Although the model 
is not complex, there is no need to review 
its formal assumptions here. It will be help- 
ful, however, to state verbally the general 
rationale of the measurement procedure, From 
the standpoint of the S the basic task is 
simply a forced-choice letter-recognition task, 
where on any tachistoscopic exposure either 
one of two letters will appear. The S knows 
what these letters are, say for example the 
Letter T and the Letter F, and he also 
knows the a priori probability of their oc- 
currence, The letters can be thought of as 
analogous to the signals one would be re- 
quired to detect in a standard signal-detection 
experiment (see Green & Swets, 1966). If the 
signal or target letters are placed in an array 
of other irrelevant letters, then it is intui- 
tively clear that these irrelevant letters can 
be considered analogous to noise in the signal- 
detection sense. That is, if an S is going to 
make above-chance recognitions of the signal 
letters, his scanning process must discrimi- 
nate the signal letter from the irrelevant noise 
letters. The number of letters that the scan- 
ner examines, of course, is an operational 
statement of the span of apprehension. 

Estes has shown how the number of letters 
scanned can be estimated from the percentage 
of correct recognitions and from the number 
of letters physically present in a display. He 
has demonstrated that as the number of let- 
ters in a display increases the probability of 
correct recognition predictably declines, but 
the number of elements scanned or processed, 
that is, the span of apprehension, increases 
with display size. 

It should be emphasized that the Ss task 
remains the same, independent of the num- 
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ber of noise letters in a display. He merely 
gives a forced-choice recognition response; 
for example, it was a T or it was an F. Sub- 
jectively, the S's task seems similar regard- 
less of the number of noise elements presented 
in the display. Note that the scanning proc- 
ess presumed to operate does not involve eye 
movements; rather, it is a central process. 

Tn the present experiment the span of schiz- 
ophrenics and of control Ss under two infor- 
mation conditions was measured. In one con- 
dition no noise letters were presented. In the 
second condition, the signal letter was sur- 
rounded by seven noise letters. Explicit a 
priori predictions were not made. While con- 
Structs such as narrowed attention (Venables, 
1964) and extensive scanning (Silverman, 
1964) have been applied to the performance 
of schizophrenics on various tasks, the unique 
operations associated with these constructs 
suggests that it would not be useful to relate 
them directly to this experiment. 


METHOD 
Subjects 


Forty-five males were selected to form three 
groups: 15 good premorbid paranoid schizophrenics; 
15 poor premorbid nonparanoid schizophrenics; and 
15 hospital aides. The groups were equated for age 
and level of education, See Table 1 for group means 
and standard deviations. Differences between these 
means were not significant. All patient Ss were 
hospitalized for less than 3 yr, and most were on 
phenothiazine medication. Premorbid adjustment 
status conformed to the criteria suggested by Held 
and Cromwell (1967), while paranoid classification 
was based upon both clinical history and psychiatric 


Apparatus 


The stimulus displays, consisting of arrays of 
letters, were designed and constructed in the follow- 
ing way. Each array was made by typing letters 
on a white index card, which in turn was mounted on 
a cardboard insert that facilitated rapid insertion and 
removal from a tachistoscope. To allocate letters in 
gach array, an imaginary matrix of 16 letter spaces 
was located at the center of each card. Each matrix 
(4 letters wide and 4 letters high) subtended 2° X 3 
of visual angle. Each array contained one of two 
target letters (T or F); cach letter appeared once on 
two separate cards in each of the 16 possible matrix 
Positions, yielding a total of 32 T arrays and 32 F 
arrays. One set of 32 arrays, composed of 16 T cards 
and 16 F cards, contained only the target letter 
(Matrix Size 1). A second set of 32 arrays, 16 T 
cards and 16 F cards, contained the target letter plus 
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7 additional noise letters (Matrix Size 8). In this 
second set of stimulus displays, a target letter was 
first located at 1 of the 16 possible positions and 
then 7 different consonant letters were randomly 
selected without replacement and randomly allo- 
cated to 7 of the 15 remaining positions in the 
matrix. Each letter in the matrix subtended approxi- 
mately 12’ X 18’ of visual angle, while letters occur- 
ring in adjacent cells were separated vertically by 30’ 
of visual angle and laterally by 24’ of visual angle. 

The stimulus arrays were exposed in a two-channel 
tachistoscope (Scientific Prototype, Model 800-E). 
One field, the constantly illuminated fixation field, 
contained four black lines 12’ in width arranged 
to form a 24° X 34° cornerless rectangle. The center 
of the rectangle defined the fixation area. The fixa- 
tion field luminance was 9.6 ftl. The stimulus displays 
were exposed for 90 msec. in the other field of the 
tachistoscope. The exposure field luminance was 
78 ftl. 


Procedure 


Each S was given a standard set of verbal instruc- 
tions followed by 20 practice trials. Practice trials 
were given to habituate S to the task and to insure 
that S understood the instructions. The Ss failing to 
perform above chance on the practice trials were to 
be dropped from the study; only one S, from the 
good premorbid paranoid group, was eliminated for 
this reason. During the practice trials, S was pre- 
sented with two randomly selected blocks of 10 
trials, each block drawn from a separate matrix size 
with the restriction of an equal occurrence of the 
Critical elements (5 Ts, 3 Fs). After the practice 
trials each S was allowed to rest for 2 min. before 
the experimental trial blocks began. Each S then 
viewed two sets (Matrix Sizes 1 and 8) of four 10- 
trial blocks, each separated by a 2-min. rest period. 
Order of presentation of the matrix sizes was con- 
trolled by an ABBA, BAAB counterbalancing 
scheme within Ss. Each S received a different randomi- 
zation of stimulus materials. Intertrial interval varied 
with the individual S's response rate, but an average 
of approximately one trial every 15 sec. was main- 
tained. A 30-sec. rest was given after each block of 
10 trials, 


REsuLTs AND DISCUSSION 


The basic data were the mean number of 
correct detections by each of the three groups 
ior the two matrix sizes. These data are 
plotted in Figure 1. An analysis of variance 
revealed significant effects for groups (F= 
328, df= 2/42, p<.05); for matrix size 
(F = 598.04, df=1/42, p<.001), and for 
Groups x Matrix Size (F = 5.56, df = 2/42, 
? « 01). To determine the source of the in- 
teraction one-way analyses were performed. 
For Matrix Size 1 the groups did not differ 
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o——o Normals 
&—— Good Paranoids 
o——o Poor Nonporanoids 


P(C) 


MATRIX SIZE 
Fic. 1. Probability of correct response as a function 
of matrix size for the control and schizophrenic 
groups. 


(F= 2.20, df= 2/42, p> .10); for Matrix 
Size 8 the groups did differ significantly (F = 
58.16, df=2/42, p<.001). A Newman- 
Keuls test revealed that at Matrix Size 8 
both schizophrenic groups performed at a sig- 
nificantly lower level than did the control 
group (p= .01) and, further, the schizophre- 
nic groups did not differ between themselves. 

The mean detection scores for each of the 
three groups at Matrix Size 8 were used to 
compute the estimated span of apprehension 
in accord with the Estes model. By that anal- 
ysis the control Ss processed 4.48 letters while 
the good premorbid paranoid group processed 
2.52 letters and the poor premorbid nonpara- 
noid group processed 2.40 letters. Thus it ap- 
pears that the span of apprehension for 
schizophrenics is approximately one-half that 
of the control Ss. 

It is unlikely that the poor performance of 
the schizophrenics was a function of their 
age, education, or level of drug dosage,” for 
none of the correlations between these varia- 
bles and detection scores was significant (r — 
—.06, —.02, +.05, respectively). 

Further, it is unlikely that the poor per- 
formance of the schizophrenics at Matrix Size 
8 was due to a general failure to comprehend 
the task or a similar kind of generalized defi- 
cit, since the schizophrenics clearly performed 
at a level comparable to that of the control 
group under the Matrix Size 1 condition. 


2 The authors would like to thank J. D. Griffith 
for his assistance in ranking drug levels. 
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Therefore, we are forced to conclude that 
the relatively poor performance of the schizo- 
phrenic group under the Matrix Size 8 con- 
dition results from an inability to process 
efficiently the information contained in the 
eight-letter display. In brief, the schizo- 
phrenics have a reduced span of apprehension. 
This reduction in span seems to be attribut- 
able to some inefficiency of the central pro- 
cessing mechanism that analyzes and encodes 
the information presented in the briefly ex- 
posed display. This interpretation seems most 
reasonable because many of the potential al- 
ternative explanations based upon the opera- 
tion of extraneous variables have been mini- 
mized by the design and procedures employed 
in this experiment. 

With respect to the nature of the deficiency 
that produces a reduction in the span of ap- 
prehension, several hypotheses can be con- 
sidered. For example, considerable evidence 
suggests that two primary characteristics of 
schizophrenics are reduction in speed of re- 
sponding and greater susceptibility to dis- 
traction. These concepts of “slowness” and 
“distractibility” are induced from experi- 
mental situations quite different from the 
present experiment and may not be applicable 
to tachistoscopic recognition. Yet it is pos- 
sible to entertain the hypothesis that the 
noise letters act as potent distractors that im- 
pede the search for the target letter, Simi- 
larly, it is possible that processing of informa- 
tion from a brief display simply proceeds at 
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a slower rate in schizophrenics. Either dis- 
tractibility or slowness would result in a re- 
duced span of apprehension. Decisions be- 
tween these alternatives and others can be 
made through further research using the de- 
sign of the present experiment and the ana- 
lytic techniques of the span of apprehension 
model. 
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EXPERIMENTAL CONTROL OF SEXUAL DEVIATION 
THROUGH MANIPULATION OF THE NOXIOUS 
SCENE IN COVERT SENSITIZATION* 


DAVID H. BARLOW,? HAROLD LEITENBERG, anD W. STEWART AGRAS 8 


University of Vermont 


Pedophilic behavior in one S and homosexual behavior in another S were 
decreased, increased, and once again decreased by introducing, removing, and 
reintroducing the noxious scene in covert sensitization, a form of aversion 
therapy. The results indicated that verbal description of a nauseous scene was 
an effective aversive stimulus, and that pairing this scene with scenes of the 
undesired behavior was responsible for declines in deviant sexual behavior 


during covert sensitization. 


The treatment of sexual deviation by aver- 
sion therapy has recently become more popu- 
lar (Feldman, 1966). All studies reported to 
date, however, are either uncontrolled single 
case reports or uncontrolled group outcome 
studies. It is not possible to determine from 
such data whether the reported success is due 
to the conditioning technique or to another 
of the myriad of variables present either 
singly or in combination in the therapeutic 
situation. 

The usual way to isolate relevant treatment 
variables is the group comparison in which a 
control group experiences the same treatment 
as an experimental group with the exception 
of the independent variable. Group differences 
are then ascribed to the independent variable. 
An alternative method used in this study is 
to demonstrate experimental control in indi- 
vidual cases. One way of doing this is to se- 
quentially vary some aspect of treatment 
while measuring associated changes in a well- 
specified, clinically relevant behavior (cf. 
Agras, Leitenberg, & Barlow, 1968; Leiten- 
berg, Agras, Thomson, & Wright, 1968; Wolf, 
Birnbrauer, Williams, & Lawler, 1965). 


1This study was supported in part by United 
States Public Health Service Clinical Research Center 
Grant FR-109 and National Institute of Mental 
Health Grant MH-13651. Part of the data was 
Presented at the meeting of the Eastern Psychological 
Association, Washington, D. C., April 1968. 

? Requests for reprints should be sent to David H. 
Barlow who is now at the Department of Psychiatry, 
University of Mississippi Medical Center, Jackson, 
Mississippi 39216. 

* Now at University of Mississippi Medical Center. 


In this experiment the aversion therapy 
studied was covert sensitization (Cautela, 
1966), In this procedure, descriptions of ex- 
tremely noxious scenes are paired with scenes 
of the undesired behavior. Uncontrolled case 
studies suggest that covert sensitization may 
modify sexual deviation. Cautela (1967) 
treated two homosexuals with this method 
resulting in reports by the patients of de- 
creases in homosexual behavior, while Davi- 
son (1968) treated a case of long-standing 
sadistic fantasies with a combination of covert 
sensitization and positive conditioning to 
heterosexual stimuli and achieved a remission. 
Other case studies include Kolvin’s (1967) 
successful treatment of a fetish. 

The crucial procedure in covert sensitiza- 
tion is thought to be the pairing of verbal 
descriptions of a noxious scene with descrip- 
tions of scenes involving the undesired be- 
havior. This notion was tested in the present 
study in two cases of sexual deviation by first 
pairing the two scenes, then removing the 
noxious scene while holding other variables 
constant, and finally pairing the scenes once 
more. If the pairing procedure is critical to 
therapeutic success, then any improvement 
should be reversed or stopped when the nox- 
ious scene is removed. 


METHOD 


Subjects. The Sı was a 25-yr.-old married male 
who reported a 13-yr. history of pedophilic experi- 
ences ranging from fantasies to several instances of 
sexual contact. He came for treatment after the 
solicitation of a neighbor's 9-yr.-old daughter was 
discovered. The neighbor, rather than report him to 
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the police, insisted that he seek psychiatric help. 
Sexual relations with his wife averaged twice a 
month, 

The S» was a 32-yr.-old married male who reported 
a l4-yr. history of homosexual experiences averaging 
about three contacts per week, usually in public 
toilets. He recently had fallen in love with a “boy- 
friend,” which was threatening his marriage and 
which motivated him to seek treatment. Sexual 
relations with his wife, although prevalent early in 
the marriage, had been virtually nonexistent for the 
previous 3 yr. 

Measures. A hierarchy of sexually arousing scenes 
was constructed. For S; the hierarchy contained 45 
scenes involving small girls. For example, the top item 
in the hierarchy was, “You are alone in a room with 
a very sexy looking 10-year-old girl with long blond 
hair" One of the least arousing items was, “While 
driving in your car you can see a small, thin, six- 
year-old girl walking down the street.” For S; the 
hierarchy contained 27 scenes involving males. The 
top scene in this hierarchy pictured a “30-year-old, 
well-built, good-looking fellow, dressed neatly and 
standing in a large rest room.” At the other end of 
the hierarchy was “an older, sloppily-dressed fellow” 
in the same location. Five scenes involving the boy- 
friend were added to these 27 items, The five scenes 
ranged from sexual contact to talking with him on 
the telephone. Thus, the total number of items 
was 32. 

Base-line procedures were then instituted for five 
sessions, during which three measures were taken: 
First, Ss kept a small notebook in which they re- 
corded each time that they were sexually aroused, 
for Sı by the sight of an immature girl, and for S. 
by the sight of a mature male. The Si, as an out- 
patient, recorded these events daily. Due to the dis- 
tance of his home from the treatment center, S, 
remained in the hospital during the week and re- 
turned home on weekends. He was, therefore, in- 
structed to visit one of the local bars 4 nights a week 
for 1-2 hr, during which time he recorded incidents 
of sexual arousal, A 

Second, the hierarchy scenes were typed on individ- 
ual cards. For Sı the 45 cards were divided into five 
packs of 9 cards each, chosen in such a way that 
each pack contained an equally arousing set of 
scenes. The 32 cards of Sa were not divided. The S, 
then was given one of the packs, while Sa was given 
the whole set. On every experimental day each S 
was asked to enter a separate room where he could 
sort each of the cards into one of five envelopes 
marked 0-4. He was told, 2 


The numbers on the envelopes represent amount of 
sexual arousal, 0 equals no arousal, 1 equals a little 
arousal, 2 a fair amount, 3 much, and 4 very much 
arousal, I would like you to read the description of 
each scene and place a card in the envelope which 
comes closest to how arousing the Scene is to 
you at this moment, 


If he then put one card in the Number 4 envelope 
and one in the Number 3 envelope, his score to that 
point would be 7, 

Third, galvanic skin responses (GSRs) were re- 
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corded from a Grass polygraph (Model 7) with 
silver, silver-chloride, palmar and wrist electrodes to 
six selected scenes from the hierarchy of each S. 
The same six scenes were used throughout the ex- 
periment. The S was relaxed and then asked to close 
his eyes and imagine the scene as clearly as possible. 
GSR deflection was measured as change in log con- 
ductance that was averaged for the six scenes, This 
measure was taken during every session of the base 
line and before every second session during treatment, 

Procedure. During the initial interview Ss were 
told, 


We view this type of problem as a bad habit that 
has been picked up over the years and we are 
going to try to break that habit. We will be using 
some techniques that have been found effective in 
dealing with bad habits such as this one. How- 
ever, no matter how successful, every treatment has 
its ups and downs so don't get discouraged. 


This last sentence was designed to allay any fear of 
ultimate failure that might have arisen when extinc- 
tion procedures were later introduced. The Ss were 
then trained in deep muscle relaxation following the 
method described by Jacobson (1938). The highest 
six items in Sys hierarchy were chosen for sensitiza- 
tion. The Sı was seen twice a week as an outpatient. 
Since Sas major problem at the outset was an “over- 
whelming" attraction for his boyfriend, the five scenes 
describing the boyfriend were chosen for sensitiza- 
tion along with one scene concerning sexual contact 
with “pickups” in public toilets. The Ss was seen 5 
days a week as an inpatient, and from Experimental 
Day 12 on was generally seen twice a day. 

Tn each session Ss were given relaxation instructions 
and presented with eight scenes. In four scenes S was 
described approaching the small girl (male), feeling 
nauseous and vomiting. For example, in one of the 
homosexual scenes, S; was described approaching his 
boyfriend’s apartment. 


As you get closer to the door you notice a queasy 
feeling in the pit of the stomach. You open the 
door and see Bill lying on the bed naked and you 
can sense that puke is filling up your stomach and 
forcing its way up to your throat. You walk over 
to Bill and you can see him clearly, as you reach 
out for him you can taste the puke, bitter and 
Sticky and acidy on your tongue, you start gagging 
and retching and chunks of vomit are coming out 
9f your mouth and nose, dropping onto your shirt 
and all over Bill's skin. 


The description of the nauseous scene was usually 
expanded and lasted from 30 to 60 sec. In the 
remaining four scenes S would be described approach- 
ing the small girl (male) and beginning to feel 
nauseous, At that point he would turn, start walking 
away from the scene, and immediately feel relieved 
and relaxed. The scenes were presented randomly. 
The Sexually arousing scene was presented for ap- 
Proximately 10 sec, the nauseous scene for 30-60 
sec., with an intertrial interval of 30 sec. 4 

. After 6 acquisition sessions consisting of 48 pair- 
ings for Sı, and 13 acquisition sessions consisting of 
104 pairings for S, extinction was introduced. The 
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Fic. 1, Total score on card sort per experimental day and total frequency of pedophilic sexual urges in blocks 
of 4 days surrounding each experimental day. (Lower scores indicate less sexual arousal.) 


S was told, *For the next few sessions we are going 
to change the procedure a bit. We have found that 
this is the best course of action at this time. Remem- 
ber to imagine only what I describe" Note that 
instructions during extinction suggest continued prog- 
ress. The procedure during extinction consisted of 
presenting the sexually arousing scene for 10 sec., 
leaving the 30-sec. nauseous interval blank, and 
saying, “Stop imagining that,” at the end of the 
interval. All other therapist behaviors remained 
constant. 

After eight extinction sessions consisting of 64 
scene presentations for each S, reacquisition was in- 
troduced. The design, then, consists of base-line 
Measures; acquisition, in which sexually arousing 
Scenes are paired with nauseous scenes; extinction, in 
which sexually arousing scenes are presented alone; 
and reacquisition. 


RESULTS 


Figure 1 plots the total score of the card 
sort for each experimental day and the total 
frequency of sexual arousal in blocks of 4 
days surrounding each experimental day for 
Si. These 4 days consisted of the 2 days be- 
iore the session, the day of the session, and 
the day after the session. 

During the base-line phase there was no 
treatment. The card sort remained stable dur- 
ig this period while reports of pedophilic 
urges steadily rose. Acquisition, in which the 


nauseous scene was paired with the sexually 
arousing scene, resulted in a sharp drop in 
both measures of inappropriate sexual arousal. 
In extinction the nauseous scene was omitted, 
resulting in an increase in both measures of 
sexual arousal. 

The instruction preceding this extinction 
phase conveyed an expectancy of therapeutic 
progress. This expectancy was verified when 
S said, toward the end of extinction, “I know 
you're doing your best, but I guess I'm just 
not cut out for this treatment.” At this time 
Sı became upset and depressed. 

In reacquisition, the nauseous scene was 
reintroduced and measures of sexual arousal 
dropped to zero. During this phase Sı re- 
ported increased heterosexual behavior. 

GSR, analyzed as change in log conduc- 
tance, was averaged over each of the four 
experimental periods: The means (Ms) and 
standard deviations (SDs) in „mhos are: 
base line, M = 221 and SD = 141; acquisi- 
tion, M =27 and SD = 29; extinction, M = 
200 and SD = 92; reacquisition, M = 30 and 
SD=23. 

Although there is a great deal of variability, 
it is clear that S,'s arousal rose during ex- 
tinction and declined during reacquisition. 
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EXPERIMENTAL DAYS 
Fic. 2. Total score on card sort per experimental day and homosexual urges as reported 


following nights out. 


Figure 2 plots the total score of the card 
sort on each experimental day for S» and also 
the frequency of sexual arousal on each night 
out. For various reasons Sz was not able to 
go out each day that sessions were held. At 
one point an illness prevented him from visit- 
ing the local bars for 4 consecutive nights 
(Experimental Days 22-25). Therefore, 
points referring to frequency of sexual arousal 
are plotted only on experimental days for 
which there are data, 

During the base-line phase both the card 
sort and reports of homosexual urges were 
relatively stable. The acquisition phase re- 
sulted in a drop in both measures of sexual 
arousal. In the course of extinction both mea- 
sures rose sharply with the total score of the 
card sort rising above base-line levels. 

During this phase S; also became depressed, 
and near the end of the period said: “This 
treatment isn’t doing much good. I'm just as 
bad now as I was when I came in." At this 
time he engaged in his first homosexual affair 
since treatment began. Due to an illness of 
the therapist, there was a 9-day period be- 


(Lower scores indicate less sexual arousal.) 


tween Experimental Days 20-22 during which 
S was only seen once (Day 21). Throughout 
this period S generally refused to leave the 
hospital, even for the weekend, and remained 
quite depressed, 

In reacquisition, measures of sexual arousal 
dropped to zero. All sessions beyond the twen- 
tieth session (Day 31) were self-administered. 
At this time, S; also reported increases in 
heterosexual behavior and acquisition of het- 
erosexual fantasies. 

Although mean magnitude of GSR declined 
Sharply during acquisition, it did not recover 
during extinction and therefore could not be 
interpreted, 


Discussion 


These findings demonstrate that pairing 4 
noxious scene with a sexually arousing scene 
is a crucial procedure in covert sensitization. 
The effect of this pairing was separated from 
other psychotherapeutic variables such as 
therapeutic instructions, patient expectancies 
of improvement, and rapport between patient 
and therapist since these variables were still 


CONTROL OF SEXUAL DEVIATION 


present during extinction when the noxious 
scene was removed, and yet behavior re- 
gressed. 

The two noxious stimuli most often used 
in aversion therapy are emetic drugs (James, 
1962) and more recently shock (Feldman & 
MacCulloch, 1965). This experiment demon- 
strates that an intensely imagined noxious 
scene can also act as an effective aversive 
stimulus. There would seem to be several ad- 
vantages to using a noxious scene in clinical 
situations. First, the patient is less likely to 
refuse treatment because of the pain involved, 
(Rachman, 1965). Second, it can be widely 
employed by many therapists, since it does 
not require drugs or apparatus. 

Generally, there have been two therapeutic 
strategies in dealing with sexual deviations: 
decreasing deviant sexual behavior and in- 
creasing appropriate heterosexual behavior. 
This experiment indicates that covert sensiti- 
zation alone is sufficient to decrease deviant 
behavior. More controlled research employing 
precise measures of heterosexuality is needed 
to determine what treatment variables are re- 
sponsible for increases in heterosexual behav- 
lor. Further research is also necessary to as- 
certain whether reductions in deviant behavior 
resulting from covert sensitization would be 
maintained without continued treatment and 
careful attention to later sexual adjustment. 

The present study does not completely over- 
come a measurement problem common to 
most psychotherapy research, namely, an ex- 
cessive reliance on the patient’s subjective re- 
ports of progress. Although there is evidence 
that GSR is a valid index of sexual arousal 
(Solyom & Beck, 1967; Wenger, Averill, & 
Smith, 1968), the remaining measures are not 
behavioral in the sense of being publicly 
observable. While observable and objective 
behavioral measures have been devised for 
some neurotic (e.g., Leitenberg et al., 1968) 
and psychotic disorders (Ayllon & Azrin, 
1965), it is more difficult in cases of sexual 
deviation where more than one person is 
involved. Recent studies in our laboratory 
and elsewhere (McConaghy, 1967), however, 
suggest that changes in penile volume to slides 
of inappropriate sexual objects, a measure 
originally devised by Freund (1963), may be 
useful in the measurement of sexual deviation. 
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ANXIETY AND OVERGENERALIZATION: 
NEGATIVE RESULTS: 


E. NEIL MURRAY? 
State University of New York at Buffalo 


The effect of chronic and experimental anxiety on stimulus generalization was 


investigated. The paradigm consisted 
response to lines 8.9 cm. long by means 


of conditioning a voluntary motor 
of verbal reinforcement and recording 


amplitude of generalized response to six shorter lines as well as to the training 
stimulus, No relationship was found between either chronic or experimental 


anxiety and the height of the gradients 
significant difference was obtained only 


for men or for women. For slope, a 
with experimental anxiety and only 


for men, but this result failed to replicate. Conclusions are: (a) chronic anxiety 
level is unrelated to stimulus generalization; (b) experimentally induced anxiety 
has not been demonstrated convincingly as having an effect on stimulus generali- 
zation; (c) the popular notion that anxiety leads to overgeneralization is based 


on the weakest of experimental evidence. 


The notion that increased anxiety leads to 
overgeneralization or increased stimulus gen- 
eralization is often cited as an example of the 
application of general principles of learning to 
the understanding of symptoms of emotional 
disturbance. According to Mednick (1958), 
broadened generalization causes more distant 
stimuli to become associated with anxiety, 
and anxiety level mounts accordingly, thereby 
establishing a positive feedback System of 
ever-increasing anxiety. 

Although a study by Mednick (1957) is 
widely quoted as favoring the anxiety-over- 
generalization hypothesis, greater generaliza- 
tion in a middle-anxious group and no dif- 
ference between high- and low-anxious groups 
was actually found, and these results cannot 
be accepted as positive evidence. In fact, 
there is only one investigation of the effects 
of anxiety on stimulus generalization that has 
yielded results that clearly support the hy- 
pothesis (Rosenbaum, 1953, 1956). Rosen- 


1The investigation was Supported in part by 
United States Public Health Service Grant MH- 
07080. This article is based on a doctoral dissertation 
submitted to the University of Pittsburgh. The 
author wishes to thank Arnold H. Buss, chairman of 
the committee, for his advice and direction at every 
stage of the research. For their thoughtful com- 
ments, appreciation is expressed also to A. David 
Lazovic, William J. Meyer, and Arthur R. Orgel, 
members of the committee. 

? Requests for reprints should be sent to E. Neil 
Murray, Department of Psychology, State University 
of New York, Buffalo, New York 14214. 
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baum obtained gradients from psychiatric pa- 
tients who had been rated high and low in 
clinical anxiety and gradients from college 
Students who both scored high and low on 
the Taylor Manifest Anxiety (MA) scale 
(Taylor, 1953) and were subjected to one of 
three induced experimental anxiety condi- 
tions: strong shock, weak Shock, or buzzer. 
He found that students in the strong shock 
group produced a higher and flatter gradient 
than those in the weak shock and buzzer 
groups, and that within the strong shock 
group students with high MA scale scores 
Produced a higher gradient than those with 
low MA scale scores, 

These results must be evaluated cautiously 
for there were several methodological restric- 
tions concerning the definition and measure- 
ment of stimulus generalization in Rosen- 
baum’s paradigm: (a) Ss were instructed, 
rather than conditioned, to respond to a given 
“correct” stimulus and not to respond to 
other stimuli; (5) Ss were instructed to re- 
spond as rapidly as possible upon presenta- 
tion of the stimuli; c) generalization stimuli 
Were introduced during training trials, and 
large numbers of additional training trials 
(“booster” trials) were run during the gen- 
eralization series. These procedures define 
generalization in terms of error tendencies, 
and they also resemble the operations em- 
Ployed in studies of discrimination learning 
more closely than those traditionally used in 
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studies of stimulus generalization. A detailed 
criticism of experiments of this general nature 
has been provided by Brown, Clarke, and 
Stein (1958). 

This study attempted to extend Rosen- 
baum’s findings by using a procedure that 
minimized the difficulties inherent in the ear- 
lier work: (a) the training procedure con- 
sisted of conditioning a single stimulus (an 
8.9-cm. line); (b) S was given ample time 
to make a response; (c) the test series con- 
tained generalized stimuli (shorter lines), as 
well as several presentations of the training 
stimulus, but no booster trials. College stu- 
dents with high, middle, and low MA scale 
scores were subjected to one of three experi- 
mental conditions that closely resembled those 
used by Rosenbaum (1953), and it was hy- 
pothesized that increases in MA scale score 
and experimental anxiety would lead to 
greater generalization responsivity. 


METHOD 
Subjects 


Ninety men and 90 women were divided into 
thirds for each sex on the basis of the MA scale 
scores of an introductory psychology pool of 453 Ss. 
Ten of the 30 Ss in these six groups were randomly 
assigned to each of the three experimental anxiety 
conditions, 


Stimuli 


Seven lines, 5.0, 5.5, 6.1, 6.7, 7.4, 8.1, and 8.9 cm. 
long, were selected so that each successive length 
represented an approximate 10% increase over the 
length of the preceding line. Each of these stimuli 
was presented on a standard memory drum for 4 sec., 
followed by a 4-sec. blank interval. After a stimulus 
had been presented, S pulled a large handle with the 
right hand, and the experimenter (E) immediately 
e the amplitude of each response in pounds of 
pull, 


Procedure 


The Ss in the shock groups received stimulation 
through a clip placed on the fourth finger of the left 
hand. For Ss in the strong shock groups the current 
varied between 1.0 and 2.0 ma. (M = 1.6), and for 
Ss in the weak shock groups the current varied be- 
tween .$ and .9 ma. (M =.7). The Ss in the buzzer 
groups were told that they would occasionally hear a 
buzzer. During a pretraining session Ss were trained 
to differentiate between a hard and a soft pull on the 
handle. For men a hard pull was defined as 35 Ib. 
or more and a soft pull as less than 35 Ib., and for 
women a hard pull was defined as 25 lb. or more 
and a soft pull as less than 25 lb. The Ss next were 
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Fic. 1. Response amplitude for men differing in 
experimental anxiety. 


given a sample of the shock or buzzer and told that 
their responding would be unrelated to its occurrence. 
Twenty training trials with the 8.9-cm. line ensued; 
if S made a hard pull he was told “right,” and if he 
made a soft pull he was told “wrong.” Stimulation 
occurred twice during these trials, randomly during 
Trials 1-5 and again during 11-15. The criterion for 
successful training was 8 of the last 10 responses 
correct (hard pulls). 

Following training, the generalization stimuli, con- 
sisting of six randomized orders of the seven stimuli, 
were presented. Stimulation also occurred twice dur- 
ing these trials, randomly during Trials 8-15 and 
21-28. 

RESULTS 


MA Scale Score 

There were no systematic differences in 
either the heights or the slopes of generaliza- 
tion gradients as a function of M: ‘A scale score 
for men or for women. 


Experimental Anxiety 

Men’s results, The gradients for men based 
on mean pounds of pull are presented in Fig- 
ure 1. As the generalization stimuli become 
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more distant (shorter) from the training stim- 
ulus (A) all three curves decrease regularly. 
The gradient for the Strong shock group is 
both highest and flattest, the gradient for the 
weak shock group is intermediate, and the 
gradient for the buzzer group is both lowest 
and steepest. Analysis of variance of these 
data reveals that, although there are no dif- 
ferences in height (F = 1.7, df — 2/81), dif- 
ferences in slope are highly significant (F= 
2.9, df = 12/486, p < .005). 

Women's results, The gradients for women 
are presented in Figure 2 and Suggest smaller 
differences than were true for the men. Anal- 
ysis of variance of these data reveals differ- 
ences neither in the height (F — 0.8, df=2/ 
81) nor the slope (F=0.5, dj = 12/486) of 
the gradients. 


Replications 


Because there was a significant effect of ex- 
perimental anxiety on the slope of generaliza- 
tion gradients for men but not for women, 
it was decided to attempt to replicate these 
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findings for the two most disparate male | 
groups, strong shock and buzzer. The replica- 
tion consisted of two identical phases, the 
first using the same E and the second a 
naive E, Each phase employed 20 additional 
male Ss from the same introductory psychol- 
ogy pool who were randomly assigned to the 
two conditions, Exactly the same experimen- 
tal procedure as for the original male groups 
was followed. 

Phase 1. The gradients for the two groups 
in the first phase, presented in Figure 3, are 
similar in height and slope. Analysis of vari- 
ance of these data indicate no significant 
differences, 

Phase II, The gradients for the two groups 
run with the naive E in the second phase are 
presented in Figure 4. The gradient for the 
buzzer group is higher, suggesting that this 
group made harder pulls than the strong 
shock group. Slope differences appear to be 
minimal. The analysis of variance performed 
on these data again failed to achieve signifi- 
cance for either height (F = 1.3, df = 1/18) 
or slope (F — 1.0, df = 6/108). When the 
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Fic. 4. Response amplitude for Replication Phase II. 


data from two replications were pooled, the 
F values for both height and slope were less 
than 1. Thus there were no differences be- 
tween the gradients, and the original slope 
differences for men failed to replicate. 


Discussion 
MA Scale Score and Clinical Anxiety 


The lack of a systematic relationship be- 
tween MA scale score and stimulus generaliza- 
tion found in this study is consistent with 
the majority of previous research on the 
topic. In view of the fact that six other experi- 
ments (Arnhoff, 1959; Burns, 1964; Buss, 
1955; Fager & Knopf, 1958; Mednick, 1957; 
Rothman, 1962) have also failed to demon- 
Strate a relationship between either MA scale 
Score or clinical anxiety and stimulus gen- 
eralization it seems clear that evidence in 
Support of such an hypothesis is lacking. 


Experimental Anxiety 


Although originally there was a flatter gen- 
eralization gradient for the male strong shock 
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group than for the male buzzer group, this 
effect failed to replicate and proved to be un- 
reliable. In only one other related study 
(Rosenbaum, 1953) has there been an at- 
tempt to vary anxiety experimentally, and in 
it a higher and flatter gradient was also ob- 
tained from the strong shock group. However, 
it has been pointed out that there were proce- 
dural difficulties in Rosenbaum’s work, and, 
furthermore, the results of the present study 
are more general. The negative findings are 
not limited to situations demanding rapid re- 
sponding, booster trials, or prior training with 
the generalization stimuli, and they apply to 
women as well as to men. These results, then, 
fail to substantiate the hypothesis that ex- 
perimental or situational anxiety affects stim- 
ulus generalization gradients in a systematic 
manner. 
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EFFECTS OF TASK COMPLEXITY AND VERBAL 
EVALUATION ON THE LEARNING OF 
NORMALS AND SCHIZOPHRENICS * 


DAVID L. STREINER ? 
Syracuse University 


This study tested the relative predictive powers of three theories of schizo- 
phrenia: Mednick’s drive theory; Rodnick and Garmezy’s censure-deficit 
theory; and Atkinson and Robinson’s censure-sensitivity theory. Thirty nor- 
mals, 30 process schizophrenics, and 30 reactive schizophrenics were each run 
in a low- and high-complexity verbal discrimination task. Half of each group 
was praised for correct responses, the other half was censured for incorrect 
ones. While normals did not respond differentially to the reinforcers, schizo- 
phrenics tended to learn faster when censured than when praised, supporting 


Atkinson and Robinson, The performance decrement from the low- to the high- 
complexity task was equivalent for all groups, which is nonsupportive of Med- 
nick’s theory, An alternative explanation of the results is offered and a cautious 
interpretation of the data is urged due to medication differences between 


normals and schizophrenics, 


Within recent years, there have been two 
major classes of theories that attempt to ex- 
plain differences in performance in learning 
tasks between normals and schizophrenics: 
drive theories and censure-sensitivity theories, 

The major drive theory accepted by West- 
ern psychologists is that of Mednick (1958), 
which assumes that schizophrenics are in a 
state of drive that is higher than that of nor- 
mals. Although Mednick’s theory has come 
under serious criticism on both theoretical 
and empirical grounds (e.g., Buss, 1966), it 
has found wide acceptance in the United 
States, presumably due in large measure to 
its congruence with the familiar Hull-Spence 
theory. 

Based on Hull's assumption of multiplica- 
tive relationship between drive (D) and habit 
strength (sHp), Mednick predicts that in 


l'This article is based on a doctoral dissertation 
submitted to the Department of Psychology, Syracuse 
University. The author wishes to express his deep 
gratitude to the members of his committee, Sanford 
J. Dean (chairman), Wallace R. McAllister, and 
Norman Simonson. Special thanks are also due to 
Edward Parkes, Director of Psychological Services, 
and the staff of Norristown State Hospital, Norris- 
town, Pennsylvania, whose cooperation made this 
study possible, 

? Requests for reprints should be sent to David L. 
Streiner, Department of Psychiatry, Hamilton Psy- 
chiatric Hospital, P.O. Box 585, Hamilton, Ontario, 
Canada. 


simple tasks, schizophrenics, because of their 
higher drive, should perform better than nor- 
mals. However, since D increases the strength 
of all responses, schizophrenics, in compari- 
son with normals, should reach criterion more 
slowly and make a greater number of errors 
in complex tasks, which allow for a greater 
number of competing or alternative responses 
being brought above threshold and interfering 
with the desired response. 

The censure-sensitivity theories of schizo- 
phrenia assume that the preschizophrenic per- 
son is born with or develops a heightened 
sensitivity to criticism, which has greater ef- 
fects on his performance than on the perform- 
ance of normals, One version of this theory, 
Rodnick and Garmezy’s censure-deficit posi- 
tion (Garmezy & Rodnick, 1959; Rodnick & 
Garmezy, 1957), states that sensitivity to 
social censure leads to easy disruption of the 
schizophrenic’s performance. Although much 
of the earlier research appeared to support 
their contentions, later studies have cast 
doubt upon the theory’s validity (Cicchetti, 
Klein, Fontana, & Spohn, 1967; Fontana, 
Klein, & Cicchetti, 1967; Klein, Cicchetti, & 
Spohn, 1967). 

Based on the same assumption of a height- 
ened sensitivity to criticism in schizophrenia, 
Atkinson and Robinson (1961) predict a dif- 
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ferent effect of censure. They write, 


For schizophrenic subjects, unlike normal subjects, 
the motivation to avoid mild punishment is stronger 
than the motivation to achieve mild reward, so that 
in situations in which giving a correct response is 
the only way to avoid punishment, punishment has a 
facilitating effect on performance [p. 325]. 


They report the results of two paired-asso- 
ciate learning experiments where schizophre- 
nics improved with social censure. 

Much of the variability in results can be 
explained by the vast difference in patient 
populations, in that the schizophrenics in one 
sample are rarely comparable to the schizo- 
phrenics in another sample. Despite the crud- 
ity of the Kraepelinian nosological system, 
even these distinctions in diagnoses are often 
ignored. Within recent years many investiga- 
tors, if not diagnosticians, have abandoned 
the symptomatologically based Kraepelinian 
typology for a dichotomous “‘process-reactive” 
differentiation of schizophrenia based on the 
clinical history and probable prognosis of the 
patient. 

Process schizophrenia involves a long-term progressive 
deterioration of the adjustment pattern with little 
chance of recovery, while reactive schizophrenia indi- 
cates a good prognosis based on a history of generally 


adequate social development with notable stress 
precipitating the psychosis [Herron, 1962, p. 329]. 


When schizophrenic Ss are divided into 
process (or “poor premorbid”) and reactive 
(or “good premorbid”) groups, the findings 
appear to be somewhat more consistent. Both 
Bleke (1955) and True (1966) reported re- 
sults that show that normals and reactive 
schizophrenics are relatively insensitive to so- 
cial censure, but that both groups differ sig- 
nificantly from the process schizophrenic 
group that did more poorly with censure, in 
support of the Rodnick and Garmezy position. 

The purpose of the present study was two- 
fold: to examine some of the variables that 
may account for the discrepancies in the find- 
ings, such as task complexity and schizo- 
Phrenic diagnosis; and to establish conditions 
under which the drive and censure-sensitivity 
theories would predict divergent results in 
Order to test the relative power of each of 
these theories. 


607 


METHOD 


Design. The basic design for this study was a 3 
(Diagnosis) X 2 (Reinforcement) factorial, generat- 
ing six experimental groups: normals receiving praise 
(N-P), normals receiving social censure (N-C), pro- 
cess schizophrenics receiving praise (P-P) and censure 
(P-C), and reactive schizophrenics receiving praise 
(R-P) and censure (R-C). Each S in each of the 
cells had two learning tasks, one of low complexity 
(LC) and one of high complexity (HC). 

Subjects. In order to ascertain whether the HC 
task was in fact harder to learn than the LC task, 20 
white male neuropsychiatric patients hospitalized at 
Norristown State Hospital, Norristown, Pennsylvania, 
were run in a pilot study, 10 patients per task. 

The schizophrenic sample for the major study 
consisted of 60 white male neuropsychiatric patients, 
each with a primary diagnosis of schizophrenia. All 
of the patients were between the ages of 18 and 60 at 
the time of testing, had had at least a sixth-grade 
education, had no history of mental retardation or 
central nervous system pathology, were not currently 
alcoholic or addicted to drugs, and had not received 
insulin- or electro-shock therapy for at least 12 mo. 
prior to testing. The patients were divided into 
process and reactive groups on the basis of the 
Ullmann and Giovannoni (1964) Self-Report (S-R) 
Scale. All patients who scored between 0 and 12 were 
classified as process, and between 13 and 24 as 
reactive. 

The process and reactive groups did not differ 
significantly on mean age at first hospitalization. 
At the time of testing, however, the process group 
had spent an average of 1140 yr. in mental hospitals 
(not necessarily consecutive) and the reactive group 
had been institutionalized for an average of 5.47 yr. 
This difference was statistically significant at the 
.003 level (F = 9.879, df = 1/56). 

The normal sample consisted of 30 white males 
between the ages of 18 and 60. All normal Ss had 
had at least a sixth-grade education, had no history 
of psychiatric hospital admissions, and were em- 
ployees of Norristown State Hospital. There were 
no differences in the amount of education among the 
various diagnostic groups, but there was a significant 
age difference among the groups (F = 24.998, df = 
2/84, p<.0001). A Scheffé test (Edwards, 1960) 
showed that the normals were significantly younger 
than the two schizophrenic groups, but that the 
process and reactive schizophrenic groups did not 
differ. 

Stimuli. A verbal discrimination paradigm was 
used, in which a stimulus word and four response 
words were presented to Ss. In each case, two of the 
response words were “correct” and two “incorrect,” 
although S’s task was to learn any one of the correct 
responses, The stimulus word and the response words 
were taken from Palermo and J enkins’ (1964) norms, 
and all had A or AA ratings in Thorndike and 
Lorge’s (1944) frequency count, In the LC task, the 
correct words had higher associative strengths to the 
stimulus word than did the incorrect words, while 
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DRIVE 
CARS 
FAST 


TRUCKS 


Fic. 1, Sample stimulus-response set shown to S. 


the low associative Strength words were correct in 
the HC task. That is, task complexity in this experi- 
ment was a function of the number of competing 
responses, In the LC task, the most dominant re- 
Sponses were correct, while in the HC task the most 
dominant responses were incorrect and competed 
strongly with the less dominant, correct responses 
that S had to learn. 

For each stimulus-response set, the stimulus word 
was printed in 4-in. capital letters on the left side 
of a sheet of 83 X11 in. White paper held hori- 
zontally, and the four response words were printed in 
a columnar fashion on the right side, The ordering 
of the four words was randomized for each of the 
sets, Thirty-five millimeter black and white slides 
were taken of each set. The 10 LC sets were ran- 
domized seven times and placed in a Kodak Carousel 
slide holder with a completely opaque slide separating 
each trial of 10 stimulus-response sets. Thus, the 
order of sets on the eighth and fifteenth trials was 
identical to the order on the first trial, the ninth 
and sixteenth orders were identical with the Second, 
and so on. Separate carousels were made for the 10 
LC and 10 HC sets, 

Procedure. For the pilot study, each S received only 
one of the tasks, and all Ss were tested under the 
praise condition. The Ss were alternately assigned to 
the LC and HC conditions in order of their appear- 
ance, 

All Ss in the major study were seen individually 
in a private office. At the beginning of the first testing 
session, each schizophrenic S was given the process- 
reactive S-R Scale. Since many of the patients mis- 
interpreted some of the items when the scale was 
self-administered, the items were read aloud to all 
Ss by the experimenter (E). For each diagnostic 
category, Ss were alternately assigned to the praise 


Ss are given in parentheses) : 


I'm going to show you a series of slides on the wall. 
Each slide will look like this, (S was shown a 3 x 5 
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index card as in Figure 1.) There will be a single 
word on the left side and four words on the right 
side. Your job will be to learn which one of the 
four words goes with the word on the left. Here's 
how we'll do it. As each slide is presented, say the 
word on the left aloud, in this case, “cars,” and 
then tell me which one of the four words you think 
is right. If you’ve chosen the correct word, let's say 
“drive,” I will say "right" (I won't say anything), 
If you haven't chosen the correct word, I won't 
say anything (I will say “no” or “wrong”), If 
you chose the wrong word, don't try another word 
just yet. There will be ten such sets of words, then 
a brief pause, and then we'll go through the same 
set of words, but in a different order. If you chose 
the right word the first time, stick with it; that is, 
say "cars-drive" again, but if you didn't get the 
word right the first time, try a diíferent word. 
We'll go through the list a number of times until 
you've gotten all the words correct three times in a 
row. Any questions? 


Because it was expected that all groups would 
learn the LC task faster than the HC task, the 
order of task presentation remained constant for all 
Ss. In this experiment, counterbalancing would most 
probably have led to greater intragroup variability, 
in that half of the Ss would start with a task that 
they solve relatively rapidly and half with a task that 
takes longer to learn, Consequently, some Ss would 
begin the second task with a history of relatively 
rapid success and some with a history of relatively 
slower learning and possibly more frustration. 
Furthermore, a fixed order of presentation would 
favor an equivalent amount of transfer from Task 1 
to Task 2 for all Ss. 

For the LC task, the internal timer in the pro- 
jector was set for a 15-sec, display time. The HC task 
was given at least 24 hr. after the LC task to 
minimize fatigue factors, The response contingencies 
remained the same for each S, but the projector was 
set for an 8-sec. exposure, Each S was reminded of 
the task and of the Tesponse contingencies. 

For both tasks, Ss were run to a criterion of three 
Consecutive errorless trials, All Ss who did not reach 


criterion within 30 trials were eliminated from the 
study, 


RESULTS 


To determine if the HC task was in fact 
harder than the LC task, an analysis of vari- 
ance was performed on the number of correct 
responses on the first trial of the major study 
for both tasks, The analysis showed that sig- 
nificantly more correct words were chosen as 
first responses on the LC task than on the 
HC task (F = 37.44, dj = 1 /84, p < ,0001).? 


a The pilot study showed it took twice as many 
trials to learn the HC task (X= 15,6) than the LC 
task (X —71) (t= 3.29, df = 18, p < 001). 
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Task COMPLEXITY, VERBAL EVALUATION, AND LEARNING 


For the major study, a Lindquist Type IIT 
analysis of variance (Lindquist, 1953) was 
performed on the number of trials to criterion, 
in which task was the repeated measure. The 
analysis yielded significant main effects of 
diagnosis (F = 11.002, df = 1/84, p = .0002), 
reinforcement (F = 4.842, df=2/84, p= 
029), and task (F = 16.317, df = 1/84, p= 
0003). A Scheffé test revealed that the sig- 
nificant diagnosis effect was caused by the 
superiority in learning speed of the normal 
group, which differed significantly from the 
two schizophrenic groups, while the two schiz- 
ophrenic groups did not differ from each 
other, 

The significant reinforcement effect showed 
that fewer trials were needed to reach criterion 
under the censure condition (X — 8.58) than 
under the praise condition (X — 9.99), while 
the task effect reflected fewer trials necessary 
to learn the LC task (X — 8.47) than the 
HC task (X — 10.10). The Task X Reinforce- 
ment interaction approached significance (F 
—2.997, dj = 1/84, p=.083), reflecting a 
tendency for Ss in the praise condition to 
Show more of a performance decrement from 
the LC to the HC task than Ss in the censure 
condition. Analysis of each task individually 
Showed that there was no effect in the HC 
task. Thus, on relatively simple tasks, the 
Speed of learning was independent of the 
type of reinforcement, but, under more dif- 
ficult conditions, Ss learned faster with cen- 
Sure than with praise. 

Although the main analysis showed that 
the Reinforcement X Diagnosis interaction 
did not approach an accepted level of signifi- 
cance (F = 1.666, df — 2/84, p= .193), ex- 
amination of Figure 2 indicates that there is 
a tendency for a differential effect of rein- 
forcement on diagnostic category. Combining 
both schizophrenic groups yielded a stronger 
interaction (F = 2.90, df= 1/86, p=.09). 
Thus, while normals perform equally well 
under conditions of praise and censure, there 
is a tendency for schizophrenics to learn bet- 
ter when censured for wrong responses than 
when praised for correct ones. 

The Task X Diagnosis interaction, pre- 
dicted by Mednick’s theory, was neither ap- 
Preciable nor statistically significant (F = 
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Frc. 2, Number of trials to criterion as a function of 
reinforcement for the three diagnostic groups. 


0.007, df = 2/84, p = .994), indicating that 
the performance decrement from the LC to 
the HC task was about equal for all diagnostic 
groups; that is, there was no differential ef- 
fect of task complexity on the performance 
of the different groups. 

Since it is possible that patients’ responsiv- 
ity to social criticism becomes flattened and 
dulled after extended hospitalization, the 
analysis was repeated using long-term versus 
short-term hospitalization rather than the 
process-reactive dichotomy. For both the 
praise and censure groups, the patients were 
divided into groups by a median split (Mdn 
= 5 yr. of hospitalization), irrespective of the 
patients’ process-reactive scores. In the short- 
term group, there were 19 reactive and 11 pro- 
cess schizophrenics, and 19 process and 11 re- 
active schizophrenics in the long-term group. 
The results of this analysis were directly com- 
parable with those of the previous analysis, 
which used process-reactive as a dimension. 
Despite the fact that there was not a complete 
overlap between the reactive and short-term 
groups, or between the process and long-term 
groups, long-term schizophrenics performed in 
a manner similar to process schizophrenics, 
and short-term schizophrenics performed simi- 
larly to reactive schizophrenics. 


Discussion 


The findings of this study lend partial sup- 
port to the censure-sensitivity theories that 
predict better performance by schizophrenics 
under conditions of censure than of praise. 
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The results did not, however, support the 
drive interpretation of schizophrenic perform- 
ance deficit. 

While the interaction between reinforce- 
ment and diagnosis approached significance 
only when the schizophrenic groups were 
combined and compared with normals, this 
tendency was in the direction predicted by the 
Atkinson and Robinson version of censure- 
sensitivity theory. Moreover, there was a 
strong tendency for the differential advantage 
of censure to be more pronounced on harder 
tasks than on easier one. At the same time, 
normals did not respond differentially to 
praise or censure, regardless of the complex- 
ity of the task involved. These findings, 
therefore, are contrary to the interaction pre- 
dicted by Garmezy and Rodnick, who state 
that schizophrenics’ performance should de- 
teriorate under conditions of response-contin- 
gent social censure. 

Although the absence of "right" in the 
praise or R-N condition is functionally equiva- 
lent to and conveys the same message as say- 
ing “no” or “wrong” in the censure or N-W 
condition, it appears that, for schizophrenics, 
the actual word “wrong” or “no” has a 
greater psychological effect than the “wrong” 
that is implied by silence in the praise condi- 
tion. Thus, while the normal person may 
respond largely to the information implied by 
the words, the Schizophrenic may have been 
responding more to their evaluative connota- 
tions. 

This finding, though, is open to two inter- 
pretations. The first, and that taken by 
Atkinson and Robinson, would state that the 
schizophrenics’ performance improved with re- 
sponse-contingent censure, and that praise 
had no effect or may have had a relatively 
mild detrimental effect, This presupposes, 
however, that there is an initial deficit in the 
performance of schizophrenics that was par- 
tially overcome by their motivation to avoid 
mild censure. The alternative explanation, 
which would also be congruent with the re- 
sults shown in Figure 2, is that there is no 
initial deficit in the schizophrenics’ perform- 
ance, and that their speed of learning is equiv- 
alent to that of normals. Praise, however, 
may not serve as an effective reinforcer for 
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schizophrenics, and they show slower learn- 
ing under this condition. 

If the performance of the normal group is 
taken as a base line, then the data reported 
here favor the second interpretation. However, 
if the schizophrenics’ performance under con- 
ditions of praise is used as a basal level, 
then the data support the first interpretation. 
The difficulty resides in the choice of an 
adequate base with which schizophrenics’ per- 
formance under different conditions may be 
compared. 

The failure to support Mednick’s theory 
raises the question of the adequacy of the 
study to test his hypothesis. Of all the differ- 
ences between normals and schizophrenics, 
perhaps none is as crucial as the iatrogenic 


difference that, for the most part, schizo- . 
phrenics receive psychoactive tranquilizers 


and mood elevators, and normals do not. 
Moreover, if the assumption is made that the 
dosage of tranquilizing agents is proportional 
to the schizophrenic's state of overt agitation 
and symptomatology, then the schizophrenics 


with the greatest amount of drive also receive | 


the largest amounts of tranquilizers. The re- 
sult would be to decrease the drive-level dif- 
ferences among schizophrenics and bring the 
drive level of scrizophrenics down to a level 
more nearly approximating that of normals. 
Consequently, schizophrenics would perform 
in much the same way as normals, except that 
they would be more susceptible to associative 
interference due to the tranquilizing effects of 
the drugs. In effect, therefore, they would 
respond to LC and HC tasks as do normals, 
but show an overall decrease in performance, 
as was demonstrated here. 

The lack of any difference in performance 
between process and reactive schizophrenics 
may also be a function of the medication. 
Most of the studies that have reported signifi- 
cant differences between process and reactive 
schizophrenics were carried out in the early or 
mid 1950s, whereas the Major tranquilizers 
Were introduced into hospitals in significant 
quantities in the late 1950s and early 1960s. 
Thus, while the two groups still differ in 
terms of their premorbid histories, the drugs 
may have artificially eliminated concomitant 
differences in cognitive functionings, such as 


E 
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those due to flight of ideas. Unfortunately, due 
to the wide diversity of medication, it was 
impossible to evaluate the effect of the drugs 
on performance. 

While the results of this study tend to 
support the contention that schizophrenics’ 
reactions to praise and censure differ from 
normals’ reaction, and do not support the 
drive theory of schizophrenia, a cautious inter- 
pretation of the data is necessary. 
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THE MMPI AND RECIDIVISM 


JAMES L. 
University of 


MMPIs of 80 recidivists and 68 parole 


MACK? 


Minnesota 


successes from a state training school 


were compared by analyzing mean scale scores, elevations, code types, and mean 
ranks of the clinical scales. Recidivists were expected to be characterized by 


elevations and code types reflecting the relative im 
Ma scales and the nonrecidivists by the Hs, D, Hy, 


portance of the F, Pd, and 
and Si scales. No important 


differences were identified. Conclusions are: (a) the MMPI used alone is not 


useful in identifying recidivists in relativel 
tions; (b) its use should be restricted to 


between groups; and (c) the combined 
MMPI to identify recidivists needs study. 


A number of investigators have attempted 
to distinguish recidivists from individuals who 
successfully completed parole from a correc- 
tional institution by means of the Minnesota 
Multiphasic Personality Inventory (MMPI). 
Negative results have been reported with 
single predictor scales (Freeman & Mason, 
1952; Gough, Wenk, & Rozynker, 1965; 
Mandel & Barron, 1966; Wattron, 1963); 
correlational studies (Clark, 1948); mean pro- 
file comparisons (Panton, 1962); blind sort- 
ing by clinical judges (Mandel & Barron, 
1966); and comparison of Proportions of ab- 
normal profiles (Clark, 1948). Investigators 
using large Ns have obtained significant, 
though small, group differences on the F, Hs, 
D, Hy, Pd, Ma, and Si scales (Gough et al., 
1965; Wirt, 1967), leading Wirt to charac- 
terize recidivists as more sociable and psycho- 
pathic and less neurotic than parole suc- 
cesses. Most previous investigators used a 
single approach in analyzing MMPI data, 
and none attempted to evaluate configural 
aspects of individual profiles, 

The present investigation represents an at- 
tempt to provide conclusive results concerning 
the effectiveness of the MMPI in distinguish- 
ing recidivists from nonrecidivists by compar- 


1 This article is based on a doctoral dissertation 
submitted to the University of Minnesota, 1967, and 
was prepared with the aid of Robert Wirt, adviser, 
and James Butcher and Peter Briggs, who served on 
the examining committee, 

Requests for reprints should be sent to James L. 
Mack, Norristown State Hospital, Norristown, Penn- 
sylvania 19401. 
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ly homogeneous delinquent popula- 
the exploration of small differences 
use of historical information and the 


ing (a) mean scale scores, (5) the frequency 
of abnormal profiles, (c) the proportions of 
various code types within each group, and 
(d) the mean ranks of each scale. In general 
it is predicted that the recidivists will be dis- 
tinguished by the relative importance of the 
F, Pd, and Ma scales and the nonrecidivists 
by the Hs, D, Hy, and Si scales. 


Merxop 


Subjects. All boys from Hennepin and Ramsey 
Counties, Minnesota (the primary metropolitan area 
of the state), who were paroled from the State 
Training School at Red Wing between January 1, 
1966, and July 31, 1966, were included in the study. 
Of the 159 Ss, 82 were recidivists, 69 nonrecidivists, 
and 8 remained unclassified. There was no difference 
between the groups with respect to age (M —169 
Yr, range — 14-19), IQ, and length of time institu- 
tionalized (M = 5.2 mo., range — 1-13). The majority 
of Ss were of normal intelligence, only 2 falling 
below an IQ of 70. The nonrecidivists were on parole 
for an average of 9.1 mo. (range — 4-16), while the 
recidivists had been on parole for an average of 4.2 
mo. (range — 1-12). i 
Procedure. Previous to his placement at the train- 
ing school, each S was administered the MMPI as 
part of a routine battery of tests at a reception 
center. Testing was thus carried out more than 5 
mo. prior to the parole of all Ss and, in some cases, 
more than 12 mo. earlier. Because of the possible 
effects of parole or institutional adjustment on test 
performance, no attempt was made to obtain more 
recent MMPI results. The decision of whether or 
not S had failed on parole was made by the Youth 
Commission of the Minnesota Department of Cor- 
rections upon the recommendation of the agent 
Supervising the S on parole, The status of eight Ss 


could not be clearly ascertained, and they were left 
unclassified, 


MMPI ano RECIDIVISM 


RESULTS 


There were no invalidating K or L scores 

(T scores > 75). High F scores in themselves 
were not considered invalidating in order to 
retain valid clinical information (Morrice, 
1957). Three Ss produced profiles suggesting 
inability to understand test items and were 
omitted, leaving 80 recidivists and 68 non- 
. recidivists. 
When the groups were compared on the 
| basis of their mean T scores on the 3 validity 
and 10 clinical scales, there were no signifi- 
cant differences at even the .10 level. These 
data are presented in Table 1. At the .10 
level of significance the probability that the 
research design would detect a difference as 
small as 5 T scores was .94. For a difference 
of 2.5 T scores, however, the probability of 
detection was only .42, indicating that the 
sample size was not sufficient to allow the 
identification of small group differences with 
any degree of confidence. 

All profiles were then coded according to 
the Welsh system (Welsh, 1948). The propor- 
tions of individual code types within each of 
the groups were compared in regard to two 
point codes exclusive of elevation, codes of 
all T scores greater than 69, and total codes. 
All codes were heavily weighted with eleva- 
tions on the Pd scale. The recidivists had the 


TABLE 1 
MMPI Mean T Scores 
Group 
Scale | Nonrecidivists Recidivists* z 
M SD M SD 
| 

L 48.60 | 680 | 4911 | 731 44 
E 61.12 | 842 | 61.06 | 924 | 04 
K 5207 | 856 | 53.16 | 8.18 78 
Hs 55.53 | 10.73 | 5508 | 916 | .27 
D 63.12 | 12.99 | 60.90 | 11.93 | 1.08 
Hy | 59.32 | 897 | 5728 | 7.27 | 1.50 
Pd 7544 | 865 | 7448 | 9.05 | .66 
Mf | 5337 | 8.81 | 5405 | 880 | .47 
Pa 62.35 | 947 | 6143 | 927 | .59 
Pt 66.97 | 10.20 | 65.10 | 10.43 | 1.10 
Sc 66.94 | 11.76 | 68.24 | 13.34 | -63 
Ma | 6546 | 11.70 | 63.50 | 1131 | 1.08 
Si 53.40 | 10.05 | 5239 | 878 | .64 


ù N for the Pa scale was 79. 
ddr for the Hy scale were significantly different 
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TABLE 2 
Mean Ranks or MMPI Scares 


PPP PUN OU 
Dawa PUY 
Saline carob asa 
[I Sas Go im ta ta tn bo 


Pd scale as a high point in 41 of 80 profiles 
and the nonrecidivists in 33 of 68 profiles. 
The groups were quite similar in almost all 
comparisons. The recidivists did show a 
greater proportion of profiles with Pd and 
nothing else greater than 69 (p = .036). The 
nonrecidivists had a greater proportion of ele- 
vations greater than 69 on the Hy scale (p 
= .036). 

To further evaluate profile configuration 
the mean rank of each scale was calculated 
for each group and comparisons made. No sig- 
nificant differences were observed, although 
the recidivists showed a slight tendency to 
have the Sc scale more highly ranked within 
their profiles (p < .13). Again, the predomi- 
nant influence of the Pd scale was evident. 


Discussion 


Previous studies have been criticized for 
failing to identify all recidivists. All non- 
recidivists in the present study had been on 
parole for longer than 6 mo., and statistics of 
the Minnesota Department of Corrections ? 
indicate that 80% of those who eventually 
violate their parole do so within the first 6 
mo, of parole. Furthermore, the proportion 


2 W. V. Sittler, R. L. Webb, and N. G. Mandel. “A 
follow-up study of parole violation of individuals 
released from Minnesota correctional facilities from 
January 1, 1964 through December 31, 1964.” Unpub- 
lished manuscript, Minnesota Department of Correc- 
tions, St. Paul, 1966. 
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of parole violators in the present sample, 
49.1%, compares to an overall rate of 48.7% 
from the previous year’s releases from the 
training school, lending weight to the assump- 
tion that Ss were correctly classified. 

Although the N was not sufficiently large 
to reveal small group differences, attempting 
to enlarge the sample by taking parolees re- 
leased over a lónger time period, from a wider 
geographical area, or from other institutions 
would have introduced unwanted sources of 
variance. The results of the present investiga- 
tion strongly suggest that the MMPI, con- 
sidered alone, is not associated with reci- 
divism to any important extent within such 
homogeneous populations as the present one, 
although the existence of a minimal associa- 
tion, of doubtful predictive ultility, cannot be 
ruled out. That two comparisons of more 
than 50 showed differences significant at the 
-05 level can scarcely be considered telling 
evidence for a relationship between the 
MMPI and recidivism. Both groups in the 
present study showed a considerable degree 
of personality disturbance, but they did not 
appear particularly different with regard to 
their personality adjustment. Further research 
with the MMPI and recidivism might well 
be restricted to samples of very large size 
where the purpose is to demonstrate very 
small group differences. 

Perhaps the most serious criticism that can 
be leveled at the present study is that no 
attempt was made to control for the effects 
of the environmental conditions into which 
the parolee was placed upon release, Nearly 
all were returned to their parental homes, the 
character of which may have been instrumen- 
tal in influencing the parole adjustment of 
the Ss. The evaluation of MMPI results in 
combination with historical information con- 
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cerning the character of the parental home 
has been carried out in several studies of de- 
linquency (Briggs, Wirt, & Johnson, 1961; 
Wirt & Briggs, 1959). Applications of these 
techniques may prove to be the most fruitful 
approach in future studies of recidivism. 
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RELATIONSHIP BETWEEN 


PERCEPTUAL DEPRIVATION 


TOLERANCE AND ADEQUACY OF DEFENSES AS 
MEASURED BY THE RORSCHACH* 


NANCY A. WRIGHT 


University 


AND JOHN P. ZUBEK ? 


of Manitoba 


The Buhler-Lefever standardization of the Rorschach was administered, pre- 
experimentally, to 18 Ss who successfully endured 1 wk. of perceptual depriva- 


tion and to 13 Ss, the quitters, whi 
Control based on the Effectiveness of 


o were unable to do so. An Index of 
Defense in controlling Defense Demand 


was derived for each S. The results revealed that the group with a high-control 


Index was composed largely of succe: 


ssful Ss whereas the group with a low- 


control Index consisted largely of isolation quitters. It was concluded that the 


Index of Control, derived from the 
predictor of isolation tolerance. 


It has long been known that not all volun- 
teers can endure prolonged periods of per- 
ceptual deprivation in an isolation chamber. 
Some can tolerate it for many days while oth- 
ers terminate the condition after only a few 
hours (Zubek, 1964). Despite the large in- 
dividual differences, no satisfactory predictive 
measures of deprivation tolerance have as yet 
been developed (see reviews of Myers, 1969; 
Schultz, 1965). Various paper-and-pencil tests 
of personality (e.g., Minnesota Multiphasic 
Personality Inventory, Edwards Personal 
Preference Schedule, and Thurstone's Tem- 
perament Schedule), together with certain 
perceptual measures, have been employed, but 
little success has been achieved in differentiat- 
ing between Ss who can and cannot tolerate 
prolonged isolation. Furthermore, any signifi- 
cant results that have emerged have frequently 
contradicted one another. 

Although paper-and-pencil tests of per- 
sonality have proved to be largely ineffective, 
Wright and Abbey (1965), in an exploratory 
study conducted at the Manitoba Isolation 
Laboratory, have suggested that the predic- 
tion of perceptual deprivation tolerance could 

' be improved greatly by examining the Ror- 
Schach manifestations of an individual's de- 
fense and control mechanisms and subse- 


l' This research was supported by the Defense 
Research Board, Canada (No. 9425-08) and by the 
National Research Council, Canada (APA-290). 

? Requests for reprints. should be sent to John P. 
Zubek, Department of Psychology, University of 
Manitoba, Winnipeg, Canada. 


Rorschach, may serve as an excellent 


' quently deriving his Index of Control. This 
Index appeared to be a reliable and valid 
means of determining whether or not drive- 
dominated material emerging during isolation 
could be controlled and, hence, whether or not 
S could endure isolation for the prescribed 
period of 1 wk, 

There was some support for this idea in the 
literature at that time (Christy & Rasmussen, 
1963; Nardini, Herrman, & Rasmussen, 
1962). In a more recent article (Kracke, 
1967), Erika Fromm is quoted as stating that 
the isolation situation created in some of the 
experiments is one that is as close as an 
experimental situation can get to the intra- 
uterine situation. Why, she asks, is it not 
sheer bliss? Kracke concludes that for many 
Ss the resulting stress is due to the ascendancy 
of primary process material in Ss for whom 
this is ego threatening. 

The original findings of Wright and Abbey 
(1965) were considered to be tentative since 
the 21 male Ss were tested several months 
after completion of perceptual deprivation 
and thus their responses may have been af- 
fected by their feelings of success or failure 
in the situation. It was therefore decided to 
replicate the study in the same laboratory, 
under the same conditions of perceptual depri- 
vation, employing the same duration (1 wk.), 
but instituting prior testing procedures. It 
was again hypothesized that the Rorschach 
records of successful Ss (those completing 1 
wk. of isolation) would be characterized by 
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TABLE 1 


Division or SUCCESSFUL AND UNSUCCESSFUL ISOLA- 
TION SUBJECTS ON THE Basis OF CONTROL OF 
DRIVE-DOMINATED RESPONSES DERIVED 
FROM THE RORSCHACH 


Index of Control 
Type of Ss 
High Low | Total 
Successful 14 4 18 
Unsuccessful (quitters) 1 12 13 
Total 15 16 31 


drive-dominated material that was not threat- 
ening and was well controlled and that the 
Rorschach records of unsuccessful Ss, or 
quitters, would be characterized by lack of 
control so that the drive-dominated material 
would become conscious and cause mobiliza- 
tion of anxiety and maladaptive and patho- 
logical defenses. 


METHOD 


A group of 31 male university students were placed 
individually in an air-conditioned isolation chamber 
for a prescribed period of 1 wk. Of this group, 18 
successfully endured the week, while 13, the unsuc- 
cessful Ss or quitters, were unable to do so (M 
duration = 56 hr.). All Ss wore translucent goggles, 
which permitted diffuse light but eliminated pattern 
vision, and a set of earmuffs through which white 
noise was constantly presented somewhat above the 
threshold of hearing, Further details of the isolation 
procedure are available in an earlier publication 
(Zubek, Aftanas, Hasek, Sansom, Schludermann, Wil- 
gosh, & Winocur, 1962). 

As part of the routine personality testing that pre- 
ceded isolation, the senior author administered the 
Buhler-Lefever (Buhler, Buhler, & Lefever, 1954) 
standardization of the Rorschach. This procedure di- 
rects S to give from three to five responses per card 
and it encourages him to elaborate in the inquiry. 
The number of responses and the amount of material 
are therefore roughly equivalent from S to S. The 
Rorschach protocols were then analyzed 2 yr, after 
their preexperimental administration, a procedure 
that ensured that the tester had no knowledge of 
which S endured or failed to endure the week of 
perceptual deprivation. As in the earlier exploratory 
study, the protocols were scored objectively accord- 
ing to the Holt-Havel (Holt & Havel, 1961) method 
for assessing primary and secondary process in the 


3See also R. R. Holt, Manual for the scoring of 
primary process manifestations in Rorschach re- 
sponses. Research Center for Mental Health, New 
York University. Unpublished draft of an experi- 
mental procedure, 1963. 
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Rorschach and the defense and control mechanisms 
that these demand. The proportion of the Effective- 
ness of Defense (DE) score to the Defense Demand 
(DD) score became the Index of Control (Wright & 
Abbey, 1965). 


RESULTS 


The 31 Ss were rank ordered according to 
their Index of Control. They were then split 
at the median, into a high-control group 
(Index range = .47—70; n= 15) and a low- 
control group (Index range = .78-1.50; n= 
16). Table 1 shows the number of successful 
and unsuccessful Ss in each group. It can be 
seen that in the high-control group, 14 of the 
15 Ss successfully completed the 1-wk. isola- 
tion period whereas in the low-control group, 
only 4 of the 16 Ss were able to do so. Since 
the expected frequency was n « 5 and n< 
20, the median test for y? was used (Siegel, 
1956). This analysis demonstrated that the 
effectiveness of S's control and defense mecha- 
nisms was positively related to success or 
failure in isolation (y? — 12.17, df —1, p 
< 01). 


Discussion 


Since the Index of Control in both this 
study and in the earlier exploratory report 
(Wright & Abbey, 1965) has clearly differen- 
tiated between the two types of isolation $s, 
it appears to be a useful measure for pre- 
dicting whether or not an individual’s defense 
and control mechanisms will be effective in 
controlling drive-dominated responses that 
might emerge during his attempt to success- 
fully complete a prolonged period of percep- 
tual deprivation. Thus, at long last a satisfac- 
tory method of predicting isolation tolerance, 
by means of prior psychological testing pro- 
cedures, seems to have been developed. This 
is a long-awaited finding and it is hoped that 
this method will be used by investigators 1n 
other laboratories in an attempt to further 
replicate these findings and develop norma- 
tive data. Furthermore, this method also may 
Prove useful in research on other types of 
Stressful conditions in which the objective 1s 
to differentiate between Ss who may or may 
not be able to endure them, 

It is interesting that in the present group 
of 31 Ss, the Index of Control scores tended 
to be better, that is, higher, than those in the 
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original group of 21 Ss who were tested fol- 
lowing their isolation experience. It is as if 
the defense and control mechanisms were on 
the alert immediately prior to entering such 
a stressful situation, and the efforts at control 
were therefore more successful, The scores of 
the low-control group in the original study 
also may have been depressed further by the 
feeling of failure that clouded the testing 
situation. 

Finally, these findings have some implica- 
tions for those investigators concerned with 
the beneficial or therapeutic effects of isola- 
tion. For example, Brownfield (1965), Shur- 
ley (1966), and Adams, Robertson, and 
Cooper (1966) have emphasized the impor- 
tance of determining which personality char- 
acteristics are essential for a relaxed and 
pleasant adaptation to isolation. If, in fact, 
the control of emergent drive-dominated ma- 
terial is the crucial characteristic, which the 
present study suggests that it may be, such 
investigators should employ a prior Ror- 
schach testing procedure using the Holt-Havel 
scoring method and the Wright-Abbey Index 
of Control. 
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THREE FACES OF EVE REVISITED: 
A STUDY OF TRANSIENT MICROSTRABISMUS 


WILLIAM S. CONDON, WILLIAM D. OGSTON, Ax» LARRY V. PACOE 
Western Psychiatric Institute and Clinic, University of Pittsburgh Medical School 


A film of a patient with multiple personality (the original Thigpen and Cleckley’s 
patient, in a film called “Three Faces of Eve") was examined frame by frame. 
Each of the three personalities of the patient emerged in this film. A micro- 
transient strabismus was detected and described, and its occurrence in relation 
to each of the three personalities analyzed. Eve Black, the least stable personality, 
exhibited five times as many of these strabismic occurrences as the other two 
personalities. A somewhat similar strabismus has also been observed in some 
films of schizophrenic patients. This strabismus has not been observed thus far 


in films of normal behavior. 


Thigpen and Cleckley (1954) fist re- 
ported a case of multiple personality about 
which they later wrote a book entitled The 
Three Faces of Eve, which was produced in 
Hollywood, dramatizing the original case ma- 
terial. Thigpen and Cleckley themselves made 
a 30-min. sound film of the actual patient in 
her three different personalities and at stages 
of her therapy. This article is based on find- 
ings derived from this latter film,? using a 
method of film analysis described more fully 
elsewhere (Condon & Ogston, 1967). 

In actuality, four personalities were re- 
ported by the authors, since one of the three, 
the final personality, Jane, displayed two as- 
pects. The initial presenting personality was 
Eve White, a demure, shy, and well-man- 
nered woman, although sadness and tension 
could be seen behind her reserve, Eve Black, 
the first of the multiple personalities to 
emerge, was a mischievous hoyden. Jane, who 
revealed two aspects (Jane I and Jane II), 
seemed to be a mature and capable person. 
She emerged late in therapy and ultimately 
became the dominant personality (Thigpen & 
Cleckley, 1957). 


METHOD 


The overall method of film analysis was essen- 
tially ethological in nature. The film was viewed over 
and over in the search for possible regularities. The 


1 Requests for reprints should be sent to William 
S. Condon, Western Psychiatric Institute and Clinic, 
3811 O'Hara Street, Pittsburgh, Pennsylvania 15213. 

2 Obtained through the Psychological Cinema Reg- 
ister, Pennsylvania State University, University Park, 
Pennsylvania. 


transformations of expressive behavior, especially 
facial, among the three personalities Eve White, Eve 
Black, and Jane, as described by Thigpen and Cleck- 
ley (1957), were clearly evident on a frame-by-frame 
analysis of the film. This analysis also revealed 
transient asymmetries of the face, particularly a 
transient yet rapid and pronounced strabismus that 
was not observable at normal projection speeds. 

To facilitate analysis of the film, each frame was 
consecutively numbered from the beginning in order 
to identify, by means of frame numbers, the loca- 
tion and duration of the derangement of conjugate 
eye movements. The entire film was scanned by 
means of a 16-mm. time-and-motion-study projector 
in which the film transport is operated manually. 
A frame or series of frames may be repeatedly 
viewed or contrasted with another frame or series of 
frames. Thus the frame at which one eye begins to 
deviate from the axis of the other eye could be 
noted precisely. 


Strabismus 


The phenomenon that is herein referred to as 
“strabismus” occurred as follows. The strabismus 
Occurred over a very short period of time; $ of a 
second or less. As an example, the right eye might 
move to the right for three frames (3 of a second) 
while the left eye remained still The ensuing di- 
vergent state is not marked. The movements into 
and out of divergence were not detectable at normal 
projection speed. When examined frame by frame, 
three characteristic types of divergent eye movement 
emerged. 


1. One eye would move horizontally to the left or 
tight while the other remained still. The eye might 
then shortly move back to its original position Or 
remain in the divergent state. In several instances, for 
example, the eye would move right for one frame 
(x of a second) and then move back left again 
on the next frame. This gave the appearance of à 
rapid “bobble” of the eye. ^ 

2. Both eyes diverged. This occurs quite rapidly, 
often at vx of a second. For example, the right ey? 
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will move right and the left eye will move left simul- 
taneously at the same 7x of a second, and only at 
that =x of a second. 

3. While both eyes are moving in the same direc- 
tion, one eye will move markedly faster than the 
other eye. In normal behavior the eyes appear to 
move together at approximately the same speed. 
This category of divergence is less precise, however, 
than the preceding two. 

There are two aspects of the strabismus that might 
be detailed: (a) the actual divergent movements of 
the eyes and (b) the sustained states that result from 
these diverging movements. 

In a word, the eyes move divergently from each 
other and may then sustain such states of divergence 
for varying periods of time. Since the sustained 
divergent states were difficult to detect accurately, 
the present article will deal primarily with the 
number of times strabismic movements occur with 
respect to the different personalities. Thus, the actual 
divergent movement of one eye in relation to the 
other, the going into or out of divergence, is 
presented rather than sustained divergent states. The 
subject’s face is often not directed exactly toward 
the camera, also rendering a reliable assessment of 
sustained deviance difficult or impossible. 


RESULTS 


In the beginning of the film it appears as 
if the patient is shown in her various per- 
sonalities as background while the therapist 
narrates the history of the case. Since no ex- 
plicit identification of the personalities oc- 
curred in this section it was not used as a 
source of data. This section comprised 6,400 
frames, and there were only three detectable 
instances of onset of strabismus. 

During the rest of the film, 37,300 frames 
(some of which comprised titles and lasting 
253 min., 73 instances of transient strabismus 
were clearly noted. Correlation of these phe- 
nomena with the three personalities Eve 
White, Eve Black and Jane appears in 
Table 1. 

There was also a suggestion of a left and 
right distribution to the divergent movements 
of the different personalities. Left-eye move- 
ments seem to predominate for Eve White; 
for example, it is the left eye that moves di- 
vergently while the right remains still. Con- 
versely, right-eye movements tend to predomi- 
nate for Eve Black, while a right-eye bobble 
seems related to Jane I. Table 2 is an anal- 
ysis of the types and frequency of strabismic 
movement for each of the personalities. 
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TABLE 1 


DISTRIBUTION OF STRABISMUS BETWEEN 
PERSONALITIES 


Frequency of Time each person- 


Personality strabismus ality Die in 

Eve White 11 7 min. 

Eve Black 56 9 min. 

Jane I 6 4 min. 

Jane II 0 23 min. 
Total 78 


Discussion 


There are no references in the literature to 
transient strabismus as described above. The 
rapidity and extent of such phenomena raise 
the issue of whether there had been any ref- 
erence to this patient’s eyes by Thigpen and 
Cleckley (1957). In their book, The Three 
Faces of Eve, they refer to the patient’s 
suffering from “blinding” headaches and from 
some of the many visual disturbances often 
reported in migraine. Referring to Eve Black, 
they observed that “her eyes dance in such 
quick spontaneous animation,” and later that 
“her eyes dance with mischief.” 

Another famous case of multiple person- 
ality, Miss Beauchamp, a patient of Morton 
Prince (1906), was described as having head- 
aches and visual disturbances along with 
many other symptoms. Sally Beauchamp, the 
personality almost identical to that of Eve 
Black, was described as having a peculiar 
form of anaesthesia in that, with her eyes 
closed, she felt nothing. Prince likens this 


TABLE 2 
FREQUENCY OF TypEs OF STRABISMUS 


Type 


Left eye moves and right eye 
is still 

Left eye moves faster than 
right eye 

Right eye moves and left eye 
is still 

Right eye moves faster than 
left eye 

Eyes diverge 

Eyes converge 

Right-eye bobble 


Total 
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anaesthesia to hysterical monocular amblyo- 
pia in which a subject cannot see with the 
blind eye if the other is closed, yet sight re- 
turns to the affected eye as soon as the 
opposite eye is opened. More significant is the 
statement by Sally that while Miss Beau- 
champ is concentrating her mind on central 
vision, she, Sally, need not pay attention to 
that, but can concentrate her own mind on 
peripheral vision and recognize things in the 
periphery. 

Anna O., the famous case of Freud's col- 
league, Breuer (Breuer & Freud, 1953), was 
described as being split into two personalities, 
one of which the patient called “her bad 
self." At the onset of her illness she showed 
the following visual disturbances: a conver- 
gent squint with diplopia; deviation of both 
eyes to the right so that when her hand 
reached out for something, it always went to 
the left of the object. 

Such clinical observations suggest the pos- 
sibility of a more than purely semantic re- 
lationship between the dissociation of a 
Personality and the dissociation of normal 
occulomotor parallelism. A somewhat similar, 
but less marked, dissociation of normal occu. 
lomotor parallelism has also been noted in 
several films of schizophrenic patients. This is 
a very tentative finding that will require fur- 
ther study. Strabismus of this form has not 
been observed thus far in any normal be- 
havior (over 100 films analyzed frame by 
frame), 
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The literature relating to occulomotor im- 
balance indicates that divergence of one eye 
from its parallel axis with the other eye is 
varied in its nature. Both systemic and local 
conditions may affect occular neuromuscular 
mechanisms, The occulomotor disturbance 
may be intermittent, as in the condition 
known as intermittent exotropia in which the 
two eyes appear for the most part to be well 
coordinated, yet one eye may suddenly turn 
out, often through a rather large angle of 
divergence. The patient is really not aware 
that the eye has turned out nor does he ex- 
perience double vision. The mechanism where- 
by coordinated binocular vision is interrupted 
is not known. 
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The hypothesis that relaxation instructions help the recall of verbal material 
by anxious Ss but hinder the recall by nonanxious Ss was tested. High- and 
low-anxious groups were selected on the basis of their performance on the 
Anxiety scale of the MMPI. The 121 Ss were presented either 10 easy or 10 
difficult paired associates under four conditions of relaxation: (a) no relaxation, 
(b) relaxation prior to acquisition, (c) relaxation prior to delayed recall, or 
(d) relaxation prior to both acquisition and delayed recall. The Ss were tested 
immediately following acquisition and 48 hr. later. Analysis of reciprocal 
latencies supports the hypothesis. Relaxation was associated with significantly 
better performance for Ss with high anxiety and poorer performance for Ss 


with low anxiety. 


The relationships between anxiety, relaxa- 
tion, and certain aspects of learning and per- 
formance are relevant topics for investigation 
because of the continuing interest in desensi- 
tization as a therapeutic technique (Paul, 
1966; Wolpe, 1958). Current clinical research 
has been largely directed toward examining 
the assumption that relaxation is a desirable 
antidote for a variety of problems involving 
anxiety, and little attention has been paid 
to other variables associated with the use of 
relaxation. Results have generally supported 
the use of desensitization as opposed to other 
methods of therapy, for example, test-anxious 
students report significant reductions of anx- 
iety following desensitization (Emery & 
Krumboltz, 1967; Garlington & Cotler, 
1968), 

From the point of view of behavior theory, 
there is no reason to expect all degrees of 
relaxation to be equally desirable or to expect 
overt behavioral changes to be the same for 
all Ss. Where simple approach-avoidance be- 
havior is involved, any reduction in specific 
anxiety should permit closer approach to the 
feared object. Where more complex behaviors 
are involved, such as with test taking, anxiety 
may be disruptive if it leads to avoidance or 
escape responses which interfere with per- 


This research was aided by a grant from the 
Sigma Xi RESA Research fund. 

2 Requests for reprints should be sent to James H. 
Straughan, Experimental Education Unit, University 
of Washington, Seattle, Washington 98105. 


formance or if it reduces the ability to make 
appropriate discriminations. In general, per- 
sons low in anxiety might do better on tasks 
such as test taking because they are less 
likely to have developed interfering behav- 
iors. Persons who do perform adequately 
(mostly those low in anxiety) should do 
worse following relaxation because their test- 
taking behavior is associated with a certain 
level of autonomic feedback. Changing the 
internal stimulation would reduce the likeli- 
hood that adequate test-taking behavior 
would occur. Persons who are severely anxious 
during examinations should be better when 
relaxed because interfering behaviors and ex- 
cessive stimulation from autonomic feedback 
are reduced, 

Some confirmation is offered for the pre- 
ceding analysis by research showing that it 
is useful to distinguish between facilitating 
and debilitating test anxiety (Alpert & Haber, 
1960). An example of an item on the facili- 
tating scale is: “Nervousness while taking a 
test helps me do better.” An item from the 
debilitating scale is: “Nervousness while tak- 
ing an exam or test hinders me from doing 
well.” The authors interpret the results to 
mean that whether anxiety is facilitating or 
debilitating depends on the kinds of be- 
haviors associated with feedback from anxiety 
reactions. 

Recognizing a distinction between facilitat- 
ing and debilitating test anxiety would help 
account for the finding that desensitization 
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TABLE 1 


MEAN RECIPROCAL LATENCIES, SECONDS, FOR 
Correct RESPONSES ON First TEST TRIAL 


Hard task Easy task 
Group 
Norelax| Relax | Norelax| Relax 
High anxiety .197 .251 -196 819 
Low anxiety .290 .238 .878 835 


does little to improve test-taking effectiveness, 
although it does reduce anxiety. Not all anx- 
ious students will be helped by a reduction 
in their anxiety. Neither Garlington and 
Cotler (1968) nor Emery and Krumboltz 
(1967) found a significant improvement in 
grades following desensitization. Garlington 
and Cotler pointed out that many variables 
are involved besides test anxiety and give an 
example of two Ss who lost motivation for 
study following desensitization. Paul and 
Eriksen (1964) found that test anxiety was 
associated with lower scores during examina- 
tions but that reducing the anxiety did not 
raise scores, Paul and Shannon (1966) did 
find significant differences between experi- 
mental and control groups, but the differences 
seem to have been largely due to a marked 
drop in the small control group’s grade-point 
average. In each of these studies grade-point 
average or final examination scores improved 
for the desensitization groups more than for 
control groups, suggesting that there is a 
small effect from relaxation. 

A laboratory experiment by Pascal (1949) 
bears upon questions of the effects of relaxa- 
tion on test taking. Pascal found that brief 
relaxation instructions significantly improved 
the recall of Previously learned nonsense 
syllables. He used several different time in- 
tervals between learning and recall and pre- 
sented the material auditorially to the S, 
spelling the nonsense syllables. Relaxation 
immediately prior to recall significantly im- 
Proved recall but relaxation instructions 
given immediately after learning was ineffec- 
tive. His Ss were not selected for anxiety 
level. 

One shortcoming of all of these studies has 
been the lack of control of relaxation during 
acquisition. If anxiety interferes with acquisi- 
tion, the effectiveness of relaxation during re- 
call must be limited. The study to be reported 
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here throws some additional light on these 
variables by controlling relaxation during 
both acquisition and recall in a laboratory 
learning task. It is predicted that (a) Ss who 
measure low in anxiety will be hindered by 
relaxation instructions but that Ss high in 
anxiety will be helped and (5) relaxation 
will have these effects during both acquisition 
and recall. In addition, the present study in- 
cludes two degrees of task difficulty in line 
with the suggestion often made that anxiety 
is more disruptive on more difficult tasks 
(e.g, Standish & Champion, 1960). 


METHOD 


Subjects. The Ss were 121 college students who 
were assigned to one of two groups on the basis of 
their performance on the Anxiety (A) scale of the 
Minnesota Multiphasic Personality Inventory 
(MMPI). The 77 Ss who scored low on the A scale 
(4 or below) were included in the low-anxiety 
group, and the 44 Ss who scored high on the A scale 
(20 or above) were included in the high-anxiety 
group. The criteria scores for inclusion in these two 
groups were set on the basis of preliminary tests at a 
level which seemed likely to insure that approxi- 
mately equal numbers of Ss would be available for 
each of the two groups and that these Ss would 
indeed be from the extremes of the population in 
terms of anxiety, 

Apparatus and materials. A list of 10 easy paired 
associates was constructed by using common associ- 
ates in a list prepared by F. R. Fosmire (1962). The 
association value of these words had been determined 
by using approximately 430 students at the Univer- 
sity of Oregon as Ss during 1960 and 1961. The 
association value of the words in the easy list ranged 
from J755 (table-chair) to .474 (butter-bread) with 
a median of .558. A list of 10 difficult paired associ- 
ates was constructed by taking 10 other stimulus 
Words from Fosmire's list and selecting for them 
Tesponse words which met two criteria. These cri- 
teria were that (a) the word had not been given 
as a response to the stimulus word by any of Fos- 
mire’s Ss and (b) the two Es agreed that the asso- 
ciation was a difficult one. As an additional precau- 
Hon taken in order to insure that the correct word 
was not initially dominant, stimulus words for the 
hard list were chosen only if they had responses of 
high-association value associated with them by 5s 
in Fosmire’s sample, For example, in the hard list 
the paired associate girl-brick was used. The common 
Tesponse given to girl was boy with an association 
value of .548, The common responses given to the 
stimulus words in the hard list. had association values 
tanging from .723 to 426 with a median of .548. 

Procedure. The items were presented in a memory 
drum with each of the 10 items being exposed for 6 
Sec. with a 6-sec. interval between items. The SS 
were given a training trial and two test trials. On the 
training trial each paired associate was presented 
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TABLE 2 
MEAN RECIPROCAL LATENCIES, SECONDS, OF CORRECT RESPONSES ON SECOND Test TRIAL 
Hard task Easy task 
Group 
Rel R x 
No reles &cquisition Den Tos No relax n Mem juna 
High anxiety | .262 (8) | .215 (4) .252 (7) | .218 (6) | .726 (5) .806 (5) | .789 (4) | .789 (5) 
Low anxiety | .309 (10) | .190 (11) | .266 (8) | .141 (10) | .867 (11) | .785 (8) | .751 (9) | .760 (10) 


Note.—Numbers in parenthesis are the number of Ss in each group. 


to S and on the test trials only the stimulus words 
were presented. The first test trial followed the 
training trial by 6 sec. and the second test trial 
was given 48 hr. later. The E measured latencies by 
means of a stopwatch and recorded results to the 
nearest tenth of a second. 

In addition to the divisions based on anxiety level 
and task difficulty Ss were divided into four groups 
on the basis of relaxation instructions. These four 
groups were: (a) relaxation instructions given im- 
mediately before the training trial, (b) relaxation 
instructions given immediately before the second test 
trial, (c) relaxation instructions given before both 
the training trial and the second test trial, and (d) 
no relaxation instructions. The relaxation instructions 
were a modified form of those used by Pascal (1949) 
and required about 5 min. to read to S. The S was 
asked to concentrate on relaxing major muscle 
groups one at a time and to coordinate relaxation 
with exhalation. 


RESULTS AND DISCUSSION 


Dependent variables for this study are re- 
ciprocal latencies (seconds) and number of 
correct responses. Table 1 presents the mean 
reciprocal latencies of correct responses on 
the first test trial. An analysis of variance 
shows that on the first test trial (a) Ss with 
low measured anxiety reacted significantly 
faster than Ss with high measured anxiety 
(F = 7.83, df = 1/113, p< .01), (b) reac- 
tion times were faster for the easy task than 
for the hard task (F = 439.96, df = 1/113, 
P «.001), and (c) the effects of the relaxa- 
tion instructions depended on the measured 
anxiety level of the Ss; that is, relaxation 
instructions made the nonanxious Ss slower 
and the anxious Ss faster (F = 11.42, df= 
1/113, p< .01). A significant triple inter- 
action can be interpreted to mean that the 
effect of the relaxation instructions is some- 
what dependent on task difficulty; that is, 
the effects of the relaxation instructions are 
greater for the hard task than for the easy 
task (F = 3.88, df = 1/113, p < 01). 


The mean reciprocal latencies of correct re- 
sponse on the second test trial are presented 
in Table 2. An analysis of variance shows 
two significant Fs. First, the latencies for 
the groups that had the easy task are signifi- 
cantly faster than for the groups that had 
the hard task (F = 384.17, df=1/105, p< 
001). The mean reciprocal latency for the 
easy task was .789 and that for the hard task 
was .230, Second, and more important, there 
was a significant Measured Anxiety Level x 
Relaxation Instructions interaction (F= 
3.75, df = 2/105, p< .05). The absence of a 
significant triple interaction can be inter- 
preted to mean that the double interaction 
based on the groups that had the harder task 
is not significantly different from the double 
interaction based on the groups that had the 
easy task. The data for the hard and easy 
tasks were combined into an A X B table in 
order to better examine the meaning of the 
interaction Anxiety Level X Relaxation, By 
inspection it appeared that the significant 
interaction was based on the differences be- 
tween the groups that did not have relaxation 
instructions and the groups that did. That 
this was the case is indicated by the results 
of t tests of the possible subinteractions using 
the error term from the preceding analysis of 
variance. The interaction Anxiety Level x 
Relaxation corresponds to that of the first 
test trial; that is, the relaxation instructions 
slowed the low-anxiety groups and speeded 
up the high-anxiety groups." An additional 
finding indicated by the / tests is that there 
was a significant interaction between anxiety 
level and certain of the other conditions of 
relaxation: (a) between relaxation during re- 


3 The high-anxiety groups that had the hard task 


were made slower by relaxation instructions (Table 
2) but not enough so to make the triple interaction 


significant. 
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call and relaxation during both acquisition 
and recall and (5) between relaxation during 
recall and relaxation prior to acquisition. The 
meaning of both of these interactions is that 
relaxation prior to acquisition is more effec- 
tive than relaxation prior to recall in making 
the high-anxiety group faster and the low- 
anxiety group slower. 

Another possible analysis of the effects of 
the independent variables on response laten- 
cies during the second test trial involves the 
changes from Trial 1 to Trial 2. An analysis 
of the variance in these changes shows no 
significant Fs. 

When the number of correct responses is 
considered as the dependent variable, all of 
the Ss performed so well on the easy task 
that no effects of the independent variables 
can be seen. Only one S missed as many as two 
responses on either the first or second test 
trial. On the hard task, although the results 
are in the expected direction with the relaxa- 
tion instructions making the high-anxiety Ss 
better and the low-anxiety Ss worse, the re- 
sults are not significant, An indication of the 
level of difficulty of the hard list can be seen 
from the fact that the average number cor- 
rect was 4.484 for the first test trial and 3.766 
for the second test trial. 

The results of this experiment support the 
view that relaxation does make a difference 
in the immediate and delayed recall of verbal 
material. As predicted, Ss who measured low 
in anxiety were hindered by relaxation in- 
structions, but Ss who measured high in anx- 
lety were helped. These results, when taken 
together with the results of experiments on 
test anxiety already cited, offer strong sup- 
port for the view that desensitization is help- 
ful to many persons who are anxious when 
taking tests. Under the conditions of this ex- 
periment relaxation prior to acquisition was 
more effective in helping the high anxiety 
group than relaxation prior to recall Al- 
though an extension of this finding to class- 
toom material is speculative, the obvious in- 
terpretation would be that desensitization for 
test anxiety should also involve the learning 
situation prior to test taking. The person who 
becomes less anxious during tests cannot per- 
form better if other anxieties have hindered 
his learning the material in the first place. 
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Although it was not anticipated, the pres- 
ent experiment affords an opportunity to re- 
examine the hypothesis of Kleinsmith and 
Kaplan's (1960) concerning activation level 
and consolidation. They found that stimulus 
words which caused more arousal as measured 
by the galvanic skin response were conducive 
to poorer immediate recall of a paired asso- 
ciate but better delayed recall than stimulus 
words which caused less arousal. In the pres- 
ent experiment the authors have different lev- 
els of arousal associated with Ss rather than 
with stimulus material. The authors have ex- 
amined the data on average numbers of cor- 
rect responses, but find no suggestion that 
delayed recall is superior for Ss who are more 
anxious. As was already stated, reciprocal 
latencies did not change significantly from 
Trial 1 to Trial 2. 
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This study attempted to determine the relative effects of social isolation and 
confinement, of 1-wk. duration, on measures of EEG activity, intellectual 
test performance, affect, and a variety of psychological and somatic symptoms 
derived from Myers postisolation questionnaire. No significant differences in 


performance were observed on any of the 11 tests of the intellectual battery or 
on measures of subjective stress and mood. However, a significant decrease in 
occipital alpha frequency was obtained in both the socially isolated and confined 
groups after the 1-wk. period. Since there was no significant difference between 
these two groups in the magnitude of the decrease, this EEG change appears 
to have resulted from confinement alone. An analysis of the questionnaire data 
revealed significant F ratios on 16 of the 22 content areas. Ten of the 16 
symptom categories were associated with confinement alone and the remainder 
were associated either with social isolation or a combination of social isolation 
and confinement, Finally, the results of this study were compared with those 
reported in several earlier, 1-wk. experiments on perceptual deprivation, a 
complex condition involving not only social isolation and confinement but also 


a reduction in sensory stimulation. 


In the typical perceptual deprivation (PD) 
experiment, S is exposed not only to a reduc- 
tion in sensory stimulation but also, of neces- 
sity, to social isolation and physical confine- 
ment in a small windowless environment. 
Although various suggestions have been made 
in the literature that social isolation and con- 
finement may be important factors, particu- 
larly in the long-duration experiments, little 
attention has been directed at dissecting the 
total PD situation and determining the rela- 
tive role of its various components. Further- 
more, what research is available has only pro- 
vided an incomplete picture, for example, a 
comparison of the effects of sensory restriction 
and social isolation (Persky, Zuckerman, 
Basu, & Thornton, 1966; Suedfeld, Grissom, 
& Vernon, 1964; Zuckerman, Albright, Marks, 
& Miller, 1962; Zuckerman, Persky, Hopkins, 
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Murtaugh, Basu, & Schilling, 1966) or a 
comparison of the effects of sensory restriction 
and a combination of confinement plus re- 
stricted motor activity (Zubek & MacNeill, 
1967; Zubek & Schutte, 1966). In none of 
these studies, unfortunately, was an attempt 
made to determine the specific role of con- 
finement per se. 

Recently, however, Zuckerman, Persky, 
Link, and Basu (1968), in an important ex- 
tension of their earlier work, analyzed the 
relative roles of not only sensory restriction, 
social isolation, and confinement, but also of 
experimental set and type of Ss. Using an 
8-hr. duration, measures were taken of two 
hormone metabolites (17-KGS and 17-KS), 
time estimation, affect, and responses to a 
variety of questionnaires containing items 
specifically designed for conditions of isola- 
tion and confinement. Their main conclusion 
was that “the stress effects of confinement are 
rather massive and are found even when Ss 
are neither sensorily or socially isolated [p. 
194].” In view of these results, it is possible 
that confinement alone may also be an im- 
portant factor in producing some of the other 
effects reported in PD experiments, for ex- 
ample, slowing of  electroencephalogram 
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(EEG) activity and cognitive deficits (see 
Zubek, 1969b, for review). 

The two-fold purpose of this study is (a) to 
determine the relative effects of social isola- 
tion and confinement, of 1-wk. duration, on 
Measures of EEG activity, cognitive test per- 
formance, affect, and various psychological 
and somatic symptoms elicited from a 242- 
item questionnaire and (5) to compare these 
effects with those reported in earlier studies 
employing 1-wk. of PD (e.g., Zubek, Aftanas, 
Sansom, Schludermann, Wilgosh, & Winocur, 
1962; Zubek & MacNeill, 1967; Zubek & 
Schutte, 1966; Zubek & Welch, 1963). 


METHOD 
Subjects 


The Ss were 66 male university students who 
volunteered to spend 1 wk. in the University of 
Manitoba isolation chamber. None of them had 
participated in any prior isolation experiment, The 
remuneration was $100.00 for the 1-wk. experimentals 
and $30.00 for the 1-wk. controls. 


Conditions 


Confinement group (CF). This group consisted of 
21 Ss who were placed individually in an air-condi- 
tioned, dome-shaped isolation chamber, measuring 7 
ft. in height, 9 ft. in diameter, and 7.5 ft. at the 
base, for a prescribed period of 1 wk. (for details of 
chamber see Zubek, Sansom, & Prysiazniuk, 1960). 
All Ss except 1 (duration = 17 hr.) completed the 
experiment. Their mean age was 20.2 yr. 

In this condition, the inside of the isolation 
chamber was covered with brightly colored pictures, 
no restrictions were placed on motor activity (exer- 
cising, however, was not permitted), and S had 
constant access to reading material, a variety of jig- 
saw puzzles, a radio, a portable television set, a 
mattress, and a chair with a tablet arm. The lights 
were put out for 9 hr. at night. In order to avoid 
Social isolation, E visited S at least twice in the 
morning, afternoon, and evening. Furthermore, two 
visiting periods per day were scheduled for Ss 
friends and acquaintances. Finally, the two doors 
leading to the chamber were left open enabling S 
to hear, but not see, any on-going activity in the 
outside control room area. 

Socially isolated group (SI). This group consisted 
of 25 Ss who were placed individually in the isola- 
tion chamber for a 1-wk. period. Of this group, 5 Ss 
terminated the condition Prematurely, quitting after 
a mean duration of 47 hr. The mean age of the 
20 successful Ss was 20.6 Yr. The experimental 
environment was the same as for the CF group with 
the exception of three modifications designed to 
achieve a condition of social isolation. First, the 
radio was replaced with piped in music by Muzak 
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(exclusively instrumental music). Second, a view. 
master with 300 travel slides was substituted for the 
television set. Third, no visitors were allowed and 
conversation between S and E was kept to a mini- 
mum (except at the relatively infrequent test ses- 
sions). Furthermore, the two doors leading to the 
chamber were closed at all times. 

Ambulatory control group (AC). This group, con- 
sisting of 20 Ss (mean age = 20.4 yr.), came to the 
laboratory at specified intervals during a 1-wk, con- 
trol period, that is, at the times corresponding to 
the test sessions of the experimental groups. However, 
they were free to leave and do what they wanted 
in the intervening periods. 


Measures 


Intellectual battery. This battery, developed by 
Zubek et al. (1960), consists of 11 different tests of a 
primary mental ability nature, with 8 equivalent 
forms per test, measuring numerical reasoning, num- 
ber facility, abstract reasoning, verbal reasoning, 
verbal fluency, space relations, digit span, recall, 
recognition, cancellation of numbers, and dexterity. 
This battery was administered before and at daily 
intervals during the 1-wk. period. Total administra- 
tion time was approximately 50 min. 

Measures of affect. An appraisal was made of Ss’ 
affective state by two paper-and-pencil tests. The 
first was the Subjective Stress Scale (SSS) used 
extensively in experimental situations inducing vari- 
ous degrees of stress (e.g, Berkun, Bialek, Kern, & 
Yogi, 1962; Myers, 1964). The SSS is a 15-item 
Thurstone scale, with higher scores indicating more 
adverse reactions. The second test, developed by 
Myers, Murphy, Smith, and Goffard (1966) speci- 
fically for isolation and confinement experiments, is a 
measure of overall mood. It consists of a 114- 
item adjective checklist which S answers as applying 
“not at all,” “somewhat or slightly,” or “mostly or 
generally” to him during a particular time period. 
Both the SSS and the mood test were administered 
prior to the experiment and at daily intervals during 
the 1-wk. period. On the pretest, the Ss were re- 
quired to answer the items in terms of their experi- 
Men occurring during the preceding week of normal 

e. 

Myers postisolation questionnaire. This question- 
naire, devised by Myers, Murphy, and Terry (1962) 
consists of 242 items embracing a variety of typical 
Tesponses to situations of isolation and confinement. 
Although scoring keys are Provided for 23 categories 
or content areas, one category (Army assignment) 
was not employed because of its inappropriateness for 
University students, as Ss. These 242 items were so 
devised as to be equally applicable to both experi- 
mental and control conditions. The SI and CF groups 
answered the items as they applied to the week in 
the isolation chamber; the AC group answered the 
Same items as they applied to the corresponding 
Period of normal life, 

EEG activity. The EEGs were obtained from 45 
Ss (15 Ss in each group) before and after the 1-wk. 
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period. The records were taken on an Offner Type T 
machine at the same time of the day (10:00 A.M.) on 
both test sessions with precautions to eliminate any 
drowsiness. Sixteen Ediswan monopad electrodes 
were applied to the head in International Federation 
10/20 positions and two S.L.E. suction electrodes to 
the ears. Average, bipolar, and monopolar recordings 
were taken. In order to obtain a quantitative measure 
of EEG changes, the period-count technique (Engel, 
Romano, Ferris, Webb, & Stevens, 1944) was used 
to determine the mean occipital alpha frequency of 
each S before and after the completion of the 1-wk. 
period. This involved counting, by means of a 
Marshall ruler, the number of waves occurring in 
each of 200, 1-sec, samples of artifact-free occipital 
lobe tracings and then obtaining the mean frequency 
per second, In several cases, 200 artifact-free samples 
could not be obtained and therefore a somewhat 
smaller sample had to be employed. 


RESULTS 


EEG activity. The mean posttest change 
(pre-post difference) in occipital alpha fre- 
quency after 1 wk. of the SI, CF, and AC 
conditions was —.62, —.50, and +.04 cps, 
respectively. The decrease in EEG frequency 
in both of the experimental groups was statis- 
tically significant (p’s < .01), using a ¢ test 
for correlated measures. Furthermore, 14 of 
the 15 Ss in each of the two groups exhibited 
a lower EEG frequency on the postexperi- 
mental relative to the preexperimental test 
period. On the other hand, the AC group, 
showing virtually a zero change, exhibited a 
chance distribution of increases and decreases. 
Finally, an analysis of variance comparing the 
magnitude of the EEG changes shown by the 
three groups of Ss, yielded a significant F 
ratio (F = 14.29, p< .001). Subsequent f 
tests revealed that the results of both the SI 
and CF groups were significantly different 
from those of the AC group (?'s < .001). 
However, the difference between the SI and 
CF groups was not significant. 

Intellectual battery. An analysis of vari- 
ance, comparing the relative changes between 
the three groups of Ss (pre-mean of 7 daily 
tests), was performed on each of the 11 tests 
of the intellectual battery. None of the F 
ratios were significant. An examination of 
trends, however, revealed that the SI group 
performed worse than did the AC group on 
all except 2 of the 11 tests (numerical and 
verbal reasoning). No such general trend was 
shown by the CF group: on only 5 of the 11 
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tests was their performance lower than that of 
the AC group. 

Measures of affect. The results on the SSS 
and mood scale indicated that both the SI and 
CF groups experienced a greater degree of 
subjective stress and possessed a more nega- 
tive mood than did the AC group during the 
1-wk. period. However, on neither of the two 
affective measures were any of the differences 
statistically significant. 

Myers postisolation questionnaire. The 
mean scores of the three groups of Ss on the 
22 content areas of the retrospective question- 
naire are summarized in Table 1. It can be 
seen that significant F ratios were obtained on 
16 content areas. The 6 categories that were 
not affected were attitude toward Es, religious 
thoughts, feelings of hunger, tedium, regrets 
about experimental participation, and feelings 
of well-being. Application of ¢ tests to the 
remaining content areas revealed that both 
the SI and CF groups reported a significantly 
greater incidence of a variety of phenomena 
than did the AC group, for example, visual 
experiences of a hallucinatorylike nature, in- 
efficient thought processes, subjective restless- 
ness, and worry ('s < .001). They also re- 
ported a greater frequency of reminiscence 
and vivid memories, restless acts, and changes 
in body image and self-appraisal (p’s < .05). 
However, on none of these 8 content cate- 
gories was there a significant difference be- 
tween the SI and CF groups. Thus, it would 
appear that all of these subjective phenomena 
resulted solely from confinement. Two other 
phenomena, that is, novel ideas and loss of 
contact with reality, also appear to be affected 
to some degree by confinement. Both of these 
were more prevalent in the CF than in the 
AC group (f’s < .01) and in the SI than in 
the CF group (?'s < .05). 

Further analysis of the questionnaire data 
revealed a significantly greater presence of 
persistent and vivid dreams, sexual thoughts, 
and speech difficulties in the SI than in the 
AC group (7's < .01). None of the other dif- 
ferences were significant, indicating that these 
effects were related to a combination of social 
isolation and confinement. Finally, the SI 
group showed a significantly greater amount 
of temporal disorientation, feelings of hostil- 
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TABLE 1 


MEAN Scores OF Two EXPERIMENTAL 


AND ONE CONTROL Group on Eacu 


OF TWENTY-TWO QUESTIONNAIRE CONTENT AREAS 


Perceptual Social E Ambulatory 
Content areas deprivations | isolation Sentapment controls F 
i i *'halluci- 

hc ina aie in 6.61 5.80 5.00 1.00 14.78*** 
Dreams 5.39 5.60 445 245 486". 
Reminiscence and memory 3.44 3.25 2.85 1.75 6.50% 
Sexual thoughts 2.67 2.05 1.55 0.85 5.25 A 
Novel ideas 2.50 3.15 2.00 0.55 13.09** 
Speech difficulties 1.78 1.30 0.95 0.30 474° 
Self-appraisal 3.11 3.50 2.80 1.50 8.54" 
Inefficiencies of thought 14.39 14.40 12.16 6.30 164077 
Loss of contact with reality 3.83 3.80 2.35 0.65 12,07' 
Positive attitude to Es 4.61 3.30 3.45 3.30 0.24 
Religious thoughts 2.00 1.45 0.90 0.90 0.93 Hd 
Loneliness 1.94 2.60 1.30 1.15 8.92** 
Hunger 3.22 3.50 2.65 1.95 2.81 
Tedium 1.72 1.80 1.25 0.70 2.78 
"Temporal disorientation 2.78 2.05 1.15 0.85 9,0575 
Subjective restlessness 3.50 3.00 3.10 1.00 16.97* 
Restless acts 1.94 2.55 2.20 1.50 7.50** 
Hostility (anger) 2.56 3.20 2.05 1.60 6.14** 
Regret participation 247 240 1.75 0.85 2.93 
Worry 11.22 13.45 12.15 5.25 9.32*** 
Feelings of well-being 4.94 4.75 4.55 4.20 0.74 
Body-image changes 1.72 1.60 1.25 0.30 5,39** 


ity, and loneliness than did either the CF or 
AC groups (?'s < .02). However, on none of 
these three content areas was there a signifi- 
cant difference between the CF and AC 
groups, Thus, they appear to be associated 
solely with social isolation. 

Table 1 also shows, for comparative pur- 
Poses, the mean scores of 18 Ss from an earlier 
study (Zubek & MacNeill, 1967), who re- 
ceived the Myers questionnaire after comple- 
tion of a week of PD. It can be seen that the 
scores of these PD Ss are approximately the 
same as those of the SI group on most of the 
22 content areas. The only areas in which 
substantially higher scores were obtained 
were visual experiences of a hallucinatorylike 
nature, sexual and religious thoughts, speech 
difficulties, attitude to Es, and temporal dis- 
orientation, 


Discusston 


In the light of the Zuckerman et al, (1968) 
study, indicating that 8 hr. of confinement can 
produce a variety of stress effects, it is not 
surprising that a prolonged period of confine- 


ment, of 1-wk, duration, should likewise pro- 
duce significant changes on several physio- 
logical and psychological measures. Of these, 
the slowing of EEG activity is of particular 
importance since this phenomenon has been 
reported in numerous PD experiments (see 
Zubek, 1969a, for review). In the present 
Study both the SI and CF groups showed à 
significant posttest decrease in occipital alpha 
frequency (—.62 and —.50 cps, respectively). 
Furthermore, since the difference between the 
two experimental groups was not statistically 
significant, this EEG effect appears to have 
resulted from confinement alone. Although a 
t Change was observed, its magni- 
ude, 


(1963) after 1 wk. of PD. Using Engel et al.'s 


(1944) period-count technique for analysis of 
the EEG records, the posttest decrease in 


finement and half by a reduction in sensory 
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stimulation. Social isolation, apparently, is 
not a contributing factor. 

In view of the relationship claimed to exist 
between the presence of EEG slowing and 
deficits on various cognitive and perceptual 
tasks (Pollack, 1963), significant impair- 
ments on the intellectual battery might be ex- 
pected. This was not the case. Although a 
general trend toward poorer performance was 
observed in the SI group, an analysis of 
variance showed no significant F ratios on any 
of the 11 tests. Perhaps in the present study 
the EEG change was not sufficiently great to 
produce significant impairments on the intel- 
lectual battery. Nevertheless, some experi- 
mental support for these negative findings has 
been provided by Smith (1969) who, in a 
review of the literature on small groups in 
confinement, concluded that there is very 
little evidence to indicate that intellectual im- 
pairments occur during periods ranging from 
5 to 60 days. In the studies cited, the tests 
involved measures of verbal reasoning, nu- 
merical ability, learning, memory, logical 
reasoning, and perception, that is, tests simi- 
lar to the authors’ intellectual battery. 

Although neither social isolation nor con- 
finement produced any intellectual deficits, it 
is important to note that the PD condition 
can produce such an effect. Using the same 
test battery and duration (1 wk.), Zubek 
et al. (1962) reported a significant impair- 
ment on measures of number facility, numeri- 
cal reasoning, abstract reasoning, space rela- 
tions, verbal fluency, recognition, cancellation 
of numbers, and dexterity. These two sets of 
results, therefore, appear to indicate that in 
the complex PD condition the restriction of 
sensory stimulation is the critical factor in 
producing the various intellectual deficits; 
social isolation and confinement appear to 
exert a negligible effect. 

The results on the two paper-and-pencil 
measures of affect indicated that both the SI 
and CF groups possessed a more negative 
mood and experienced a greater degree of sub- 
jective stress than did the controls. However, 
none of the differences were statistically sig- 
nificant. These negative results are not sur- 
prising since similar results, on these two mea- 
sures, were obtained in an earlier Manitoba 
study employing 1 wk. of PD (Zubek & 
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Schutte, 1966). Although the results on the 
SSS were not significant, one cannot conclude 
that the social isolation and confinement con- 
ditions were not stressful. The SSS, consisting 
of only 15 items, may be too crude a measure. 
The use of other, more sensitive indicators of 
stress, may have produced a different picture. 
This hypothesis is supported by the study of 
Zuckerman et al. (1968) who, using a com- 
prehensive battery of both behavioral and bio- 
chemical (17-KGS and 17-KS) indicators of 
stress reported that the stress effects of con- 
finement were “rather massive.” Further evi- 
dence in support of the fact that the experi- 
mental conditions were stressful, at least for 
some of the Ss, is indicated by the differential 
quitting rates. Five percent of the CF group 
failed to endure the full 1-wk. period in con- 
trast to 20% of the SI group. Moreover, an 
even higher quitting rate has been reported in 
1-wk. PD experiments. In four studies con- 
ducted at the Manitoba laboratory (Zubek, 
1963, 1964; Zubek et al, 1962; Zubek & 
Schutte, 1966), the quitting rate has ranged 
from 25 to 42% (M = 33%). Thus, using an 
endurance measure, it is clear that the various 
experimental conditions become increasingly 
less tolerable as one proceeds from confine- 
ment, to social isolation, and finally to per- 
ceptual deprivation. 

An analysis of the results on the Myers 
postisolation questionnaire revealed a variety 
of unusual psychological symptoms, phenom- 
ena similar to those first reported in the 
McGill PD experiments (e.g., Bexton, Heron, 
& Scott, 1954) and subsequently confirmed 
at a number of different laboratories (see 
Zuckerman, 1964, for review). Of the 16 con- 
tent areas that showed significant F ratios, 
8 were associated solely with confinement, 
viz., experiences of a hallucinatorylike na- 
ture, inefficient thought processes, subjective 
restlessness, restless acts, worry, reminiscence 
and vivid memories, and changes in body 
image and self-appraisal. A further two symp- 
toms (novel ideas and loss of contact with 
reality) also resulted to some degree from 
confinement but their incidence was signifi- 
cantly increased by the addition of social 
isolation, Thus, it is clear that most of the 
16 symptoms were affected by confinement. 
On the remaining content areas, 3 (temporal 
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disorientation, feelings of hostility, and lone- 
liness) were associated solely with social iso- 
lation and another 3 (persistent and vivid 
dreams, sexual thoughts, and speech difficul- 
ties) were related to a combination of social 
isolation and confinement. Finally, a com- 
parison of these results with those of an ear- 
lier 1-wk. PD experiment (Zubek & Mac- 
Neill, 1967), using the same questionnaire, 
revealed that the scores of the PD group 
were approximately the same as those of the 
SI group on most of the 22 content areas. 
The only symptoms in which the PD group 
showed substantially higher scores were ex- 
periences of a hallucinatorylike nature, tem- 
poral disorientation, speech difficulties, sexual 
and religious thoughts, and attitude to Es, 
The incidence of these phenomena, therefore, 
appears to be increased to some degree by 
the imposition of a reduction in sensory 
stimulation. 

Although Zuckerman et al. (1968) also 
used the Myers questionnaire in the only 
other relevant study, their results cannot be 
directly compared with the present authors’ 
since the duration involved was only 8 hr. 
Furthermore, two Ss were placed in their 
confinement condition, a situation which, ac- 
cording to Smith (1969), sometimes leads 
to "difficult interpersonal problems." Never- 
theless, the results show considerable agree- 
ment. Even with their short duration, Zuck- 
erman et al. observed that the effects of 
confinement were quite general, showing an 
increased incidence of most of the question- 
naire-elicited symptoms. In addition, they 
also reported that the effects of Sensory re- 
striction “tend to be more specific, consisting 
of an anxiety reaction which is related to the 
cutting off of one's normal sensory ties with 
reality and the appearance of unusual percep- 
tions and ideas [p. 194]." 

In interpreting the results of this experi- 
ment, two factors should be taken into con- 
Sideration. The first is the physical dimen- 
Sion of the chamber. The Manitoba isolation 
chamber is quite small, consisting of only 
45 sq. ft. of floor Space, and in view of this 
it is possible that the reported results may 
be related, to some degree, to the use of a 
very restricted physical environment, In this 
regard, it is interesting to note that two Ss 
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who complained about being “hemmed-in” 
showed the largest posttest decrease in alpha 
frequency (—1.00 and —1.18 cps) of the CF 
group, indicating that the size of the living 
space may be an important variable, Further 
research, systematically varying the size of 
the confinement area (both physical and per- 
ceived space) might prove fruitful. Second, 
in the usual PD condition Ss are provided 
with little or no cues to the passage of time, 
This was not the case in the present experi- 
ment. In order to achieve an essentially nor- 
mal, nonboring sensory environment during 
a prolonged period of time, the lights were 
put out at night, and in the CF condition, a 
radio and television set were provided, all 
of which provided Ss with time cues. The 
presence of this information, therefore, may 
have diminished somewhat the extent of the 
Social isolation and confinement effects. De- 
spite this, their effects were still considerable. 
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ERRATUM 
In the article, *Self-Verbalization and Emotional Arousal,” by David C. Rimm 


of the Journal of 


should read “It is reasonable to suppose that once a ver 
negative affect or drive, this will motivate instru 
“One might predict that when such a un 
function as punishment suppressing the particular | x 
“Suppose, hover that the negative affect did drive an instrumental response, 


culminating in drive reduction.” 
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DELINQUENTS USING RORSCHACH CONTENT AND 


THOUGHT PRO 


ALAN H. ROBERTS? 


University of Minnesota 


AND 


CESS SCALES' 


ROBERT V. ERIKSON 
Loysville Youth Development Center, Pennsylvania 


Institutionalized delinquent males were separated by judges into a more con- 
trolled (GGs) and more impulsive (SSs) group. Ss were matched by age, IQ, 
and ethnic origin. Rorschachs were individually administered and scored on 
six content scales and two thought process scales. A single control score derived 


from these scales significantly 


differentiated the two groups. In particular, the 


SSs produced more content similar to that hypothesized for manic patients while 
the GGs demonstrated more thinking which involved exaggerated attempts at 


control, 


Previous studies by Erikson and Roberts 
(1966) and Roberts and Erikson (1968) 
have demonstrated that two groups of insti- 
tutionalized delinquent boys can be reliably 
differentiated with respect to foresight and 
planning ability, impulse control, willingness 
or ability to delay gratification, and ratings 
of adjustment in a training school situation. 
The goal of predicting these variables in de- 
linquents is an important one; when predic- 
tion is possible, there is a rational basis for 
planning and counseling. 

The purpose of the present study is to de- 
termine whether impulsive and controlled de- 
linquents can be differentiated by the Ror- 
Schach content and thought process scales of 
Bower, Testin, and Roberts (1960). An anal- 
ysis of their intercorrelational and factor- 
analytic data suggested that eight of their 
Scales could be combined into two sets of 
four scales each, one set associated with “im- 
pulsiveness” (their Catatonic, Manic, and 
Character Disorder Content scales and the 
Disorganized Thought Process scale) and an- 
other set associated with “control” (their 
Paranoid, Depressive, and Obsessive-Compul- 


1 This study was supported in part by the Voca- 
tional Rehabilitation Administration (Training Grant 
RT-2). The authors thank the staff of the New 
Mexico Boys' School for their cooperation in this 
study and Peter Briggs, Paul E. Meehl, William 
Schofield, and Auke Tellegen for critically reading an 
earlier version of this paper. 

? Requests for reprints should be sent to Alan H. 
Roberts, Department of Physical Medicine and 
Rehabilitation, Mayo Memorial Building, University 
of Minnesota, Minneapolis, Minnesota 55455. 
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sive Content scales and the Arbitrary Tight- 
ness Thought Process scale). It is hypothe- 
sized that the more impulsive of the two 
delinquent groups (hereafter termed SSs) will 
obtain higher scores on the Catatonic, Manic, 
and Character Disorder Content scales and 
the Disorganized Thought Process scale 
while the more controlled group (hereafter 
termed GGs) will obtain higher scores on the 
Paranoid, Depressive, and Obsessive-Compul- 
sive Content scales and the Arbitrary Tight- 
ness Thought Process scale. On the basis of 
this hypothesis, a single control score was de- 
rived as described below. 


METHOD 


A group of 18 SSs and 29 GGs were selected by 
the staff of the school and from these two groups, 
10 pairs were matched by age, IQ as measured by the 
California Short Form Test of Mental Maturity, and 
ethnic origin. There were 8 Spanish-American pairs 
and 2 Anglo-American pairs. The mean age for the 
SSs was 16.5 (SD = 137) and the mean IQ was 88.2 
(SD = 12.51). For the GGs, the means were 16.6 
(SD — 1.19) and 88.5 (SD = 9,65), respectively. The 
two groups did not differ signifücantly with respect 
to length of stay at the training school, number of 
previous commitments, or grade in school. 


* The designations, GGs and SSs, are derived from 
the vernacular of the training school where staff 
and students refer to the well-behaved students as 
“good guys" and the poorly behaved students as 
"shit sticks.” The boys referred to as GGs are more 
often described by their dormitory supervisors with 
terms like causes no trouble, quiet, obeys the rules 
and regulations, etc, while the boys referred to as 
SSs are more often described with terms like fighting, 
mouthing off, general nuisance, etc. 
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TABLE 1 


INTERSCORER RELIABILITY AND Comparison OF SSs AND GGs ON CONTROL SCORE, 
CONTENT SCALES, AND THOUGHT Process SCALES 


iE SSs GGs hw 
Measure t Reliability 
M SD M SD 

Control score 11.7 5.80 18.1 6.39 2.31* — 
Control variables 

Paranoid 24 2.95 3.0 3.30 0.97 69 

Depressive 3.2 2.82 5.1 4.33 1.01 88 

Obsessive-Compulsive 0.8 0.92 0.6 0.97 —0.51 64 

Arbitrary Tightening 23 2.31 5.7 4.00 3.64** 71 
finpuss variables 

atatonic 1.8 2.78 0.6 1.07 1.31 84 

Manic 2.6 2.22 1.2 1.93 2.41* 81 

Character Disorder 24 2.37 1.6 147 1.06 79 

Disorganization 15 2.72 18 3.05 —0.20 58 


Note,—All scores except control score are in raw score form. N = 10in each group; SS = impulsive group; GG = controlled 


group. 
* 5 <.05, one-tailed. 
9* > <.01, one-tailed. 


Rorschachs were administered individually and the 
two groups did not significantly differ with respect to 
the total number of Rorschach responses given. For 
the SSs the mean was 12.1 (SD — 4.65) and for the 
GGs the mean was 11.5 (SD — 5.25). The instructions 
of Bower, Testin, and Roberts (1960) were used in 
scoring content and thought process scales. These 
scales were scored independently by two judges and 
their interscorer reliabilities are shown in Table 1. 

All scale scores were transformed to standard 
scores combining the two groups for this purpose. 
The standard scores of scales hypothesized to be 
associated with control were summed without 
weighting them, as were the scores hypothesized to be 
associated with impulsiveness. The impulsiveness 
total was then subtracted from the control total and 
a constant was added to each score to eliminate 
negative numbers, This final figure represented a 
control score with the higher values associated with 
greater control. 


RESULTS AND DISCUSSION 


'The mean control scores for SSs and GGs 
are shown in Table 1. The mean control score 
ior GGs is significantly higher than for SSs 
Supporting the overall hypothesis. 

The raw score means for the various indi- 
vidual scales are also compared in Table 1 
in order to get a better idea of which scales 
contribute most to this finding. All means 
are in the predicted direction except the 
Obsessive-Compulsive and Disorganization 


scales. Of the control variables, the Arbitrary 
Tightening scale significantly differentiates 
the two groups while among the impulse vari- 
ables, the Manic Content scale significantly 
differentiates the two groups. 

The predictive efficiency of the control 
score, using an ad hoc "best cut," was ex- 
amined. Maximum separation is achieved 
with a cutting score of 16. Using this score, 
8095 of SSs achieve a score of 16 or less 
while 70% of GGs achieve a score of 17 or 
more. Twenty percent of the SSs and 3096 
of the GGs are misclassified with this cut. 
While some information concerning the base 
rates of SSs and GGs is available (Roberts 
& Erikson, 1968), the application of Bayes' 
rule for calculating inverse probability 
(Meehl, 1956) may be questionable because 
of the use of matched pairs. While such an 
application would be "favorable" in this in- 
stance (ie. assertions based on the score 
would be more likely to be correct than in- 
correct), .these findings must be cross-vali- 
dated because of the ad hoc determination of 
the cutting score. 

The results of this study demonstrate that 
impulsive and controlled delinquent Ss can 
be differentiated by content and thought proc- 
ess scoring of the Rorschach. In particular, 
the more impulsive group appears to produce 
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content similar to that hypothesized for  Exrksow, R. V., & Roserts, A. H. A comparison of — 


s A H two groups of institutionalized delinquents on 
manic patients while the m controlled Porteus Maze Test performance. Journal of Con- 
group tends to demonstrate thinking which sulting Psychology, 1966, 30, 567. 
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involves exaggerated attempts at control. Meerut, P. E. Wanted—a good cookbook. American 
Psychologist, 1956, 11, 263-272. 
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REALITY CLOSENESS, PHANTASY, AND AUTOKINESIS: 
A DIMENSION OF COGNITIVE STYLE ' 


MARTIN MAYMAN? 
University of Michigan 


AND 


HAROLD M. VOTH 


The Menninger Foundation, Topeka, Kansas 


This study examines the validity and predictive power of an explanatory hy- 
pothesis for the occurrence of the autokinetic phenomenon. It assumes that in 
the autokinetic situation, as in a wide range of life situations, some people 
characteristically maintain a “reality-close” life style, while others character- 
istically allow attention to drift toward more subjective endopsychic experi- 
ences. The relationship between reality closeness, phantasy proneness, and 
autokinesis is examined in a pilot study and two replication studies. The 
relevance of these findings to the psychoanalytic theory of ego functioning 


is discussed. 


Almost anyone who looks steadily at a fixed 
pinpoint source of light in otherwise absolute 
darkness is likely soon to see the stationary 
light begin to “move.” The earliest explana- 
tions assumed that unconscious eye move- 
ments caused retinal shifts which would ac- 
count for the kinds of light movements re- 
ported by Ss, but this belief was easily dis- 
counted by a variety of experiments (Royce, 
Carran, Aftanas, Lehman, & Blumenthal, 
1966). Ocular-retinal explanations were soon 
replaced by speculations that the illusion can 
be explained in cortical neurophysiological 
terms, but all such explanations have lacked 
substance and failed to generate testable 
hypotheses (Reinwald, 1952). 

Contrasting with such simple neurophysio- 
logical explanations of the illusion is a grow- 
ing body of evidence that the illusion is di- 
rectly responsive to a wide range of experi- 
mental conditions affecting the psychological 
or mental state of the person in the autokinetic 
test situation (Santos, Farrow, & Haines, 
1965; Schwartz & Shagass, 1960; Voth, 1941; 
Voth & Mayman, 1963). Coincident with these 
studies has been the emergence of equally 
impressive evidence that the autokinetic il- 
lusion is directly related to central personality 


l'This research was supported by Public Health 
Service Research Grant No. MH 07547-03, from the 
National Institute of Mental Health. The authors 
are much indebted to Phyllis Levy and Marilyn 
Klein Silverman for their indispensable help in 
carrying out the study. 

2 Requests for reprints should be sent to Martin 
Mayman, Psychological Clinic, 1027 East Huron 
Street, Ann Arbor, Michigan. 


variables. Test-retest studies have shown high 
repeat reliability in the amount of movement 
seen (summarized in Voth & Mayman, 1963). 
Other investigators have shown that auto- 
kinesis covaries with personality style (Bush, 
Hatcher, & Mayman, 1969; Temerlin, 1956; 
Voth & Mayman, 1963), with clinical behav- 
ior (Voth & Mayman, 1966, 1967, 1968), 
and with choice of illness (A. C. Voth, 1947; 
H. M. Voth, 1962). 

The present study was undertaken in an 
attempt to further clarify the relationship be- 
tween autokinesis and personality style by 
turning attention to psychological processes 
which might account for individual differences 
in performance on the autokinetic test. 

In previous studies an intervening variable 
was formulated to help explain the link be- 
tween such seemingly disparate phenomena as 
autokinesis and character style. It was sug- 
gested that persons habitually differ with re- 
spect to their “closeness to" or “distance 
from” their concrete stimulus field. The more 
“reality close” the person, the less autokinesis 
will he experience. On the other hand, the 
more distance a person characteristically 
maintains from people and things about him, 
that is, the more “reality distant” the person, 
the more autokinetic movement does he see. 

In this study, the authors attempted to 
account for this relationship between auto- 
kinesis and life style by showing that a 
significant variable in autokinetic movement 
perception is S’s unintentional retention of an 
image of the darkened room during the auto- 


3 See especially Voth and Mayman, 1963 and 1966. 
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kinetic test. The authors tried to show that 
reality-close people * see little or no autokinetic 
movement because they retain a firm and last- 
ing impression of the autokinetic room in the 
dark and retain, thereby, an orienting frame 
of reference that prevents the occurrence of 
the illusion. Conversely, the more reality dis- 
tant the person, the quicker the experimental 
room fades out of awareness as a subjectively 
real presence and, consequently, the quicker, 
more compelling, and more extensive will be 
his experience of autokinesis. 

The authors also attempted to confirm the 
corollary hypothesis that those persons who 
are characteristically outwardly rather than 
inwardly oriented, and whose attention re- 
mains outwardly directed even during a fairly 
extended period of social and perceptual iso- 
lation, are less likely to experience imagery 
and phantasy in the autokinetic situation. On 
the other hand, the more reality distant a 
person, the more vivid, rich, and compelling 
will be the coincident imagery and phantasy 
which intrude and merge with his autokinetic 
experience. 

In short, this study tested two predictions: 
(a) Autokinesis will be positively correlated 
with the extent to which the experimental 
room fades out of awareness. and (b) The 
intrusion of imagery and phantasy into the 
autokinetic experience will be directly related 
to reality closeness, reality distance as mea- 
sured (a) by the autokinetic score, and (5) 
by the degree to which the room fades out of 
awareness in the course of the experiment. 


METHOD 


The S was seated at a table 9 ft. away from a 
small aluminum box mounted on a tripod stand 
which was clearly stationary. The aperture for the 
light source was .0280 in. in diameter, A $-in. photo- 
graphic filter inside the box produced diffused light, 
-0006 ftc. in intensity. Before S on a table was a 
drawing board on which was fastened a sheet of 
newsprint paper 22 X 28 in. The S was told he would 
be left alone in a totally darkened room except for 
the pinpoint of light which he was to look at steadily 
for 10 min. He was told that the light may or may 
not appear to move. Should it move, he was to 
trace its path with a pencil. If the light at any time 


2 *In previous papers the authors used the phrase 
"ego close" to designate the ego stance which is 
referred to here as "reality close." The term ego 
close proved misleading to many readers and was 
dropped from this study. 
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stopped moving, he was to make a heavy dot but 
leave his pencil on the paper ready to resume tracing 
without delay if the light started moving again, 
Should the pencil reach the edge of the board, he 
was to start again from the approximate center of 
the sheet and proceed as before. The room was not 
darkened when these instructions were given, and 
discerning Ss could easily tell that the light was 
actually a stationary one. No suggestion was given 
that the light would move, Ss were merely told it 
“may or may not appear to move." The score used 
in this study for representing autokinesis was the 
logarithmic conversion of the length of the line drawn 
by S to record the apparent motion of the light. 

No pretest period of dark adaptation was provided. 
The 10-min, test session gave all Ss ample oppor- 
tunity to adapt to the dark. Those who were most 
prone to see autokinetic movement did not need any 
time at all to accommodate themselves to these 
stimulus conditions; they saw autokinetic movement 
almost from the first moment the room lights were 
turned off. Those not inclined to see autokinetic 
movement saw little or none even after full dark 
adaptation had been achieved. 

In a pilot study and two replication studies, in- 
patient, outpatient, and normal volunteers were given 
the autokinetic test and then interviewed for their 
subjective experiences in the autokinetic situation. An 
interview guide was prepared to help Ss describe 
Observations bearing on the two predictions which 
were to be tested. To learn whether, and to what 
extent, the objective, three-dimensional space of the 
experimental room faded out, each S was asked a 
series of progressively more pointed questions, such 
as the following: 


(a) Can you remember your impressions of the 
room as you were watching the light? Did the 
room seem any different to you in the dark than 
it was when you first walked in? (What were 
your impressions as to its size? Did it seem as 
if you were in some different kind of room? If 
another kind of room, what kind?) (b) To what 
extent did you remain aware of some of the things 
in the room? Where you were sitting in the room? 
Where the light was located in the room? (¢) 
When (If) the room seemed so different to you, 
to what extent did it continue to seem to be à 
room at all? Did it still feel as if you were in- 
doors? 


To assess the degree to which images, personalized 
Dhantasies, and preoccupations were projected onto 
the autokinetic light, each S was asked such questions 
as the following: 


(a) What were your impressions of the light? 
(b) Did the light make you think of anything? 
Did it ever seem like something else, something 
Other than a point of light? (c) How much ur 
as was it? Was this a passing impression, 
or did it continue to seem that way throughout 
the ten minutes? Did the light ever resemble 
anything else? (d) Did you have any other 
thoughts about the moving light? Any speculations 
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about what the course of movement might tell 
about you? (e) When it seemed like you were 
watching a , where did you seem to be 
watching it from? 


The answers to each set of questions were tape- 
recorded, transcribed, and excerpted. Excerpts de- 
scribing reality closeness were pooled for each S and 
separated from those which described phantasy em- 
bellishments of the autokinetic experience. Five-point 
equal interval scales were used and each S was 
given independent ratings for reality closeness and 
phantasy embellishment. 

Ss who scored 1 on reality closeness typically re- 
ported continued awareness of the actual size and 
shape of the room throughout the 10 min. of dark- 
ness. In fact, they were puzzled by questions about 
how the room may have “seemed to change.” They 
remained more or less conscious of objects in the 
room, of incidental sounds which easily caught their 
attention, etc. A 5 rating for reality closeness was 
made when an S reported the feeling that he lost 
all sense of the room he was in during the 10 min. 
of darkness and that he felt at times as if he were 
outdoors on a dark night, or out in unbounded space. 
Implicit in this rating is a distinction between 
“reality testing” and the “sense of reality.” A high 
rating for reality closeness concerns itself only with 
changes in the sense of reality; that is, Ss described 
how the room felt to them in contrast to what they 
knew the room to be like in actuality. 

A 5 score for phantasy embellishment was given 
whenever the light seemed to take on some special 
meaning or when a phantasy replaced the more ob- 
jectively neutral light experience, for example, “ike 
watching headlights of an automobile which is driv- 
ing along a distant mountain road.” A 1 score on 
phantasy embellishment was given for reports in 
which the light was experienced as being nothing 
more than a pinpoint light. 

Examples of excerpts rated 1 through 5 on reality 
closeness and phantasy embellishments are available 
on request. 

In the pilot study, 48 inpatient and outpatient 
volunteers were tested by the first author, who then 
excerpted those statements about reality closeness 
and phantasy for independent analysis by three 
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raters. In the first replication study, 31 patient and 
normal volunteers were tested and interviewed by a 
third-year graduate student, and their protocols ex- 
cerpted by this student for “blind” rating by the 
authors. In the second replication study, a still 
more exacting set of experimental conditions was 
employed. To determine to what extent the relation- 
ship the authors were demonstrating was a function 
of the skill and subtlety of the interviewer, another 
group of volunteer Ss was tested, but this time by a 
person with no formal training in psychology. She 
was an alert, intelligent woman who had been the 
project secretary for some years. She did the inter- 
viewing of 60 Ss and excerpted the relevant passages 
for the two senior raters. 

All interviewers took precautions to avoid seeing 
their Ss’ autokinetic tracings to prevent unconsciously 
influencing Ss’ remarks in the interview which imme- 
diately followed the autokinetic test, and, of course, 
to avoid influencing their ratings for reality closeness 
and phantasy embellishment. Interrater reliability in 
scoring the two independent variables was satisfac- 
tory. The median reliability coefficient for phantasy 
was .78 for all raters and .85 for the two senior 
raters, The median reliability coefficient for reality 
closeness was .68 for all raters and .77 for the 
senior raters. 


RESULTS 


First Prediction: The Relationship of 
Autokinesis to Reality Closeness 


The pilot study and both replication studies 
showed that it is, in fact, true that the more 
oblivious one becomes to his concrete sur- 
roundings in the autokinetic test situation, 
the more autokinetic movement he is likely 
to see (Table 1). In the pilot group of 48 Ss, 
the correlations between autokinesis and 
reality closeness ranged from .53 when rated 
by the least experienced student rater to .77 
when rated by one of the authors. All cor- 
relations were significant at far better than 
the p < .001 level of confidence. 


TABLE 1 
RELATIONSHIP BETWEEN AUTOKINESIS AND REALITY CLOSENESS 


All cases combined 
Pilot stud First replication Second replication x 
(N 2a) (N = 31) (N 2160) (N = 139) 
Rater Rater Pearson r Rater Pearson r Rater Pearson r 
i ** T 
First author (MM) Wm Uh Len 


Second author (HV) 
Student rater PL* 
Student rater MK* 


aN = 45, 
» Rerated by MM a year later 
Correction was .79. 


p <.01, 
** > «001. 


to determine repeat reliability of reality-closeness ratings. Re] 


peat reliability without Yates 
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TABLE 2 
RELATIONSHIP BETWEEN PHANTASY AND REALITY CLOSENESS 


d replicati All Ss combined 
P ir Ppa coucaton ed. rope ion en ME 
Rater Pearson r Rater Pearson y Rater Pearson r Rater Pearson r 
First author (MM) —.38* MM; 25,81*9* MM 09 MM — Ape 
Second Repas (HV) —.32* MM." pp HV e i ad HV — 37% 
HV ume 


^ Rerated by MM a year later to determine repeat reliability of the phantasy ratings. Repeat reliability without Yates cor- 


rection was .88. 


In the first replication, the 31 Ss who were 
tested and interviewed by a relatively inex- 
perienced graduate student yielded correla- 
tions between autokinesis and reality closeness 
which ranged .50-.59 (p < .001). It is note- 
worthy that the highest correlations did not 
come from the rater who had done the inter- 
viewing and excerpting but were obtained by 
the two senior raters who had had no contact 
with Ss or with any potentially biasing infor- 
mation in the interviews. 

In the second replication study (N — 60) 
in which the data were gathered and collated 
by a relatively unsophisticated, unskilled in- 
terviewer, the correlations were more modest 
(41 and .46) but still significant at better 
than the p< 001 level. The hypothesized 
relationship of autokinetic movement to de- 
gree of reality closeness was strong enough to 
overcome measurement errors attributable to 
interviewer inexperience, 

The relationship between autokinesis and 
reality closeness for the entire group of 
139 Ss combined yielded correlation co- 
efficients in the mid-50s for the two raters 
who had rated all 139 Ss (.53 for one and 
.57 for the other). 

The prediction that autokinesis would be 
highly correlated with reality closeness in the 
autokinetic situation was amply supported by 
these findings, 


Second Prediction: The Relationship of 
Phantasy to Reality Closeness and A utokinesis 


The prediction that phantasy emergence 
will be positively correlated with autokinesis 
and negatively correlated with reality close- 
ness was similarly confirmed in the pilot 
study and each of the two replication studies, 


Phantasy emergence in the autokinetic ex- 
perience was closely related to loss of reality 
anchorage in the test situation (Table 2). 
The low correlations of the pilot study (—.32 
and —.38, p< 05) were attributable to oc- 
casional insufficiency of interview data about 
phantasy in the autokinetic experience. The 
relationship between the two variables became 
more pronounced when this deficiency was 
corrected in the replication studies. 

In the first replication, the correlation be- 
tween phantasy and reality closeness ranged 
from —.55 to —.81 (Table 2). All relation- 
ships were in the predicted directions and 
were significant at, or close to, the p < .001 
level. In the second replication (V = 57), the 
correlations ranged from —.41 to —.70. The 
correlations for all Ss combined (N = 132), 
using the more modest MM; (first author) 
data of the first replication study,’ were —47 
and —.37, both highly significant. i 

The relationship of phantasy to autokinesis 
is shown in Table 3. The more autokinetic 
movement seen, the more elaborate, vivid, and 
compelling was the phantasy evoked in the 
autokinetic situation. 

In the pilot study, (N = 44), the correla- 
tions between autokinesis and phantasy were 
44 and .39 when rated by the two senior 
raters, significant at the p< .01 level. Cor- 
relations were not statistically significant for 
the less experienced raters due to a systematic 
error in their understanding of the criteria 
Which was corrected in the replication studies. 


SMM; ratings were done by the first author 
Shortly after the first replication interviews were 
Completed and excerpted by MK, one of the er 
raters, The MM, ratings were done by the firs! 
author a year later to measure repeat reliability. 
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TABLE 3 
RELATIONSHIP BETWEEN ÁUTOKINESIS AND PHANTASY 


Pilot study First replicatie 
APR x PU. E Second, renin All ee conne 
Rater Pearson r Rater Pearson r Rater Pearson r Rater Pearson r 
First author (MM) Aa MMi .56** MM Apte MM 219e 
Second author (HV) .39* MM: .61** HV 45** HV .48** 
Student rater PL^ 23 HV .64** 
Student rater MK^ 25 MK :52* 
aN = 42, 
* 5 «0l. 
** p «001. 


In the first replication study (W = 31), cor- 
relations range .52—.64, significant at better 
than the p< .001 confidence level. Correla- 
tions in the second replication study (WV = 57) 
were .49 and .45, both at the same high order 
of significance. The correlations for all 132 Ss 
combined were .51 and .48, significant at far 
better than the p < .001 level of confidence. 

It should be noted that seven interviews, 
four in the pilot study and three in the second 
replication, failed to include questions about 
phantasy embellishment of the autokinetic ex- 
perience. Most of these Ss probably would 
have scored quite low on phantasy judging 
by the rest of their accounts of the autokinetic 
experience. An inspection of the data in these 
cases strongly suggests that, had these few 
interviews been conducted more carefully, the 
correlations between phantasy and autokinesis 
would probably have been even higher than 
those reported here. 

An analysis of these data to see if any of 
the results reported in the preceding para- 
graphs were attributable to sex differences or 
to differences in degree of psychopathology 
was largely negative. The authors found that 
all of the relationships reported occurred to 
a statistically significant extent in men as 
well as women, but were more pronounced (or 
at least more easily demonstrated) in women. 
This may reflect the fact that men sometimes 
seem more constrained in the autokinetic 
situation, less inclined to accept, enjoy, in- 
dulge themselves in, or perhaps admit to such 
relaxed, essentially passive states of conscious- 
ness, 

With respect to the saliency of the pre- 
dicted relationships in normals as compared 
With patients, no consistent differences were 


found. The relationships were as pronounced 
in patients as in normal Ss, despite the obvi- 
ous uneasiness which some patients showed 
when asked to describe the kinds of subjective 
experiences with which the study concerned 
itself. 


DISCUSSION 


This study was designed with two purposes 
in mind. One was to determine why the auto- 
kinetic test is so effective a measure of reality 
closeness. The other was to add to the under- 
standing of the autokinetic phenomenon. 

In previous studies, the autokinetic score 
proved to be closely related to a continuum 
of character structures and life styles. Normal 
Ss with low-autokinetic scores were typically 
characterized as outgoing, responsive, other- 
directed people with perhaps a certain histri- 
onic flair and vivid “presence.” -auto- 
kinetic people who were emotionally disturbed 
were typically described as emotionally labile, 
“alloplastic,” externalizing, and prone to rely 
more heavily on the defenses of repression, 
denial, and acting out. In contrast, people 
with high-autokinetic scores were more likely 
to be characterized as detached, intellectual- 
izing, deliberate, and inclined to isolate them- 
selves off from strong feelings and impulses. 
The more normal high-movement Ss were self- 
conscious, thoughtful, quiet but stubbornly 
independent people. The more disturbed high- 
movement Ss indulged heavily in obsessive 
rumination and schizoid phantasy. In short, 
the lower his autokinetic score, the more 
likely it is that a person lives his life in a 
close, vivid, direct, interactional involvement 
with the people and things around him. The 
higher his autokinetic score, the more likely 
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it is that a person experiences himself and his 
world with more detachment and with more 
subjectivity. 

The authors suggested that the common 
denominator in the life styles, experience 
styles, and defensive styles of the low auto- 
kinetic Ss was their “closeness” to reality, 
and the common factor in high-autokinetic 
people at the other end of the continuum was 
their greater distance or detachment from the 
compelling “pull” of their surroundings. 

What remained unexplained in these studies 
was why the autokinetic test provided so 
accurate an index of a person’s characteristic 
position on the reality-closeness-reality-dis- 
tance continuum. The present study helps fill 
this gap by showing that the reality closeness 
by which the authors previously characterized 
the adaptive styles of low-autokinetic Ss is 
demonstrable in a very concrete, almost literal 
sense in the autokinetic situation. Low-auto- 
kinetic Ss remain more focally aware of the 
concrete realities of the experimental room 
and are much less inclined to drift into phan- 
tasy or reverie than are high-autokinetic Ss. 
The amount of autokinetic movement seen by 
Ss in the present study is directly related to 
the degree to which they retained a reality- 
orienting image of the experimental room 
throughout most or all of the 10 min. of 
darkness. This conclusion is arrived at only 
inferentially here; it is given direct experi- 
mental support in another study." The con- 
clusion draws on, and lends further support 
to, frame-of-reference explanations of auto- 
kinesis reviewed by Royce et al. (1966) and 
by Reinwald (1952). 

The aforementioned formulations draw 
heavily on psychoanalytic ego psychology in 
defining qualities of reality contact, forms of 
engagement and interaction between “ego” 
and “reality.” In Rapaport’s (1958) terms, 
those people who are characteristically more 
autonomous from reality need more “stimulus 
nutriment” to keep them in touch with reality. 
They readily slip into subjective preoccupa- 
tions, phantasies, or reveries when external 
conditions are conducive to such a turn of 
mind. On the other hand, those persons who 
live their lives in close touch with objects and 


5 M. Mayman, H. M. Voth, and L. Coyne, manu- 
script in preparation, 
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events might be said to live with an excess 
of stimulus nutriment. Much of their aware- 
ness of inner psychological events is drowned 
out by the clamor of external experiences, 
For them, external reality is so compelling 
that they find it difficult to detach them- 
selves from external objects and turn attention 
to subjective experiences. 

There are two ways of viewing such indi- 
vidual differences in cognitive style. One 
might argue that Ss who cling to reality and 
prevent the emergence of phantasy do so 
defensively, that is, turn to reality in flight 
from endopsychic experiences. Similarly, one 
can hold that those who drift off have learned 
to escape from the exigencies of reality by 
withdrawing defensively into themselves. 
However, there is reason to maintain that, in 
part, perhaps in large part, individual dif- 
ferences in reality closeness-reality distance 
are not simply a product of developmentally 
acquired adaptive or defensive ego styles. 
They may derive ultimately from constitu- 
tionally based ego dispositions (Hartmann, 
1939; Rapaport, 1951, 1958). There are some 
telling data which point in this direction and 
which urgently call for confirmation. Eysenck 
and Prell’s (1951) study of autokinesis in 
identical and fraternal twins found that iden- 
tical twins correlated .73 and fraternal twins 
only .23 in the extent of autokinetic move- 
ment seen. Pending confirmation of these 
findings by replication studies, Eysenck and 
Prell’s study supports the hypothesis that 
reality closeness-reality distance may well be 
a constitutionally based ego disposition. 

These formulations about reality closeness 
and reality distance should not be confused 
with Witkin’s concepts of field dependence- 
field independence, despite their apparent 
similarities. On Witkin’s continuum, field de- 
pendence is a less desirable state than field 
independence. It indicates a degree of devel- 
opmental immaturity, of lesser “psychological 
differentiation,” less stable ego identity struc 
tures, etc. Reality closeness in the authors 
studies carries no such pejorative implications. 
Low-autokinetic Ss are not necessarily lacking 
in psychological differentiation. On the con- 
trary, many of the reality-close Ss are highly 
gifted, intelligent, competent people, with, if 
anything, a more emphatic sense of identity 
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than was found in many reality-distant Ss. 
The authors predicted, in fact, an orthogonal 
relationship between field dependence and re- 
ality closeness, a prediction which was tested 
and confirmed recently by Cancro and Voth 
(1969). One hundred and five paid, normal 
volunteers were administered Witkin’s rod- 
and-frame and Embedded Figures Tests to 
determine if these measures correlated with 
Ss’ scores on the autokinetic test. The Es 
could find no demonstrable relations, linear or 
curvilinear, between autokinesis and Witkin’s 
measures of field dependence-field indepen- 
dence. This result cannot be dismissed as a 
measurement artifact, because the rod-and- 
frame scores correlated highly with the em- 
bedded-figures scores, much as they have done 
in all of Witkin’s work with these tests. It 
seems clear that, operationally as well as 
conceptually, the Mayman-Voth explanatory 
constructs and Witkin’s are unrelated psycho- 
logical variables. 
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SELF-CONCEPT THERAPY FOR ADOLESCENT FEMALES? 
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Fourteen adolescent females referred for delinquent and acting-out behaviors 
were compared pre- and posttreatment on measures of self-concept, ideal self- 
concept, and anxiety, These Ss were exposed to a 10-wk. socialization program 
designed to produce positive changes in self-concept; however, 9 Ss were in 
a group run by community volunteers while 5 Ss were in a group run by 
mental health professionals, Self-concept changes in a nonreferred comparison 


There is considerable Support for the con- 
tention that adolescents who demonstrate dif- 
ficulty in school coupled with “delinquent” 
nonschool behavior (sexual acting out, steal- 
ing, etc.) are relatively easily described 
through the use of a “self-concept” theoretical 
framework. Many investigators have found 
that delinquent or “behaviorally disturbed” 
adolescents have a self-concept quite different 
from that of nondelinquent children (e.g., 
Curtis, 1963; Reckless, Dinitz, & Day, 1957). 
Implicit in such studies is the Suggestion that 
treatment for behaviorally deviant adolescents 
at self-concept alteration ; however, there has 
been no clear explication of a therapeutic 
mechanism for producing such change other 
than the traditional approach through long- 
term individual therapy. This study reports 
the results of a relatively restricted type of 
Broup treatment program aimed Specifically at 
producing self-concept changes in female ado- 
lescents with a record of behavior problems. 


METHOD 
Subjects 


Of an original pool of 38 adolescent females, 22 Ss 
were compared pre- and posttreatment on measures 


1 This investigation was supported, in part, 
Public Health Service Research Grant R 11-MH- 


the self-concept program, including brief case studies, 
may be obtained by writing the Executive Director, 
Southeast Wyoming Mental Health Center, Cheyenne, 
Wyoming. Write for Technical Report VII, “Com- 
munity Resources in Treatment.” 

2 Requests for reprints should be sent to Charles 
W. Cole, Department of Psychology, Colorado State 
University, Fort Collins, Colorado 80521. 


of self-concept and anxiety. These Ss ranged in age 
13-17 and were located in three comparison groups: ^ 


5 Ss (Group P) were in a group therapy experience 
run by professional mental health workers from ‘the 
Southeast Wyoming Mental Health Center, Cheyenne, 
Wyoming; 9 Ss (Group Q) were in a group of 
similar intent and design which was run by the 
Cheyenne chapter of Quota International, a volunteer 
service organization for women; and 8 Ss (Group N) 
Were selected from the public schools to provide a 
“normal” comparison group. The Ss in both the pro- 
fessional and volunteer Eroups were adolescent 
females who had been referred by various treatment 
and welfare agencies as delinquent or as having less 
severe behavior problems. All were considered by 
School authorities as potential drop-outs. A large 
number of these girls had been identified as acting 
out sexually. The Ss in the nontherapy comparison 
group were similar in age and academic performance, 
but had no record of acting-out behavior, This group 
Was used to provide normative data as well as to 
Provide a control for Changes which might occur 
simply as a function of time. 


Instrumentation 


Pre- and Posttesting for the three groups was 
Performed over a 10-wk. interval. The measures used 
Were identical at both testings. The test battery 
Provided measures of the concepts “Me as I am 
today” and “My ideal-self.” An index of the evalu- 
ative attitude of each S toward these concepts was 
Provided by a scale of 20 adjective pairs presented 
in a semantic differential format. All items had been 
selected for their high loadings on the evaluative 
factor of meaning as indicated by the work of 
Osgood, Suci, and Tannenbaum (1957). Representa- 
tive pairs are Eood-bad, valuable-worthless, e 
Summing down these 20 scales yielded total "value 
score for each concept. Each of the concepts was 
rated against the same set of scales, thus allowing the 
Computation of a score representing a self-ideal dis- 
crepancy along the value dimension, a commonly 
used measure of self-regard. The Anxiety Differential 
(Alexander & Husek, 1962; Husek & Alexander, 
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1963) was used to provide a measure of situational 
anxiety. 


Procedure 


The two “experimental” therapy groups were put 
through a 10-wk. program designed to produce posi- 
tive self-image changes in the areas of personal and 
social appearance. The program was presented, not 
as “therapy” but as a program concerned with 
training in socially relevant skills. Eight major 
topics dominated the sessions: make-up, hair styling, 
exercises and diet, clothes and accessories, posture 
and general demeanor, dating etiquette, review of 
make-up and hair styling, and problems in dining 
out. Walking, dancing, the art of conversation, and 
ease in social situation were stressed during the 
weekly sessions. Volunteers in the community spoke 
to the group on many of these topics. The speakers 
were selected for their value as feminine identifica- 
tion figures. One, for example, was a local beauty- 
shop operator. In addition, both of the group leaders 
could be described as mature, relaxed, and actively 
feminine. 

The program was planned to offer three direct 
approaches to the modification of the self-concept. 
First, the individuals selected to run the groups pro- 
vided appropriate feminine role models. It was in- 
tended that these leaders would establish relation- 
ships with the girls which would permit the girls to 
incorporate some of the characteristics of the leaders 
into their own self-concepts. Second, the content of 
the program focused directly on the appearance and 
the social behavior of the girls. Much of the program 
could be conceptualized as straightforward “attrac- 
tiveness training.” The basic procedure used for 
shaping new attitudes and behaviors in the girls 
was to change their appearance (e.g., giving them a 
free beauty treatment) or, in the case of a par- 
ticular kind of behavior, to have them “role play" 
the behavior. Positive changes in either appearance 
or behavior were then reinforced through the social 
approval of the leaders and the other members of 
the group. Finally, in order to create a situation in 
which “new ways of being me” could be tried, group 
discussions were focused on the acceptance of per- 
sonal limitations, and attempts were made to develop 
an “in group” feeling in order to give the girls a 
sense of being accepted by their peers. 

It was assumed that the provision of appropriate 
feminine role models accompanied by the teaching 
of feminine role behaviors would provide a milieu 
in which positive alterations in the self-concepts of 
Ss would occur and that, by making the members 
of the group feel more personally attractive, they 
would be less likely to seek to prove their femininity 
through socially unacceptable behavior. It is, perhaps, 
apparent that such a social learning approach is 
predicated on a behavioristic conception of the 
origin of affective states. In this case, it would be 
held that the self- and related concepts are formed 
through interactive experiences with others. While 
such states may mediate subsequent behaviors, they 
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are amenable to change through the provision of a 
new environment and a corrective set of interpersonal 
interactions. 


RESULTS 


The mean selí-ideal self-concept discrepan- 
cies of all Ss were arithmetically transformed 
to eliminate negative values. The mean pre- 
test self-ideal discrepancies for the nontreated 
group of normal Ss and for the professional 
and volunteer-led groups were 37.50, 28.00, 
and 36.20, respectively. While the means of 
the three groups did not differ significantly, 
the variances obtained were quite dissimilar 
(Group N, s? = 54.50; Group P, s? = 386.56; 
Group Q, s? — 449.25). The variances of each 
of the two therapy groups were significantly 
greater than the variance of the normal group 
(Group P, F=7.09, p< .05; Group Q, 
F = 8.24, p < .01); however, the two therapy 
groups did not differ signicantly (F — 1.16). 
While the pretest anxiety scores correlated 
473 (p< .005) with the self-ideal discrep- 
ancy scores when all three groups were com- 
bined, there were no significant differences 
between the therapy and normal comparison 
groups. 

At the time of posttesting, there were still 
no significant mean differences between the 
three groups; however, contrary to the pre- 
test findings, the variances of the groups 
no longer differed significantly (Group N, 
5? — 74.36; Group P, s? — 27.20; Group Q, 
s? = 150.91). While the control group re- 
tained approximately the same range, there 
were less extreme scores in the treatment 
groups, particularly in Group P. Again, there 
were no significant differences in measured 
anxiety. In all groups, a tendency for more 
positive self-evaluation was observed. 

Although there were no significant differ- 
ences among the variances of the three groups 
at the time of posttesting, there were differing 
levels of change over the 10-wk. interval. 
Over the pretest to posttest period, the non- 
behavior-problem Ss changed very little, 
while the variance of self-ideal discrepancy 
in the volunteer-led therapy group declined 
from 449.21 to 150.91 (F — 2.98). The re- 
duction in variance for the professionally led 
therapy group was greater—changing from 
386.56 to 27.20 (F = 14.21, p < .001). Sixty- 
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two percent of the Group N Ss moved to less 
extreme self-ideal discrepancy scores, while 
38% obtained more extreme scores. In the 
combined therapy groups, however, 9396 of 
the Ss had less extreme scores on posttesting, 
with only 796 tending toward more extreme 
self-ideal discrepancy. This movement was ac- 
counted for primarily by changes in the level 
of value placed on the self-concept rather 
than by ideal self changes. In general, ex- 
tremely high self-concepts were lowered, while 
extremely low self-concepts were increased. 


DISCUSSION 


There are several investigations which sug- 
gest that the relationship between adjustment 
and self-regard is basically curvilinear (e.g., 
Block & Thomas, 1955; Chordorkoff, 1954; 
Miskimins, 1967). These studies indicate that 
maladjustment may be reflected either in very 
high or very low selí-ideal discrepancies. 
Under this assumption, it may be stated that 
both therapy groups (P and Q) were, at the 
time of pretesting, more maladjusted than 
the comparison group of nonreferred normals. 
While many of the behaviorally disturbed 
adolescent girls evidenced high self-ideal dis- 
crepancy, there were some who showed ex- 
tremely low incongruence even to the extreme 
of rating the selí- higher than the ideal self- 
concept. Block and Thomas ( 1955) and 
Zuckerman and Monashkin (1957) have pro- 
posed that the “high self-acceptor" may in 
fact be more severely disturbed than the indi- 
vidual for whom a high self-ideal discrepancy 
may be obtained. This is consistent with the 
finding of Suinn and Hill (1964), replicated 
by the present study, to the effect that anx- 
lety and self-acceptance are negatively re- 
lated. In the therapy groups, many of the 
girls were defined by their test data as ex- 
tremely defensive; none of the nonreferred 
comparison Ss fell into this classification. The 
usual interpretation is that extremely high 
self-acceptors are defensive individuals who 
find their behavioral tactics useful in minimiz- 
ing feelings of anxiety, increasing day-to-day 
personal comfort, and in avoiding the dis- 
tresses of negative self-regard. 

It should be noted that there were no ex- 
tremely small self-ideal discrepancies among 
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treated Ss at the time of the posttest. Per- 
haps the self-concept program was particu- 
larly effective in dealing with adolescent 
females with this type of problem. It would 
appear that these girls became much more 
self-reflective, open, and spontaneous. Because 
of the nonspecific nature of the measure used, 
some evidence is provided in support of the 
contention that these changes in self-concept 
were not confined simply to the areas dealt 
with in the groups (e.g. grooming and hy- 
giene)—the changes which took place were 
very likely global. If so, generalization to 
behaviors in the extratherapy setting should 
have been enhanced. 

The instrument used in assessing the self- 
and ideal self-concepts was not designed to 
yield a general self-description, but rather to 
provide a measure of the level of value which 
each S associated with either of these con- 
cepts. Therefore, it is difficult to say whether 
Ss in the therapy groups saw themselves or 
their ideal selves as having different charac- 
teristics than did the comparison group of 
nondelinquent females. A similar score would 
mean, however, that they attached the same 
level of value to the concept. 

Perhaps the level of self-value and the self- 
ideal discrepancy are critical to adjustive be- 
havior. For at least the 6-mo. period follow- 
ing the self-concept program, none of the 
girls was reported to be in serious trouble; 
none had dropped out of school for either 
disciplinary or academic reasons. 

It cannot be said with any certainty that 
the critical factor in this apparent behavioral 
improvement was the “normalization” of the 
self—ideal-self relationship, but whatever the 
case, it may be concluded that there is con- 
siderable evidence for the efficacy of both 
Professional and volunteer-led programs in 
producing positive changes in the self-concepts 
of behaviorally disturbed adolescent females. 
The Ss in the self-concept program provided, 
at termination, levels of self-regard more typi- 
cal of normal adolescents, while changes in the 
nonproblem comparison group appeared to be 
nearly random. Furthermore, this change was 
accomplished by a direct program concen- 
trating on the learning of personal and social 
skills. It is also likely that the opportunity 
for constant interaction with highly adequate 
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feminine role models was a major ingredient 
in the program’s success. An advantage of 
such a program is its high face validity. The 
girls did not feel that they were in “treat- 
ment”; yet, there is strong evidence that they 
received many of the benefits usually associ- 
ated with traditional therapeutic approaches. 

While it is apparent that both of the 
therapy groups were effective to some degree, 
there is a question as to the differential ef- 
fectiveness of the two. There is some evidence 
that more marked self-concept changes were 
produced in the group led by mental health 
professionals. It is possible that, although the 
two programs were ostensibly identical, the 
volunteers may differ in basic attitudes about 
what they are doing. There is some evidence, 
for example, that they “lectured” more, and 
“discussed” less. Whatever, the reason, the 
practical aspects of the self-concept program 
should be considered. Trained professionals 
are harder to obtain for work such as this 
and, of course, must be paid, whereas the use 
of community volunteers provides a much 
larger pool of persons to conduct self-concept 
groups along with a considerable reduction in 
expense. Furthermore, certain adolescents may 
be equally benefitted by either professional 
or volunteer-led groups. A selection process 
which would provide a pretreatment rationale 
for group assignment seems highly desirable 
as a way of providing for the most effective 
use of available mental health personnel. 
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SUBJECTIVE MATCHING OF ANXIETY TO 
INTENSITIES OF WHITE NOISE 


RICHARD SULLIVAN? 
New York University Medical Center 


Twenty Ss were asked to match intensities of white noise to levels of experi- 
enced anxiety under stress and nonstress conditions, Anxiety measures were 
obtained through the use of a rating scale. The Ss were also asked to judge a 
minimal aversion threshold (MAT) of white noise and to rate a level of anxiety 
which matched the aversiveness of the noise. Anxiety and noise matchings indi- 
cated relatively lawful agreement during stress and nonstress conditions, It was 
found that at only one point during the stress condition did the anxiety and 
noise judgments exceed the mean subjective aversion level. The findings suggest 
that changes in the intensity of internal states may be measured through the 
use of judged matchings with changes in intensive sensory stimuli. 


Stevens’ (1959) validation of the psycho- 
physical “power law” through use of com- 
parative cross-modal judgments demonstrated 
that Ss are capable of lawfully relating the 
changes in one sensory mode to that of a 
second sensory mode, Stevens (1961) has 
also shown that “direct” methods of psycho- 
logical scaling may be used to measure the 
subjective experience of changes in the magni- 
tude of stimuli which have no related physical 
metric. The requirements of such scaling are 
that the stimulus continuum varies in experi- 
enced magnitude and can be assigned an 
ordinal rank by E according to a specified 
criterion. 

The success of Stevens’ techniques for cross- 
modal matching suggests that the subjective 
experience of anxiety may be measured 
through use of similar procedures. Anxiety 
may be considered an intensive continuum 
wherein experienced amplitudes vary relative 
to the stress of differing conditions. Tnitially, 
criteria for the assignment of ranks to dif- 
ferent conditions of stress may be based on 
the S's rated judgment of those conditions. 
However, before attempting to have Ss di- 
rectly estimate their levels of anxiety under 
different conditions it would have to be estab- 
lished that they could judge anxiety as an 
intensive continuum. A cross-modal compari- 
son of intensities of anxiety with that of 
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another intensive continuum seemed to be an 
appropriate means of obtaining that informa- 
tion. The present study was designed to 
determine whether Ss could match intensities 
of white noise to levels of experienced anxiety 
under different conditions of stress. 

A second aspect of the study dealt with Ss’ 
ability to judge a minimal aversion threshold 
(MAT) of noise and to estimate a level of 
anxiety which would match the aversiveness 
of the noise. These response measures were 
obtained on the assumption that since many 
intensive sensory continua become aversive at 
high subjective amplitudes, an origin or MAT 
could be specified above which all further 
increments in amplitude of stimulation would 
be considered aversive, Thus, if anxiety were 
to be meaningfully related to noise levels, 
when the rated anxiety was above the MAT 
level, the matching noise judgment should also 
be above the MAT established for noise. The 
Possibility that a factor of sensory aversive- 
ness may alter judgment (Ekman, Franken- 
hauser, Levander, & Mellis, 1964; Stevens, 
Carton, & Shickman, 1958; Sullivan, 1968a, 
1968b) is similar to the assumption that 
anxiety, at high levels of subjective magni- 
tude, may significantly influence subjective 
judgment, 


METHOD 


Subjects. Six female and 14 male students (ages 
19-52) in an introductory psychology course served 
as Ss, 

Apparatus. The apparatus consisted of a white 
noise generator in circuit with a key and an 8-ohm 
speaker. The speaker was placed in the center of à 
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semicircle of Ss seated in an ordinary classroom. 
The ambient noise level of the room varied 57-62 db. 
over various testing periods. White noise was pre- 
sented to Ss in all conditions over a continuous 
2-min. interval for both ascending and descending 
trials. Sound pressure levels (SPL) at various posi- 
tions in the semicircle for both ambient and experi- 
mental noise was measured by a General Radio 
sound level meter. Differences in sound levels within 
the room (maximum was 6 db, at distal sides) were 
accounted for by adjusting a given S’s judgment in 
terms of his position. The dial settings on the noise 
generator were arbitrary figures used to reduce E's 
possible bias in changing the noise levels over the 
2-min. interval of presentation. 

The anxiety rating scale consisted of a sheet of 
paper with a 7-in. line at the top having intervals 
spaced 0-7. Below the line, definitions of the intervals 
were given as follows: 0-1, no anxiety, perfectly 
relaxed, and comfortable; 1-2, little anxiety, feel 
comfortable, and pleasant; 2-3, some anxiety is 
present but not bothersome; 3-4, anxiety is present, 
not intense or bothersome; 4-5, anxiety is moderately 
intense and somewhat uncomíortable; 5-6, anxiety 
is fairly intense and unpleasant but not intolerable; 
6-7, anxiety is intense, unpleasant, and intolerable. 
The Ss in all conditions were asked to place an “X” 
on the line in a position which best described the 
anxiety they were experiencing at the time of rating. 

Procedure. In general, anxiety ratings and judg- 
ments of minimal aversion thresholds, and noise 
matchings were spread over various lecture hours 
(nonstress condition), Brevity in both ratings and 
noise judgments was crucial since Ss were to be asked 
to do both at five time intervals during the midterm 
examination (stress condition). The first half of the 
nonstress anxiety ratings, matched noise judgments, 
and minimal aversion judgments of anxiety and 
noise were obtained prior to the midterm exam; the 
second half of judgments and ratings was obtained 
between the midterm and final exam period. During 
the final exam, Ss were asked only to rate their levels 
of anxiety and were not asked to match those levels 
to intensities of white noise. 

MAT: Noise judgments. The Ss were asked to 
judge the level of white noise which they considered 
to be minimally aversive. The minimal aversion level 
of noise was defined as that intensity of noise which 
is at the point of becoming annoying or bothersome. 
The procedure for obtaining responses was to have 
all Ss close their eyes on signal and have them raise 
their hands when they felt the noise to be minimally 
aversive. An assistant of E's took down each S’s 
judgment by recording the dial setting on the noise 
generator when S raised his hand. Two trials were 
given at each of the four sessions. A trial consisted 
of a presentation of the noise at 50 db. (SPL re: 
.0002 dynes/cm?) which continuously increased over 
a 2-min. interval to an upper limit of 95 db. 

2Since no S went above 92 db. in judging the 
MAT, it was felt that 95 db. would be a suitable 
upper limit for the matching task. The possibility 
that this upper limit might have influenced judgments 
is discussed later. 
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Following a 30-sec. pause, the noise was presented 
at 95 db. and decreased over a 2-min. interval to 
the lower limit of 50 db. A 2-min. rest period was 
given between the two trials. During the descending 
series, Ss were asked to raise their hands when they 
felt that the noise was no longer bothersome, annoy- 
ing, or minimally aversive. 

MAT: Anxiety ratings. On four separate occasions, 
Ss were asked to judge their minimal aversion thresh- 
old for noise by rating the anxiety scale in terms 
of what level of anxiety they felt would be the 
equivalent of the MAT noise. 

Noise and anxiety matching: Nonstress condition. 
The Ss were requested to rate themselves on the 
anxiety scale at five different times during the four 
lecture periods (5, 30, 55, 80, and 105 min., respec- 
tively). After each rating, the sheets were collected 
and Ss were then asked to judge the intensity of 
white noise which matched the level of anxiety they 
were experiencing at that time. One ascending series 
and one descending series was given for each anxiety 
rating. 

Noise and anxiety matching: Stress condition. 
During the midterm exam period, Ss were asked at 
the five different times to rate their level of anxiety. 
Following the collection of the rating sheets, they 
judged the white noise intensity which matched their 
level of anxiety. The first rating and judgment pre- 
ceded the beginning of the exam (5-min. judgment) ; 
the fifth rating and judgment followed the comple- 
tion of the exam (105-min. judgment). The pro- 
cedure for obtaining responses was the same as 
outlined in the preceding paragraph. On a signal 
from E all Ss would pause, rate their anxiety, pass 
their sheets to E or E's assistant, close their eyes, 
and judge the noise. Two trials (one ascending and 
one descending series) were given for each anxiety 
rating. The Ss were now fairly practiced in the 
procedure and the ratings and judgments at any 
given interval took approximately 7 min. 


RESULTS 


Anxiety ratings and noise judgments ob- 
tained under the various conditions are plotted 
in Figure 1 (anxiety) and Figure 2 (noise). 
In both figures, the mean level of anxiety or 
noise which was judged to be minimally aver- 
sive in the nonstress condition is represented 
by a straight line (visual best fit) in the upper 
portion of the graph. (Mean MAT ratings for 
anxiety were 4.34, 4.55, 4.28, and 4.40; mean 
MAT judgments of noise were 85.18, 85.36, 
84.70, and 85.53 db.).* As seen in Figure 1, 
mean ratings of anxiety experienced at the 
five different times in the nonstress condition 
were relatively stable (lower curve). Mean 


8 There were no appreciable differences between 
median and mean measures in both the anxiety 
ratings and noise julgments. 
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levels of rated anxiety during both stress 
conditions (midterm and final exam periods) 
were essentially the same for each of the time 
intervals. Anxiety was rated highest prior to 
the exam (5-min. interval) and lowest at the 
completion of the exam (105-min. interval). 
A t test for matched pairs at the 105-min. 
time interval between stress and nonstress 
ratings was found to be significant (t = 3.59, 
df= 19, p < .005). 

Similar to the anxiety ratings in the non- 
Stress condition, the judgment of noise in- 
tensities matched to levels of anxiety at vari- 
ous times during the nonstress condition indi- 
cated relative stability over the 105-min. 
period (Figure 2, lower curve). During the 
Stress condition, the intensity of noise matched 
to the level of anxiety was highest prior to 
the exam (5-min. judgment) and lowest at 
the completion of the exam. (105-min. judg- 
ment). The only point at which the noise 
judgments and anxiety ratings were above the 
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MAT was prior to the exam at the 5-min, 
time interval. 

Table 1 presents a four-way analysis of 
variance of noise judgments matched to anx- 
iety levels at different times under stress and ~ 
nonstress conditions (stress versus nonstress, 
time intervals, ascending versus descending 
series, Ss), The main effect of stress was 
found to be significant (F = 35.89, df = 1/19, 
b < .001) as was the main effect of series (as- 
cending versus descending trials, F = 178.49, 
df = 1/19, p < .001). The significant Stress 
X Time interaction (F — 6.90, dj — 4/76, 
P < 001) indicated that, in terms of noise 
matching with anxiety, different levels of anx- 
lety were experienced at the different time 
intervals, The significant Stress X Series X Ss 
interaction indicated, as might be expected, 
that different Ss equated the noise intensities 
to different levels of anxiety at the differing 
times of judgment. A trend analysis of the 
noise matchings was done following the pro- 
cedure outlined in Edwards (1960) and indi- 
cated that the linearity of the slope of judg- 


MAT JUDGEMENTS 


STRESS JUDGEMENTS 
A&A MID-TERM EXAM 


Q0 NON-STRESS JUDGEMENTS 


8 


H 


82 


NOISE JUDGEMENTS IN db (re:0.0002 dynes/cm*) 


78 


105 


55 80 
TIME (MINUTES) 


Fic. 2. Mean noise judgments at five time inter- 
vals during stress (exam) and nonstress conditions 
(lecture periods). (Straight line in upper portion of 
the graph represents the mean judgment of minimal 
aversiveness of white noise.) 


ANXIETY AND INTENSITIES OF WHITE NOISE 


ments was significant (F = 27.9, df = 1/60, 
p< 001). 

A significant finding in the present study 

was that descending noise judgments were 

- higher than ascending noise judgments regard- 
less of the task (MAT judgment or anxiety 
matching). Means of the ascending and de- 
scending judgments of noise matchings at the 
five time intervals during the stress condition 
are presented in Table 2. 

The significant Stress X Time X Series in- 
teraction (F = 32.48, df = 4/76, p < .001) 
suggests that judgment variance within the as- 
cending and descending trials may have 
altered depending on the level of anxiety 
experienced by Ss at the time of judgment. 


DISCUSSION 


The findings indicate that Ss were capable 
of matching changes in the level of experi- 
enced anxiety with changes in amplitude of 
white noise. The results further indicate that 
when anxiety was judged to be above a rated 
minimal aversion level, the matching noise 
level was also above the minimal aversion 
threshold for white noise. The findings sug- 
gest, therefore, that a reasonably lawful mea- 
surement of anxiety may be obtained through 
use of cross-modal matchings of anxiety with 
other intensive continua. 

The mean level of white noise judged to be 
minimally aversive (85 db.) is approximately 
the same as that reported by Spieth (1956) 
for annoyance thresholds of Ss who imagined 
a noisy work environment while making judg- 
ments. The similarity in findings may have 
been due to the relatively high level of 


TABLE 1 


Four-Way ANALYSIS OF VARIANCE OF NOISE 
Jupcments MATCHED TO LEVELS OF ANXIETY 


Source df MS F 

Stress (A) 1 1270.5 35.89* 
Time (B) 4 197.0 13.97* 
Series (C) 1 1463.6 | 178.49* 

XB 4 161.5 6.90* 
AXC 1 109.5 43.80* 
BXC 4 16.8 1.56 
AXBXC 4 68.2 32.48* 
AXBXCXD 76 24 


.. Note.—Stress versus nonstress; five time intervals in ascend- 
ing and descending series. Components of the variance analysis 
or three fixed and one random variable was done according to 
d by Hicks (1964). 

< 001. 
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TABLE 2 


MEAN ASCENDING AND DESCENDING NOISE 
JUDGMENTS AT Five INTERVALS IN THE 
Srress CONDITION 


Time interval (in min.) 


Trial 
5 30 55 80 105 
Ascending (db.) 86.5 | 82.5 | 81.1 | 79.9 | 78.3 
Descending (db.) | 89.2 | 86.1 | 84.4 | 83.8 | 82.0 


ambient noise (57-62 db.) in the classroom 
while judgments were made by the present 
Ss. Other factors which may have contributed 
to the 85-db. MAT were adaptation changes 
occurring due to the continuous presentation 
of noise and anchoring effects. In a related 
study of the effect of adaptation on MAT 
noise judgments, Sullivan, Warren, and 
Dabice used adaptation levels of 70, 85, 95, 
and 110 db. Their findings indicated that the 
series effect (ascending versus descending 
trials) had a greater influence on noise judg- 
ments than the adaptation effect. The differ- 
ence between ascending and descending judg- 
ments in the present study may be due to Ss 
having made relative judgments on the basis 
of the upper and lower anchoring noise levels. 
Anchoring effects seem also to have influenced 
anxiety ratings. Thus, while most Ss reported 
great relief from anxiety at the end of the 
stress period, their anxiety ratings were still 
significantly higher (¢ = 3.59, df = 19, p< 
.005) than the ratings given at the 105-min. 
interval in the nonstress period. 

The procedure of instruction and practice 
appears to have had an important influence 
on the present results since most Ss expressed 
a preference for judging noise tolerance rather 
than noise matching. This preference seems to 
reflect the possibility of a summative effect of 
noise and anxiety in interfering with task per- 
formance, The presentation of an aversive 
noise may make an anxiety level appear sub- 
jectively more intense or an aversive level of 
anxiety may make a noise appear more in- 
tense. In a review of the effects of noise on 
behavior, Broadbent (1957) suggested that 
high anxious individuals have a low tolerance 
for noise. The implication of Broadbent's sug- 
gestion is that judgments of the aversiveness 
of noise stimuli are dependent on the level of 
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an S’s anxiety. However, other variables such 
as previous experience with different noise 
environments (Spieth, 1956), the individual’s 
subjective definition of aversiveness (Wolff, 
1964), and whether the noise is intrinsic or 
extrinsic to the task would all appear to in- 
fluence the relationship between noise toler- 
ance and anxiety. Using either the anxiety 
ratings or the matching noise judgments of 
minimal aversion as a criterion, it appears 
that task involvement reduced the level of 
anxiety experienced by Ss (5- versus 30-min. 
judgments in the stress condition). Task in- 
volvement also appeared to influence judg- 
ments of the annoyance of noise stimuli 
(Kryter, 1966). 

Although the distributions of anxiety ratings 
appeared to be normal, no inferences can be 
made with regard to the validity of the scale. 
A more appropriate measure of anxiety ratings 
may have been obtained through use of the 
method of successive interval scaling (Guil- 
ford, 1954; Torgerson, 1958). Changes in the 
rating styles during stress and nonstress con- 
ditions could then have been analyzed. How- 
ever, under conditions where the subjective 
state alters considerably over brief periods 
of time, direct methods of estimation would 
seem to be a more practical means of mea- 
surement, There is evidence which suggests a 
lawful relationship between interval and ratio 
forms of scaling procedures (Ekman & Kiin- 
napas, 1962; McGill, 1960). Until that rela- 
tionship is shown to be valid, Stevens’ (1961) 
criticism of the use of category scales to mea- 
sure intensive continua would suggest that 
such measurements may not apply to anxiety. 
Thus, the use of direct methods to measure 
anxiety may be a more appropriate and prac- 
tical procedure than the use of category-type 
scaling techniques. However, even the direct 
methods of measurement may not apply when 
stimulus intensities are within an aversive 
range of stimulation. It would, therefore, have 
to be determined that judgments of aversive 
stimuli have dispersion characteristics similar 
to those of nonaversive stimulus ranges, 

Whether or not anxiety can be directly 
related to magnitude changes in intensive 
sensory continua, it does appear possible to 
monitor changes in anxiety states by use of 


| 
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judged variations in matchings to intensive 
continua. The same procedure would seem 
to apply to other subjective states, such as 
clinical pain, hunger, thirst, and fatigue. 
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PSYCHOLOGY, METAPSYCHOLOGY, AND 
PSYCHOANALYSIS 
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Recent critical examination of psychoanalytic theory has revealed important 
inadequacies in the energy formulations and in the metapsychology in general. 
Several authors have suggested that psychoanalytic investigation should be 
concerned with the purpose or meaning of behavior rather than with the 
construction of impersonal causal models. It is argued in the present article that 
both purposive and causal perspectives on behavior are essential to a mature 
psychology. Focus upon the meaning of behavior may be an efficient strategy 
for using the unique data of the psychoanalytic hour, but, for other data, 
process descriptions often seem more useful. The role of energy constructs is 
discussed. It is concluded that the lack of measures in psychology quantifiable 
on an interval scale prevents energy constructs from having the utility they do 
in physics, but that energy constructs have served to highlight important issues 
of capacity and limits that must be considered by any meaningful alternative 


formulation. 


Perhaps the most encouraging trend in 
psychoanalytic theory since the death of 
Freud has been the tendency to dismantle 
the iron curtain surrounding its brilliant in- 
sights and enabling fruitful intercourse and in- 
tegrative merger between the observations and 
viewpoints of psychoanalysts and those of 
academic psychologists. Such integrative ef- 
forts are essential to counter the narrowed 
perspective and narrowed definition of prob- 
lems that result from the limited sources of 
data upon which theories in psychology are 
generally built. Thus, the structure of psycho- 
analytic theory, the nature of its assumptions 
and emphases, derives not only from the na- 
ture of Freud’s personality and training 
(Amacher, 1965; Holt, 1965), but also from 
the fact that its primary data source is the 
free associations of neurotic patients. Other 
theories in psychology place similarly heavy 
emphasis on the bar pressing of rats, the 
problem solving of college students, etc., and 
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such emphasis cannot help but constrict the 
range of questions asked and models pro- 
posed. 

In the course of the development of psycho- 
analytic theory, it became the hope of Freud 
and his followers that psychoanalytic theory 
would became a general psychology, that it 
would account not only for neurotic phenom- 
ena but for all phenomena in the realm of 
psychological science. An almost inevitable 
result of such efforts will be a change in the 
structure of psychoanalytic theory. As the 
data base of the theory expands, as it at- 
tempts to encompass new observations, to 
include, for example, the data of laboratory 
experiments, the theoretical concepts that tie 
together the observations will require revision. 
This has been the case generally in the history 
of science, and even in the history of psycho- 
analytic theory itself. Thus, as psychoanalytic 
observation began to focus upon the mecha- 
nisms of defense, the topographic model of the 
conscious, preconscious, and unconscious sys- 
tems (Freud, 1959) had to be replaced by the 
structural model of ego, id, and superego; the 
former model was adequate to account for 
the early observations but could not survive 
new data. One can imagine that efforts to 
expand the explanatory realm of psychoana- 
lytic theory may one day become so successful 
that the original theory will disappear of its 
own success. To the extent that the theory 
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truly becomes a general psychology it will 
cease to be psychoanalytic theory, and will 
tend to be simply psychological theory. The 
total disappearance of separate theories in 
psychology and the emergence of a single psy- 
chological theory free of the parochial origins 
of its parts is of course an asymptote from 
which we are still quite distant, but it is im- 
portant to keep the emerging goal in mind, for 
the goal determines what is the essential 
product of psychoanalytic research. 

Freud was far less reverent toward his theo- 
retical constructions than are many of his 
followers. He recognized (Freud, 1949) quite 
clearly that it is the facts discovered by psy- 
choanalytic work that will stand as his major 
achievement, rather than the theories. The 
latter were merely ingenious working prin- 
ciples to tie together the data that had 
emerged up until then and to point toward 
new observations. Freud’s discoveries regard- 
ing unconscious motivation, the expressive 
meaning of dreams, and the organization of 
primitive thought will remain valuable long 
after the words ego, id, or libido have become 
obsolete. Concern with the distinction between 
psychoanalytic data and the theory used to 
account for it has led to increasing critical 
discussion of the basic theoretical assumptions 
of psychoanalytic theory in recent years. 
That aspect of the theory that Freud called 
his metapsychology has come under the most 
critical scrutiny. Some of this criticism has 
been helpful in clarifying assumptions and 
opening the way for theoretical formulation 
more suited to our present level of knowledge, 
but blanket dismissal of all metapsychological 
perspectives as “reductionistic” (Loevinger, 
1966) may obscure some important issues. In 
particular, it will be suggested below that the 
energy constructs of Freudian metapsychol- 
ogy, while clearly in need of drastic revision 
and perhaps even amputation, have served to 
direct researchers to an important realm of 
questions, Without careful consideration of 
the role of energy constructs and other im- 
personal process descriptions in the theory, 
there is a danger that a poor formulation may 
be replaced not by a more adequate one, but 
by a reluctance to encompass the observations 
that led to the formulation in the first place. 
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MEANING AND METAPSYCHOLOGY 


Psychoanalytic theory is characterized by 
two rather separate theoretical approaches, 
The clinical theory consists of “inferences of 
directional gradients in behavior, and of the 
object-relations involved in these directions 
[Klein, 1966, p. 10]”; it represents a search 
for meaning and purpose in behavior. Psycho- 
analytic metapsychology, on the other hand, 
reflects Freud’s view that purposive concepts 
are scientifically unacceptable. It is a mechani- 
cal quasi-thermodynamic model in the lan- 
guage of forces and energies. Klein (1966) 
has called it a theory to explain a theory, and 
has argued that, unlike the clinical proposi- 
tions, metapsychological concepts are “im- 
pervious to the type of data obtainable from 
the psychoanalyst’s hour [p. 15].” 

Tension between a humanistic and a mech- 
anistic view of man, between the sense that 
man is a choosing responsible moral agent and 
the idea that human actions are as imperson- 
ally determined as the motion of the planets, 
is woven tightly into the fabric of psycho- 
analysis. One could not expect such tension 
to be absent from any meaningful psychology, 
for the paradox of human choice has been a 
central problem in the study of man for many 
centuries. Klein (1966) has noted that the 
impersonal, metapsychological perspective on 
human behavior has been fallaciously put 
forth as more basic than the purposive ex- 
planations of the clinical theory. To under- 
stand a man’s strivings and feelings, to know 
him from the inside, is somehow regarded as 
inferior to observing him as an object in the 
most literal sense. 

Questions of freedom and determinism in 
Psychoanalysis do not directly parallel the 
conflict between the clinical and metapsycho- 
logical perspectives, Much of Freud’s argu- 
ment for a determinist position was in the 
service of building the clinical theory, of dem- 
onstrating that seemingly meaningless events 
such as dreams and slips of the tongue could 
be given meaning if the determining chain of 
associations were followed back to its source. 
Nonetheless, the role of metapsychological 
theorizing is related to the view of human 
behavior as ultimately determined by an A 
personal “natural” process, even if “meanings 
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are a convenient description of the interven- 
ing events that occur. A brief examination of 
the psychoanalytic view of determinism and 
freedom is therefore in order. 


FREEDOM AND DETERMINISM 


Knight’s (1946) discussion of the freedom- 
determinism issue is probably as close to a 
standard psychoanalytic position on this issue 
as one can find. Knight believed that deter- 
minism was an essential assumption for a 
scientific psychology. Yet like many other 
analysts and therapists (e.g., Mazer, 1960; 
Wheelis, 1956), Knight recognized that a 
deterministic viewpoint may be used by pa- 
tients as resistance to effort at therapeutic 
change. Many patients claim that they simply 
cannot help what they are doing, that their 
behavior is determined and they have no 
choice or responsibility. Such an attitude may 
effectively inhibit necessary effort on the part 
of the patient and may function as a self-ful- 
filling prophecy that indeed results in the 
patient remaining the same. 

In dealing with this dilemma, Knight em- 
phasized the distinction between determinism 
as a construct and freedom as a subjective 
experience. For Knight, freedom is an experi- 
ence of harmony between one’s behavior and 
one’s character, a sense that one is behaving in 
one’s own best interests and is free from com- 
pulsion, anxiety, and irrational doubts; 
this kind of “freedom” is experienced only by emo- 
tionally mature, well-integrated persons, it is the 
goal sought for one's patients in psychotherapy; and 
this freedom has nothing whatever to do with free 
will as a principle governing human behavior, but is a 
Subjective experience which is itself determined 
[italics added, Knight, 1946, p. 372]. 

_ Knight claims that such freedom, while sub- 
jective, need not be spurious. 
The behavior of a well-integrated civilized person 
can be objectively assessed as “free.” Observers see 
that such a person makes ego-syntonic choices, that 
his motives are “good,” and that he is able to carry 
out what he wills to do [p. 372]. 
Nonetheless one is left with the impression 
from Knight’s paper that man’s every act and 
thought is really determined, and that free- 
dom, however precious, is finally just an 
epiphenomenon. 
à Knight is certainly correct that the term 
free will" confuses the issue. One can hardly 
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use the term without calling up visions of 
medieval theological debate. But simply to 
define free will as a subjective experience, 
causally determined in the same way as the 
experience of pain upon being burned is de- 
termined, seems to be begging the question. 
Such an approach assumes determinism with- 
out ever questioning it, and deals with the 
perplexing issue of freedom by redefining it, 
and thereby essentially dismissing it. 

Free will is a frustrating conception because 
no one is clear what is meant by "free." It is 
easy to topple the straw man who speaks of 
freedom as if it were caprice, of freely willed 
acts as those that are unrelated to a person's 
history, character, or environment. But the 
great hold that the concept of freedom has 
had on men does not stem from the belief that 
man's actions are random. It stems from the 
belief that men's choices are meaningful and 
that the agony of decision is not simply an 
interesting phenomenon, to be explained by 
causal laws external to it, but rather the cen- 
tral fact from which to launch an investiga- 
tion of man. 

In this light, it may be seen that the deter- 
minism subscribed to by Knight and other 
analysts and the freedom proclaimed, for ex- 
ample, by the existentialists are two different 
perspectives on the same set of observations. 
Both groups of theorists see men walk and 
talk and make love and war. In order to 
organize and make sense of these observations, 
the determinist views the men he observes as 
he views all other objects in the universe. 
They are to him (while he is wearing the cap 
of the determinist) ultimately understood as 
bits of moving matter whose motions are de- 
scribed in terms similar in principle (though 
far more complex and perhaps at a different 
“Jevel”) to those describing the motion of any 
other particles. From this perspective, free 
will is indeed simply a “subjective experience,” 
to be explained causally like any other 
phenomenon. 

But there is another perspective on human 
behavior, another way of organizing the data, 
which all (healthy?) determinists use in much 
of their personal lives, and which some psy- 
chologists (e.g., the phenomenologists) use in 
their professional lives as well. Starting not 
with what is similar between man and the rest 
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of the universe, but rather with what seems 
unique to man, one may view men’s actions 
as through their own eyes, one may attempt to 
construct their phenomenal world, to under- 
stand them as men who act and decide on the 
basis of what they see and feel. From this sec- 
ond perspective (one may call it a “common 
sense” perspective, though modern psychoana- 
lytic and phenomenological research have 
carried it far beyond common sense) deter- 
minism is neither right nor wrong—it is 
simply irrelevant, a different way of looking at 
things. The terms of discourse for such a per- 
spective are feeling and decision rather than 
cause and effect. 

Both perspectives on the facts of human 
behavior have their utility. As Klein (1966) 
seems to suggest in a different context, they 
are mutually incompatible ways of organizing 
and comprehending observations of human 
beings (including oneself). Perhaps (though 
it is doubtful) some day a synthesizing for- 
mulation will be created that resolves the 
contradiction and shows each perspective to 
be a special case of some general and con- 
sistent view, For the present, however, it 
seems necessary to recognize the need for both 
perspectives. No man can get along well with 
just one, and to reduce “psychological sci- 
ence" to those organizations of observations 
that employ the determinist perspective is 
simply to impoverish the realm of psychologi- 
cal inquiry. To completely exclude a determi- 
nist viewpoint is of course equally productive 
of sterility. 


CAUSAL AND PURPOSIVE PERSPECTIVES 


To the extent that metapsychology is, as 
Klein (1966) suggests, a theory to explain a 
theory, it reflects a nonrational, quasi-reli- 
gious commitment to a determinist position, 
There is false hope (or fear) in the view that 
we have not yet explained all the nuances of 
human thought, feeling, and behavior in terms 
of energies and forces. The Present author is 
in agreement with Klein that there are diffi- 
culties in principle with automatically “con- 
verting the terms of clinical observation to 
Impersonal process terms." Neurophysiologi- 
cal or information-processing models, no less 
than energy models, represent only one 
perspective on human affairs, and have no 
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greater claim to describing an ultimate truth 
than do descriptions of striving and meaning. 
Both perspectives, if used wisely, can provide 
useful insights. 

Psychoanalytic observation is tuned in on 
a unique level of coherence (Klein, 1966). 
The structure of the analytic situation facili- 
tates perception by the analyst of patterns 
of thought organization in the patient, and 
enables the analyst to notice characteristics 
of the patient’s thinking and behavior that 
might otherwise go undetected. Certain seem- 
ingly unrelated ideas may be expressed in 
close contiguity with remarkable frequency. 
Or free association may lead to the expression 
of ideas and thoughts that are rarely directly 
expressed at all in any other context. Who 
would have expected, on the basis of other 
models of observation, that free-associating 
patients would so often express incestuous 
fantasies? On the other hand, psychoanalytic 
observations also highlights gaps in expres- 
sion not generally as visible elsewhere. The 
analyst may notice that every time his pa- 
tient’s thoughts seem to be leading toward 
criticism of an authority, something happens 
to halt or deflect the progress of his associa- 
tions. Or he may notice that certain topics, 
or certain expected reactions, simply never 
come up as the patient talks about himself 
and his life day after day. 

Such observations, and the generalizations 
derived from them, are what give psycho- 
analytic thinking its special cast. Psycho- 
analysts are trained to notice when a person 
says he is trying to do one thing and con- 
sistently acts as if he were trying to do quite 
another (or more frequently some odd combi- 
nation of both). Greater knowledge of the 
full range of human strivings and of the mani- 
fold fears and compromises that bend, dis- 
guise, and complicate those strivings is the 
special fruit of psychoanalytic research. For 
knowledge of mechanism, on the other hand, 
and for the building of models based on physi- 
cal and biological analogies, the data of psy- 
choanalytic sessions are largely irrelevant. 

Klein (1966) is careful to point out that 
impersonal causal models of behavior are not 
without a place in psychology; they simply 
are a poor strategy for dealing with the par- 
ticular opportunities for observation provided 
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by the psychoanalytic session. Loevinger 
(1966), however, seems to dismiss metapsy- 
chological perspectives more categorically. She 
is concerned with finding a set of principles 
that is distinctively psychoanalytic. But what 
does it mean for a principle to be psycho- 
analytic? Does it mean that the data upon 
which it is based came originally from psy- 
choanalytc sessions? Are other sources of data 
to be dismissed, with psychoanalytic princi- 
ples remaining “pure” but of little value for 
answering the more sophisticated questions 
called for by new observations? Or is a prin- 
ciple psychoanalytic if it is in essence a theo- 
retical statement formulated by Freud or at 
least an elaboration of such? Then psycho- 
analytic theory is destined to becoming an 
interesting historical curio, to be respected 
for its eminence among early twentieth-cen- 
tury efforts, but of little interest to those 
concerned with understanding the latest ob- 
servations of human behavior. 
Psychoanalytic insights are far too im- 
portant to be so embalmed and preserved. 
The psychoanalytic tradition in psychological 
thought has been one of the most vital and 
original. Perhaps the historical accident of its 
isolation from the academic world and its 
development into a “movement” contributed 
to that vitality. But there is now danger that 
what was once a revolutionary way of think- 
ing is becoming a conservative, consolidated 
“system,” an outcome that is fostered by that 
same isolation, A psychoanalytic perspective, 
that is, a view of human behavior based 
largely on psychoanalytic observations, and 
emphasizing the role of unconscious motiva- 
tions, defenses, compromise formations, etc., 
is still vitally relevant to psychological in- 
quiry. A separate psychoanalytic theory, 
however, at least to the extent that such a 
theory is not receptive to data from non- 
Psychoanalytic sources, seems increasingly a 
Poor enterprise in which to invest effort. That 
Some metapsychological propositions are “as 
congenial to many antianalytic psychologies 
as to psychoanalysis [Loevinger, 1966, p. 
433]” is irrelevant. What matters is how con- 
genial they are to the latest data. 
» Let us consider Loevinger's solution to 
one of the most baffling theoretical problems 
of psychoanalysis [p. 437],” how an impulse 
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contributes to its own control. She claims 
that notions such as bound cathexis and 
countercathexis add little to the explanation, 
and suggests instead that “one is controlled, 
simultaneously or in turn, by one’s impulses 
and one’s parents, and one therefore needs to 
control something in return, if not one’s par- 
ents, then one’s impulses [Loevinger, 1966, p. 
437].” This may perhaps be an adequate 
account of why the developing personality 
wants to control the impulse, but it says 
nothing about why he succeeds. What is lack- 
ing in a psychology solely of meanings, as Loev- 
inger calls for, is adequate consideration of 
capacities. As did early psychoanalytic writ- 
ings, Loevinger’s suggested system implies a 
kind of motivational omnipotence, whereby it 
is sufficient to explore the motives for be- 
havior without considering the ability of the 
individual to efficiently order and execute all 
the wishes pressing for fulfillment at a given 
moment. As an example, if a person gets into 
an accident, it is certainly not always correct 
to say that he is expressing a wish to hurt 
himself. A self-destructive wish would no 
doubt lower the threshold for such events, 
but it is clear that such wishes are often kept 
from direct expression by a strong ego, and 
phenomena such as “distraction” may often 
play a more important role in accounting for 
accidents. 

Phenomena such as distraction are at the 
heart of what is missing in Loevinger's 
scheme. Korchin (1964), for example, has 
summarized a great deal of evidence sug- 
gesting that anxiety may reduce the capacity 
to “do two things at once.” The perceptual- 
cognitive effects he describes seem vitally 
relevant in understanding the kinds of be- 
havior that psychoanalysts observe. Certainly, 
the exact form that behavior in an anxious 
state takes, as well as the origin of the anxiety 
itself, is largely a matter of meanings; but 
without consideration of the constriction of 
perspective and general narrowing of cogni- 
tive organization characteristic of the anxious 
individual, our understanding of his experi- 
ence and behavior is incomplete. Similarly, 
the accounts of individual differences in the 
organization of attention attempted by Gard- 
ner, Klein, and their colleagues (e.g., Gard- 
ner, Holzman, Klein, Linton, & Spence, 
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1959) represent a psychoanalytically oriented 
approach to character style complementary 
to and consistent with Freud’s discussions of 
character in terms of libidinal organization. 
It is a strength rather than a weakness of 
such work that it is almost impossible to dis- 
tinguish that part of the conceptual under- 
pinning that comes from psychoanalytic 
theory and that which comes from other psy- 
chological approaches. 


COGNITIVE STYLE AND MEANING 


The need-in-perception studies were an ef- 
fort to counter the overly impersonal view of 
perception prevalent at the time and to high- 
light the personal meaning of the stimulus for 
the perceiver. Many of these studies both 
explicitly and implicitly borrowed from psy- 
choanalytic discoveries of the role of drive 
and wish in shaping perceptual and cognitive 
events. As Klein (1958) has pointed out, 
however, these corrective efforts themselves 
tended to be one-sided, erring in the direction 
of overemphasizing the role of drive and dis- 
regarding the structural constraints upon 
drive influence. More precisely, it was the 
recruiting or meaning-inducing activity of 
drives that was overemphasized, to the exclu- 
sion of the accommodative structures, for 
Klein’s (1958) conception of drive is a uni- 
tary one, which emphasizes the coordination 
of intentions and the means of carrying them 
out, 

One does not understand an individual very 
well if one knows only the wishes that domi- 
nate the recruitment aspects of his personality 
Structure. One must also understand his 
modes of accommodation and the hierarchic 
ordering of his wishes and cognitions. Human 
perception is only sometimes distorted, but it 
is always selective, and such selection is al- 
ways limited even if accurate, Regardless of 
what wishes are currently active, the field- 
independent individual (Witkin, Dyk, Fater- 
son, Goodenough, & Karp, 1962) will per- 
Celve and experience a different world than 
the field-dependent, the world of the sharp- 
ener will differ from that of the leveler (Gard- 
ner et al, 1959), etc. The origin of such dif- 
fering modes of accommodation may well be 
largely a matter of the meaning to an indi- 
vidual of certain ways of experiencing the 
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world.? Such characteristics of the individ- 
uals accommodative efforts, however, also 
enter into his encounters with the world as 
a kind of automatic filter that he can no 
longer readily change. Some aspects of per- 
sonality often designated as styles may better 
be thought of as capacities than as stylistic 
preferences (Wachtel, 1968b). 

The need for some impersonal process de- 
scriptions in personality theory (even for 
understanding the events of the psychoana- 
lytic hour) is clearer, however, when one con- 
siders the somewhat automatic effects of such 
variables as anxiety upon cognitive function- 
ing. To be sure, the effects of anxiety also 
are probably subject to individual differences 
related to the meaning of the anxiety-provok- 
ing stimulus. There are indications, for ex- 
ample, that anxiety due to a threat perceived 
as possible to avert seems to have different 
consequences than anxiety due to an inevita- 
ble harm (Wachtel, 1968a). Nonetheless, the 
breadth of the individual’s cognitive and at- 
tentive field does seem to be usefully regarded 
as a dependent variable in a functional rela- 
tionship between anxiety and attention (Eas- 
terbrook, 1959; Korchin, 1964; Wachtel, 
1968a), and breadth of attention is important 
to consider in understanding phenomena of 
the therapy hour. 

Consider the following clinical example: A 
female patient, in her early twenties, has a 
great stake in perceiving herself as a young 
child, though such an image is, of course, 
not experienced without considerable conflict. 
This image of herself helps to reassure her 
that she is not like her mother, who she per- 
ceives as selfish, nasty, and controlling. She 
lives in a black and white world, where to be 
a child is to be innocent, pretty, and accepta- 
ble, and to be an adult is to be dirty, vora- 
cious, and despised. Without full awareness, 
she often behaves in childish ways that call 
forth responses from others appropriate to 
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2 Shapiro (1965), for example, has vividly illus- 
trated the personal meaning of individual differences 
in cognitive style. 

* For purposes of exposition, the dynamics of the 
case must here be greatly simplified. There are, for i 
example, many indications that her identification with 
her mother is also a cherished key to power, and 
further that her mother in many respects is also 
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Viewed as a kind of ideal. | 
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dealing with someone much younger. Though 
she complains at times about such treatment, 
it seems to be quite comfortable for her. On 
several occasions, she has reported situations 
in which she was treated with greater respect 
than usual, and compliments, which she found 
gratifying, were combined with requests for 
her to assume responsibilities commensurate 
with her considerable talents. She found her 
distress on these occasions puzzling, and it 
was suggested to her that she would have 
preferred to receive a simple pat on the head 
that did not imply that she was an adult 
capable of making an adult’s contribution. On 
each of these occasions, she became quite agi- 
tated and complained that her therapist was 
talking to her as if she were a child. Similar 
complaints were made at times about her hus- 
band, always at times when he asked her to 
behave like an adult, for example, by keeping 
track of her checkbook balance. 

Her response to her therapist, as to her 
husband, obviously had great personal mean- 
ing. Threatened with the exposure of and 
threat to her wish to be a child, she shifted 
her attention from the content of the thera- 
pist’s comments to their tone; she further 
gratified her wish right in the hour by ex- 
periencing him as talking to her as if she 
were a child. But how did she manage to see 
her therapist in such a fashion? It is not suf- 
ficient merely to note that her perception 
was influenced by the pressure of strong 
wishes, for as we noted earlier there are con- 
siderable constraints upon the distorting effect 
of wishes, We must assume, I believe, that this 
patient focused upon some actual features of 
her therapist’s behavior. No doubt her gen- 
erally childlike manner called forth in her 
therapist some responses that were more ap- 
propriate to addressing a child. The rhythm, 
tone of voice, or style of phrasing must have 
provided the cues that led to her perception. 
That these aspects of the interaction, which 
meshed so well with her needs, should have be- 
come the focus of her attention is not surpris- 
ing. What is worth noting, however, is the de- 
gree to which they remained unmodified by 
other aspects of the situation. Her field of at- 
tention was largely limited to these isolated 
cues. The larger context in which they oc- 
curred, including an explicit message directly 
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opposite to her perceived meaning (i.e. you 
were being treated as an adult—people per- 
ceive you as an adult), was hardly noticed. In 
her state of arousal and anxiety, she experi- 
enced a narrower range of her therapist's be- 
havior than she normally does, and this nar- 
rowing of her field of attention enabled her to 
focus exclusively on the infantalizing aspect of 
his remarks. Clearer understanding of the ex- 
act nature of the narrowing of attention oc- 
curring in anxious states (Wachtel, 1967) 
would clarify the structural limits within which 
her motivational and perceptual efforts were 
expressed, and would suggest what efforts 
would be needed to correct and amplify her 
perception of the interpersonal situation. Un- 
derstanding of the “meaning” of infantilization 
to her is necessary but not sufficient to guide 
the therapist’s approach to such behavior. 


ROLE or ENERGY CONSTRUCTS 


The aspect of psychoanalytic metapsychol- 
ogy that has come under the most intense and 
persistent criticism has been the economic 
viewpoint (Rapaport & Gill, 1959), or energy 
theory. Holt (1967) has pointed out that de- 
spite Freud’s explicit commitment to physi- 
calistic mechanistic explanation, his concept 
of psychic energy “is a vitalistic concept in 
the sense of being similar to and influenced 
by vital force, and being to a large extent 
functionally equivalent to it. They are at the 
least historically and methodologically homol- 
ogus—buds from the same branch [pp. 24— 
25].” Holt has discussed in detail the limita- 
tions that are thereby inherent in the tradi- 
tional psychoanalytic energy concept. 

It should be noted, however, that the en- 
ergy concepts of psychoanalysis have had 
several different functions in the theory. One 
main function of the energy concept was to 
deal with the mind-body problem. Freud re- 
ferred to instincts as a borderland concept 
between mind and body; and as Holt (1967) 
points out, Freud often discussed libidinal 
energy as if it were not simply an abstract 
psychological concept but rather a form of 
physical-neural energy, transformed much as 
heat may be converted to kinetic energy. It 
is within this framework that loose meta- 
phorical thinking and  anthropomorphism 
have characterized discussion of psychic en- 
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ergy (libido seen as pushing, pressing for 
discharge, etc.). 

There has been, however, another use of 
energy concepts by psychoanalytic thinkers 
that is less subject to the difficulties noted 
above. The concept of ego energies has been 
more consistently employed as an abstract 
nonexistential construct representing the ca- 
pacity to do (psychological) work, and has 
thus played a role in theory construction 
somewhat more akin to the role of energy in 
physics. As we shall note, this use of an 
energy concept has its drawbacks too, but ex- 
amination of the utility of the concept thus 
far and of the nature of its limitations can 
pave the way for a better replacement. 

Consider the concept of attention cathexis, 
an aspect of ego energy emphasized by Rapa- 
port (1967). It is based in a general way 
upon the obvious observation that at any 
given moment attention to certain aspects of 
the inner or outer world makes awareness of 
other aspects less likely. The energy construct 
is used to coordinate reciprocities among 
psychological phenomena, relating changes in 
different parts of the system to each other, 
much as physical energy is a hypothetical 
construct simplifying the coordination of reci- 
procities among physical phenomena. If we 
divest the concept of the controversial theories 
of the origins of such energies (cf. Hartmann, 
Kris, & Loewenstein, 1949; White, 1963), we 
seem to be left with a useful scheme for relat- 
ing psychological phenomena, one that may 
lead to testable hypotheses, 

For example, central to the Psychoanalytic 
conception of defense and to the theory of 
neurosis is the idea that defenses use energy 
and require a constant effort to prevent 
the defended-against material from reaching 
awareness, Rapaport (1951) has described 


the psychoanalytic conception of conscious- 
ness as 


a matter of the distribution of attention cathexes, 
which are available only in a certain quantity. Evi- 
dence seems to suggest that these attention-cathexes 
are identical with those used in countercathecting and 
where excessive energies are required for the latter 
it limits those available for the former [p. 699]. 


Gardner et al. (1959) have used this concep- 
tion in accounting for the empirical relation 
between the cognitive Style of leveling and 
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repressive defenses, and many neurotic phe- 
nomena are also explained psychoanalytically 
by the impoverishment of the ego, which such 
diversion of energies implies (Fenichel, 1945), 
One might expect from this view that if an 
individual were defending against perceiving 
an aspect of the stimulus field, his responsive- 
ness to the rest of the field would also be 
reduced, for part of his capacity to attend is 
engaged in preventing perception of the of- 
fending stimulus.* 

Some authors, on the other hand (e.g., 
Kellner, Butters, & Wiener, 1964), have 
maintained that defenses do not involve a 
continuing struggle of the sort implied above 
but rather are a matter merely of the indi- 
vidual's reinforcement history making some 
responses more likely and others less likely. 
If this is so, then defending would not con- 
sume any of the individual's capacity to at- 
tend, and no reduction in overall responsive- 
ness to the field should be evident in those 
instances of conspicuous inattention to partic- 
ular features from which defense is inferred. 
Tn fact, since some features are avoided, the 
rest of the field would have less competition 
for the individual’s attention and should be 
more completely perceived. Defense studies 
that focus not on whether or not a particular 
brief stimulus is perceived, but rather on the 
individual's selective attention to a complex 
field through time, should be able to shed 
light on the merits of these two competing 
conceptions, and thereby on the important re- 
lation between defense on the one hand and 
character and neurosis on the other. 

It is possible, then, to use energy concepts 
to guide empirical inquiry. A fair number of 
studies have originated at least in part from 
economic concepts (e.g., Lustman, 1957; 
Schimek & Wachtel, 1968; Schwartz & Schil- 
ler, in press; Spence & Greif, 1968; Toman, 
1954; Wachtel & Blatt, 1965). The utility of 
Such concepts is considerably limited, how- 
ever, by the difference in the kinds of quanti- 

* The issue is of course far more complicated than 
can be described in this brief presentation. To main- 
tain and reinforce the defense, for example, some 
Particular aspect of the field might be more likely 
noticed when defending (eg, loving implications 
when defending against hostility), but overall aware- 


ness of nuances in all aspects of the field might still 
be reduced. 
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tative statements made by physicists and psy- 
chologists. Energy in physics is measured (in- 
directly, of course, as in psychology) on an 
interval scale. There are a precisely specifi- 
able number of footpounds of potential energy 
embodied in an object suspended above the 
earth. If that body falls, its velocity upon hit- 
ting the ground can be exactly. predicted by 
the formula 1/2 mv? (kinetic energy) = ghM 
(Potential energy). If these two numbers are 
not exactly equal, then one knows precisely 
what portion of the potential energy has been 
used in moving the resisting air particles, heat- 
ing the falling object, etc. Since the same 
“quantity” is presumed to be involved in, for 
example, mechanical, electrical, and thermal 
phenomena, a powerful set of equations can 
be devised to precisely predict changes in 
seemingly diverse phenomenal realms. There 
are even “economic” consequences in the 
more common meaning of the term, for the 
electric company can bill people for a specific 
number of kilowatt hours of energy, regard- 
less of whether they heated their house, lit 
their lamps, cooked a meal, ran electric trains, 
or brushed their teeth. 

The case is quite different in psychology. 
Here energy is measured on an ordinal scale. 
One can say only that there is more or less 
energy used in a given process. To state, for 
example, the simple functional relationship 
that the more one exhibits signs of defending 
against forbidden wishes or ideas, the less one 
will be attentive to his (internal and external) 
environment, seems to be exactly equivalent 
to the proposition that defenses use atten- 
tional energy. The formulation in energic 
terms adds nothing. This is quite different 
from the situation in physics, where numeri- 
cal specification of the quantity of energy 
involved leads to specific determination of 
other measures and to the possibility of 
checking whether all the energy in the system 
is accounted for. Physicists and engineers 
would not need an energy concept if the most 
that could be said was: The more water that 
falls on the generator, the more houses we 
can light. It is the possibility of measuring 
the energy on an interval scale that makes 
the postulation of such a quantity superior 
to simple functional description of the rela- 
tionship between two observable variables. If 
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one could actually quantify on an interval 
scale the amount of defensive activity, the 
amount of perceptual activity, the amount of 
organizing and structure formation, etc., and 
show their sum to be a constant, then psycho- 
logical energy would be an invaluable con- 
struct. The day when such measurement is 
possible, however, seems at this point a mat- 
ter for psychological science fiction. 

At our present level of knowledge, the use 
of energy constructs may do more harm than 
good. That the term is used superfluously 
does not in itself seem a serious matter; it 
can still be consistently related to observa- 
tions, and the imagery evoked by the particu- 
lar metaphor may inspire original hypotheses 
in some investigators. (Rapaport, 1967, for 
example, created a complex set of hypotheses 
that might not have occurred without an 
energy notion.) It is a serious matter, how- 
ever, if the illusion of an adequate explana- 
tion blinds investigators to the many unan- 
swered questions that remain and delays 
seeking more precise descriptions of the state 
of affairs. It is not being unfaithful to an old 
friend to ask of a theoretical conception: 
What have you done for me lately? On the 
contrary, such irreverence is essential for sci- 
entific advance. Many concepts are perfectly 
adequate at an early stage of scientific inves- 
tigation but may freeze progress if held on 
to too long. Energy statements have the ring 
of a finished system, and may perhaps best 
be held in abeyance until such a system is a 
more imminent possibility. 

If the energy construct is discarded, how- 
ever, it will be important to bear in mind the 
reasons for discarding it. In scientific en- 
deavor it is often as important why a concept 
is abandoned as whether it is. The difficulties 
inherent in the use of an energy construct at 
this stage of theoretical development should 
not obscure the need for some kind of process 
description and causal analysis in psychologi- 
cal investigation. 

The metaphor of a limited quantity of en- 
ergy that may be deployed in various Ways 
has its problems and limitations, but it does 
highlight some important issues that it is well 
to keep in mind. Tn particular, it is essential 
to explore the nature of the limits and conse- 
quences of human choices, to understand the 
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price that one pays for committing oneself to 
a course of action, a style of life, or a mode 
of perception. The economic point of view 
emphasizes the limits or boundary conditions 
of human existence, and the importance of 
its message should not be dismissed because 
of the crudity of its formulation. Loevinger 
(1966) is correct that understanding the 
meaning of psychological events is an essen- 
tial aspect of psychoanalytic research, per- 
haps the most important contribution that 
such research can make. But unless such 
understanding is integrated with a firm knowl- 
edge of the fabric of constraints within which 
meaning is given, the psychoanalytic endeavor 
is not likely to have much meaning at all. 
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The dichotomous conception of different forms of alcoholism generally has led 
to experimental results where one group was found to be homogeneous and the 
remainder, as a group, heterogeneous. However, prior research has indicated the 
existence of many such groups. A multivariate correlation grouping technique 
was performed on the clinical and validity scales of the MMPI over two large 


independent samples composed of hospitalized alcoholics. Four alcoholic types 
were identified and, from actuarial studies of the MMPI, all appeared to be 
addictive personality patterns. The most common type was characterized by 
poorly controlled anger and other open forms of emotional expression in 


frustrating situations. 


Most theoretical approaches to the classifi- 
cation of alcoholics are dichotomous (e.g., 
chronic-acute, process-reactive, etc.) and, as 
such, suffer from the common failing of any 
dichotomous explanation: If an occurrence is 
viewed from one position, events that do not 
fit the requirements of that position, by the 
very nature of the dichotomy, must be in- 
cluded in the other (or opposite) position. 
When applied to a nosological entity this 
practice generally leads to the identification 
of at least one heterogeneous group of ques- 
tionable validity. Jellinek (1952) dismissed 
the bipolar approach to classification and defi- 
nition of alcoholics. Instead, he proposed five 
different types of alcoholism that, for want of 
a better and more descriptive terminology, he 
labeled with the first five letters of the Greek 
alphabet. This classification system admits as 
applicable all forms of evidence: psychologi- 
cal, physiological, sociological, biological, and 
nutritional. As such, Jellinek argues, alcohol- 
ism is a “true” disease having effects on all 
areas that comprise the total life space of 
the individual. This is the only theory of 
multiple alcoholic types that has persisted 
even though little research has been accom- 
plished that lends quantitative support to 
this theory. 

1This research was conducted, in part, through 
financial support provided by the Scientific Advisory 
Council to the Licensed Beverage Industries, Inc., 
While the senior author was at Purdue University. 

? Requests for reprints should be sent to Steven G. 
Goldstein, Department of Medical Psychology, Uni- 
versity of Oregon Medical School, 3181 S.W. Sam 
Jackson Park Road, Portland, Oregon 97201. 


The majority of previous attempts to clas- 
sify alcoholics have employed the Minnesota 
Multiphasic Personality Inventory (MMPI). 
Reviews of pertinent literature (Sutherland, 
Schroeder, & Tordella, 1950; Syme, 1957) 
have evaluated these attempts and have con- 
cluded that the lack of any homogeneity in 
score or profile patterns precluded a quanti- 
tative description of the generalized alcoholic 
personality pattern. Descriptive research with 
the MMPI using alcoholic 5s seems in agree- 
ment on only two concepts: (a) alcoholic 
MMPI profiles involve high Pd scores; and 
(b) nosologically there seems to be no dis- 
tinct alcoholic group. In fact, it is probable 
that there might be several different groups 
of alcoholics. 

Traditionally, research dealing with some 
theoretical framework is designed with the 
constraints of the theory in mind. The results 
of such experiments cannot be clearly inter- 
preted, as experimental elegance becomes vic- 
tim to the demands of theory that may or 
may not have been derived originally from 
empirical tests. The research and resulting 
theory dealing with addiction do not have a 
large history of quantitative validation to ex- 
emplify the problems that arise from this 
molar-to-molecular approach. The purpose of 
this study was to attempt to discover whether 
quantitative support could be given to a hy- 
pothesis of multiple alcoholic types. This was 
accomplished through the use of a multivari- 
ate grouping procedure applied to MMPI 
profiles obtained from a total sample of 513 
male inpatient alcoholics. The major hypothe- 
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sis tested was that use of the proposed meth- 
odology would yield groups within a large 
subsample of this alcoholic patient sample 
that could be identified and by MMPI scale 
scores differentiated from normals. These 
groups would be reproducible upon replica- 
tion. 


METHOD 


MMPI profiles were obtained from 513 behavior- 
ally identified male alcoholics who were tested within 
5 days of admission to a state hospital, thus vitiating 
the problem of the effects of prolonged hospitaliza- 
tion on the test. All Ss had been committed to this 
state hospital under an act governing treatment of 
such persons whose alcoholic intake made them un- 
able to function without care. The Ss were assigned 
randomly to either standardization or replication 
subsamples prior to scoring. Thirty unanswered ques- 
tions were used for the cutoff criterion of S accept- 
ability. Using this criterion, 239 Ss were assigned to 
the standardization subsample, and 258 Ss were 
included in the replication subsample. Sixteen Ss 
were lost due to an excessive number of unanswered. 
questions or otherwise invalid test results, 

The basic problem was to divide N Ss into k 
subsets under the restriction that the average of all 
distances (ie. profile differences) between Ss within 
subsets be minimized and the distance between sub- 
sets be maximized. Prior research in this area re- 
viewed by Melda and Linden (1965) used distance 
functions (Nunnally, 1962; Overall, 1964), However, 
grouping procedures based upon a distance function 
would appear to have some important drawbacks 
with regard to the shape of the individual profile and 
knowledge of the elevation of that profile. While this 
procedure may be suitable in cases where these vari- 
ables are relatively unimportant, interpretation of 
the MMPI necessitates a knowledge of both shape 
and elevation. For example, an elevated Depression 
Score associated with depressed Hypochondriasis and 
Hysteria scores in an inverted V pattern (i.e., shape) 
certainly indicates a different type of person than 
one who produces what has been called the clinical 
V (Dahlstrom & Welsh, 1960). 

The procedure used in this Study was modification 
of that originally proposed by Lorr, Bishop, and 
MeNair (1965) and obviates many of the objections 
raised above. The Procedure begins with the con- 
struction of a variance-covariance matrix yielding 
an indication of elevation, From this matrix clusters 
of Ss are selected whose Profile elevation and shape 
are similar. The first cluster in this procedure is 
built around a pivot profile that has the highest 
average correlation with all cluster profiles above a 
critical limit that is defined as the lower cutting point 
(Cx). Cr generally is determined as the .05 level of 
significance for the Pearson r (taking into considera- 
tion the & variables) less one-half of the associated 
standard error for that Statistic. Retained in this 
cluster are all profiles that correlate above the Cu 
with the pivot profile and all members initially as- 
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signed to the cluster. This procedure is preferable to 
adding profiles that correlate only with the pivot 
profile, as such profiles may not correlate with other 
members of the cluster. 

An upper cutting poing (Cv) is determined, for 
example, as the .05 level of significance for the Pear- 
son r plus one-half of the standard error of that 
statistic. All profiles with an average r greater than 
the Cu are removed to prevent clusters Írom over- 
lapping. This limit also serves to eliminate profiles 
that are common to more than one cluster, Using 
the residual N a second cluster may be attempted 
following the same procedure as described for the 
first cluster, Again, all profils whose avarage r is 
above the Cu with the second cluster are removed 
and another clustering procedure initiated for the 
residual N. 

A problem occurs here; for example, a cluster 
made up of two profiles easily might destroy larger 
clusters because its average correlation with any 
given cluster might be quite high because of the 
limited sample size of the cluster. Thus a limit is set 
on the minimum number of profiles needed to form 
a cluster. It was suggested by Lorr et al. that the 
minimum number of profiles required for any cluster 
be four. 

A final check on the procedure may be assessed 
by recomputing the average correlations between and 
within clusters and deleting the few profiles that fail 
to meet the criteria specified above. 

This form of correlational clustering was applied 
independently to both standardization and replica- 
tion alcoholic substamples for k = 15 MMPI scales— 
the 3 validity scales, 10 clinical scales, Barron's (1956) 
Ego Strength Scale and Cuadra’s (1956) Control in 
Psychological Adjustment Scale. 


RESULTS AND Discussion 


Applying the correlational cluster proce- 
dure to the standardization sample of 239 
hospitalized alcoholics, four types were iden- 
tified. The member profiles of each type had 
Positive and significant average correlations 
with other profiles of that type and corre- 
lated either negatively or nonsignificantly with 
members of any of the other three types iso- 
lated. One hundred and fourteen Ss (45.42%) 
of the standardization sample were assigned 
to one of the four clusters in question. The 
group of alcoholics (N = 251) set aside for 
replication was clustered using the same cony 
straints as in the case of the standardization 
group. Again, four types were identified. One 
hundred and six Ss (42.23%) of the replica- 
tion sample were clustered by this procedure. 

It should be noted that less than one-half 
of the entire sample was included in the types 
identifed. On the basis of the reported re- 
search using this and other grouping proce- 
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TABLE 1 


Means AND STANDARD DEVIATION: 


S FOR CLUSTER STANDARDIZATION 


AND REPLICATION GROUPS ACROSS VARIABLES 


Type I Type II Type III Type IV 
MMPI 
scales Sb Re s R s R s R 
(N = 46) (N = 34) (N = 39) (N = 44) | (N =25) (N = 22) (N = 4) (N = 6) 
L 
M 6.83 6.38 3.21 2.86 3.08 2.91 4.00 4.83 
SD 2.85 1.81 1.76 1.94 1.47 1.07 1.83 2.04 
F 
M 4.59 3.97 11.08 9.27 5.60 5.14 7.25 6.17 
xD 2.01 1.73 6.93 3.08 2.35 1.93 3.30 2.32 
M 19.96 20.09 10.44 8.89 8.60 11.91 15.50 14.00 
SD 3.52 2.60 3.84 3.21 2.99 3.39 4.43 1.79 
As+.5k 
M 13.96 13.56 20.97 16.50 8.80 10.73 14.25 12.67 
D SD 3.14 3.31 541 5.71 2.45 2.62 1,26 3.45 
M 21.87 20.59 31.08 27.77 20.40 19.91 17.75 20.00 
2E 3.80 4.12 5.25 5.36 3.46 3.39 2.63 4.34 
y 
M 23.20 23.15 28.23 23.80 14.92 19.45 24.00 16.83 
SD 3.70 4.68 6.34 5.85 3.03 3.67 1.83 3.76 
Pd+-4k 
M 29.74 28.44 29.18 27.93 25.12 24.91 29.50 28.33 
up 4.04 4.08 5.02 4.55 4.12 4.01 2.65 4.80 
M 20.78 22.65 23.64 24.11 20.36 21.05 23.50 17.33 
x SD 3.69 3.46 3.83 4.67 3.50 3.39 5.00 2.80 
a 
M 8.41 8.76 12.72 12.55 8.64 7.18 10.75 8.83 
SD 1.88 1.99 421 3.05 3.15 2.50 1.26 3.25 
Pi Ak 
M 26.52 25.26 37.79 33.59 24.60 23.64 28.25 2547 
SD 4.16 3.01 6.86 5.50 4.51 4.10 4.27 1.94 
Sc+-1k 
M 24.83 24.18 35.82 30.48 20.84 20.23 27.00 2547 
SD 4.27 3.57 10.24 6.23 5.60 3.45 2.16 2.64 
Ma+.2k 
M 17.67 17.06 21.85 21.27 20.88 22.23 25.50 23.50 
2 2.63 2.99 5.33 3.48 3.55 3.13 1.29 3.21 
D 
M 22.61 21.29 37.92 35.48 32.28 24.00 24.00 26.83 
d 5.64 5.84 6.55 6.14 5.14 747 3.46 4.07 
M 46.00 45.97 3241 35.86 42.68 47.23 41.50 41.50 
G SD 3.13 4.37 5.66 5.63 3.33 4.65 4.80 3.89 
n 
M 21.24 21.21 29.90 29.20 28.00 29.41 24.75 22.50 
SD 4.26 3.62 3.66 4.49 3.58 3.51 4.42 4.72 
* Raw score. 


bS = Cluster standardization group. 
*R = Replication group. 


dures (e.g., Lorr et al, 1965; Lorr & Rad- 
hakrishnan, 1967), this result was antici- 
pated. More of the sample could have been 
clustered had the limit for association been 
more liberal. However, lowering the critical 
values used would have increased the stan- 
dard error of the mean obtained. Thus, the 
final decision of what critical value to use 
Was a compromise between the number of Ss 


included in four clusters identified and the 
size of the standard error of the mean. This 
problem becomes more acute as the number 
of variables under consideration increases. 
For this study it was felt that because the 
similarity of profile shape was important for 
profile interpretation, the critical level for 
association should be set at a relatively high 
level. 
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TABLE 2 


SINGLE CLASSIFICATION ANOVA SYNOPSES FOR 
STANDARDIZATION AND REPLICATION SAMPLES 
OF CLUSTER ÁNALYsIS 


Standardization sample| Replication sample 
MMPI 
scale 
F » F » 
E — 

L 3131 | .0005 |'2924 | .0005 
F 14.62 | .0005 | 25.11 | ‘0005 
K 80.58 | “0005 | 8442 | .0005 
Hs 4591 | (0005 | 8.76 | .0005 
D 43.73 | 10005 | 1935 | ‘000s 

Hy 36.68 -0005 5.28 .005 

Pd 6.10 .001 5.08 .005 
Mf 606 | :001 6.84 | .0005 
Pa 13.59 | .0005 | 18.50 | 0005 
Pt 42.75 | 0005 | 3603 | ‘0005 
Sc 2736 | .0005 | 2607 | 10005 
Ma 8.53 | .0005 | 1202 | ‘000s 
Si 55.62 | .0005 | 40.63 | ‘0005 
Es 70.77 | 0005 | 36.09 | ‘0005 
Cn 48.77 | 0005 | 29.00 | ‘0005 


^ Fos = 4,50, df = 3/120; F w = 5.79, df = 3/120; F sms 
= 6.34, df = 3/120, 


Table 1 shows the means and standard de- 
viations for the four standardization and 
replication types. These types are profiled in 
Figures 1-4. Single classification analyses of 
variance (ANOVAs) were performed across 
the four types considering 15 variables for 
both subsamples. The results of the analyses 
summarized in Table 2 indicated that there 
were significant differences between cluster 
means across all variables. 

Newman-Keuls procedures for the identifi- 
cation of significant differences between in- 
dividual cluster means were performed. The 
results of these tests indicated that while sta- 
tistically there were significant differences 
present, these differences were not consistent 
for types across variables, As discussed below, 
the four types identified have been listed sepa- 
rately in volumes of MMPI Profiles (e.g., 
Gilberstadt & Duker, 1965; Hathaway & 
Meehl, 1951; Marks & Seeman, 1963) and 
can be described in behavioral terms as being 
distinct from one another. The profiles in- 
cluded within a type also satisfy the require- 
ment that the average correlation within types 
across variables deviates from zero-order cor- 
relation at or beyond the .05 level of sig- 
nificance. The lack of consistent significant 
differences among types across variables may 
be viewed as a difficulty inherent in MMPI 
profile data. Changes in Profile shape and 
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elevation tend to be confounded through the l 
use of such devices as the Welsh code (Dahl- 
strom & Welsh, 1960) and thus prevent the 
production of consistent statistically signifi- 
cant differences when one variable at a time 
is tested because of shifts in the positions of 
ordered means and the intersection of profiles 
on a given variable. Interpretation of the 
mean profiles as types must be made through 
the interpretation of the profile as a whole 
and not through the use of individual vari- 
ables. 

The four alcoholic types identified have dis- 
tinct dynamic descriptions based on actuarial 
studies. 


Type I 


Both Gilberstadt and Duker (1965) and 
Hathaway and Meehl (1951) list this profile 
type. Persons presenting such a profile are 
described as exhibiting a personality trait dis- 
turbance (emotionally unstable personality) 
whose most outstanding trait is a poorly 
controlled anger that frequently results in 
“temper tantrums” and other overt forms of 
emotional expression when presented with 
frustrating situations, Prior to the revision of 
the American Psychiatric Association’s Diag- 
nostic Manual, this profile type was commonly 
associated with a diagnosis of psychopathic 
personality, emotional instability. Preu (1944) 
has pointed out that one of the indications 
of such a personality disturbance is drug 
addiction and/or chronic alcoholism. 


Type II 


An approximation to this profile type is 
noted in all three actuarial volumes. Gen- 
erally, the prime nosological term applied to 
Persons presenting such a profile is psycho- 
neurosis. The qualifying description is either 
anxiety reaction or reactive depression. In 
either case this appears to be a severe, chronic 
alcoholism associated with a great number of 
Somatic complaints. Suicidal ideation is evi- 
dent in the large majority of these cases as 
well as difficulty in marital relationships char- 
acterized by a dominating wife who allows à 
dependency relationship to become extreme. 


Type III 


Only Hathaway and Meehl (1951) list 
this profile. They regard this type, however, 
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Fic. 1. Standardization and replication profiles for Cluster Type I over MMPI scales, 


to be clearly alcoholic and this is their prime 
diagnosis for such profiles. Secondary to this 
is the psychopathic personality, mixed-type, 
classification. Generally, the person presenting 
such a profile has a long history of chronic 
alcoholism interspersed with acute alcoholic 
episodes. He generally exhibits a history of 
sporadic hospital admissions followed by un- 
successful attempts at abstention. Length of 
stay in a hospital is generally short and is 
marked by the patient’s feeling of no need 
for treatment and an uncooperative patient 
attitude. Discharge prognosis is usually poor. 


Type IV 


This code type bears the primary diagnostic 
classification of alcoholism with secondary 
characteristics of drug addiction and para- 
noid features (Hathaway & Meehl, 1951). 
The person presenting such a profile has used 
alcohol most of his life in amounts that would 
be considered excessive in terms of United 
States normative data. His alcoholic episodes 
are marked by the addiction of drugs (such 
as barbiturates) and the substitution of 
liquids with alcoholic content (such as terpin 
hydrate or aftershave lotion) for the more 
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standard alcoholic beverages. Persons produc- 
ing a profile such as this have been able to 
stop drinking completely for indeterminate 
periods of time and, during these “dry spells,” 
have been known to lead normal and responsi- 
ble lives, 


ConcLusIoNs 


research has been that a group of persons 


exhibiting addictive behavior were grossly 
similar only in overt behavioral symptom- 
ology. Further, it was believed that the under- 
lying personality dynamics could vary de- 
pending upon the disorder for which the 
addictive behavior may have been sympto- 
matic, It would appear that this contention 
has some validity insofar as a group of hos- 
pitalized alcoholics was concerned. The dy- 
namics inferred from the MMPI scale scores 
of persons constituting each of the four alco- 
holic types identified suggested that such 
types well might benefit from differential treat- 
ment programs designed for the specific type 
of alcoholic rather than to provide a general 
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Fic. 3. Standardization and replication profiles for Cluster Type III over MMPI scales. 


program for the treatment of all persons 
called alcoholics. 

Independent attempts at further validation 
by the senior author have suggested that pro- 
files of portions of the group of Ss who were 
termed Type II degenerate over time into 
Type I profiles. The reasons for this degen- 
eration are speculative. Both types evidence 
a relative loss of control in terms of an in- 
ability to express emotional content without 
anger. The Type II profile contains other in- 
dications, which generally leads to the diag- 
nosis of some form of psychoneurosis. As these 
somatic indications respond relatively quickly 
to treatment, the obtained profile may well 


degenerate into a characterological one similar 
to the Type I profile. 

A question may be raised as to the fre- 
quency of the existence of the type of alco- 
holic called here “Type IV.” In the total 
sample of 497 cases, this type of alcoholic 
profile appeared only 10 times for a relative 
frequency of occurrence of .02. The fact that 
it occurred on two successive independent 
samples producing similar profiles would seem 
to reject a hypothesis of chance occurrence. 
On the other hand, mean profiles based upon 
such small samples generally do not demon- 
strate reliability. It is the contention of the 
authors that the successive occurence of the 
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Fro. 4, Standardization and replication profiles for Cluster Type IV over MMPI scales. 


Type IV profile is indicative of its existence 
but this profile type should be viewed and 
interpreted with caution. It might very well 
be that this profile is essentially that of the 
outpatient alcoholic. The type may exist, but 
with an increase in the size of the sample it 
may exist in a different profile shape. There 
is also the possibility that the profile is de- 
generative; that is, it is actually one of the 
other three profile types and is indicative of 
an acute dysfunction that may subside into 
another type over a relatively short period 
of time. It should also be noted that only 
45% of the standardization and replication 


of both samples were considered as residual 
profiles, 

The literature contains a number of re 
ported alcoholic patterns that did not appear 
in the results reported here, Button (1956), 
Hoyt and Sedlacek (1958), Frame and Os- 
mond (1956), and Rosen (1960), among oth- 
ers, report profiles that show high points inn 
the D, Pd, and Ma scales. While the high 
Points appear in the present results, the re- 
maining portions of the obtained profiles are 
at variance with these earlier reports. It 
might be speculated that the discrepancy in 


samples could be grouped. Thus the majority 
| 
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reports is a function of sample selection. 
However, the present authors felt that this 
explanation was not adequate to account for 
all of the discrepancies. It would appear that 
while univariate approaches used by prior re- 
searchers are sufficient to extract high point 
differences among different subgroups within 
a sample, they tend not to account for other 
more subtle profile differences. 

This study was directed at the problem of 
the identification of alcoholics. The samples 
employed consisted of persons legally com- 
mitted to a state hospital under the law in 
effect at that time (1955-57) dealing with the 
management and treatment of alcoholics. 
Even though it was felt that the time of test- 
ing (within 5 days of entry to the hospital) 
was optimal in regard to sobriety and lack of 
institutional effect on the S, the fact that the 
alcoholic process was deemed serious enough 
for committal cannot be overlooked. Thus the 
implications of the results presented here 
would seem tenuous at best in regard to out- 
patient alcoholics because of the complex set 
of interactions between the socioeconomic 
status of a given person, the severity of the 
addiction, and the strength of the basic pre- 
morbid personality structure. Holding the last 
two constructs constant, the socioeconomic 
position of a patient alone is enough to deter- 
mine his disposition in terms of horizontal 
(outpatient, day, or inpatient care) or vertical 
treatment attempts (private clinic, sanato- 
rium, or public hospital). With this in mind, 
it should be realized that the sample of alco- 
holics used in this study were most likely low 
on any socioeconomic scale and chronic in 
terms of recividist rate. Such persons are not 
the usual candidates for outpatient care. 
Therefore, it would seem that the most logical 
extension of this form of identification and 
classification would be a replication of the 
entire study on an outpatient alcoholic sample. 
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THE CLINICIAN AS AN ABERRANT ACTUARY: 


ILLUSORY CORRELATION AND THE INCOMPLETE 
SENTENCES BLANK? 


B. JAMES STARR ? anp EDWARD S. KATKIN 
State University of New York at Buffalo 


An experiment was conducted in which 60 Ss observed five completed sentences 
paired equally frequently with each of five statements purporting to describe 
the major problem of the patient who had completed the paired sentence. The Ss 
were graduate students in clinical psychology (CGS), graduate students in non- 
clinical areas of psychology (NCGS), and undergraduate students (US). All 


groups reported illusory correlations with paralleled relationships generated by 
a group of experienced clinical psychologists, The groups differed with regard to 
the degree of confidence which they dispayed in their erroneous choices, The 
US group indicated that they were “sure” and “guessing” more often than the 
other groups, Reasons for the differential use of confidence categories by the 


three groups are discussed. 


Chapman and Chapman (1967) have 
pointed to 4 systematic error of observation 
that may explain clinicians’ continued use of 
hypotheses and signs which have failed to be 
validated by research or usage. Chapman 
and Chapman (1967) described this error, 
which they have called “illusory correlation,” 
as 


the report by an observer of a correlation between 
two classes of events which in reality (a) are not 
correlated, or (b) are correlated to a lesser extent 
than reported, or (c) are correlated in the opposite 
direction than that which is reported [p. 194]. 


Collecting data from six experiments they 
revealed that the clinical Signs commonly 
employed with the Draw-a-Person (DAP) 
test were discovered anew by naive Ss observ- 
ing data which contained zo relationship be- 
tween drawing characteristics and symptoms. 
The DAP was chosen for study because of its 


1This research was supported, in part, by Na- 
tional Institute of Dental Research Training Grant 
1T1-DE-170 and Research Grant No. MH-11989 
from the National Institute of Mental Health, 
United States Public Health Service. Much of the 
data analysis and the use of some of the equipment 
in this investigation were made possible through 
the aid of the staff and facilities of the Computing 
Center at the State University of New York at 
Buffalo. The Computing Center is partially supported 
by National Institutes of Health Grant FR-00126 and 
National Science Foundation Grant GP-7318. 

? Requests for reprints should be Sent to B. James 
Starr, Department of Psycholo, » State University 
of New York at Buffalo, Buffalo, New York 14214, 


wide usage (Sundberg, 1961), despite re 
Ported disconfirmation (Swenson, 1957) of 
many of the clinically relevant signs defined 
by Machover (1949), The viability of the 
illusory correlation which they found is at- 
tested by the fact that Ss still manifested this 
observational error even under conditions de- 
signed to greatly facilitate the reduction of 
error. 

Although the DAP is a projective tech- 
nique which has come under severe criticism, 
the Sentence Completion Test seems to have 
fared better in the hands of researchers, The 
Rotter Incomplete Sentences Blank (ISB) 
has been shown to be a reasonably valid clini- 
cal instrument (Rotter & Rafferty, 1950), and 
it has been employed profitably as a research 
tool (Goldberg, 1965). The ISB, in common 
with other sentence completion tests, may 
derive its validity from the fact that it is a 
“high-fidelity, narrowband” test (Cronbach, 
1960). Furthermore, a survey has revealed 
that the Sentence Completion Test ranked 
thirteenth in general usage and second among 
group tests employed in this country (Sund- 
berg, 1961). 

The present study represents an attempt to 
investigate illusory correlation through the 
use of completed ISB sentences paired equally 
frequently with each of a number of state- 
ments of patients’ problems. The Ss in the 
Chapman and Chapman (1967) study viewed 
stimuli consisting of pairs of symptom state- 
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ments associated with each of a number of 
drawings. Since the present study employed 
five descriptive statements paired equally 
frequently with five completed sentences, the 
two studies clearly differ with regard to the 
amount of information that an S must process. 
The Chapmans’ Ss had to relate qualities 
which they had abstracted from each of at 
least 15 drawings to pairs of symptom state- 
ments. The Ss used here had only to observe 
the same five completed sentences and the 
same five descriptive statements over and over 
again in different pairings. Thus, S proceed- 
ing through the task in the present study had 
much less new information to process. It was 
hypothesized that all groups taking part in 
this experiment (a clinically sophisticated 
group and two “unsophisticated” control 
groups) would display the illusory correlation 
effect. A further hypothesis was that the 
clinically trained group, with an expectancy 
for pairs that might “go” together and some 
confidence in the ISB, would show more con- 
fidence in their decisions. Thus, confidence 
ratings were also obtained in this study. 


METHOD 
Subjects 


Three groups of 20 Ss each, all students at the 
State University of New York at Buffalo, were em- 
ployed in the present investigation. A group of under- 
graduate students (US) participated in order to earn 
credit toward fulñlling one of the requirements of a 
course in introductory psychology. Groups of grad- 
uate students in clinical psychology (CGS) and 
graduate students in nonclinical areas of psychology 
(NCGS) also agreed to participate. 


Stimuli 


Five statements describing the problems of patients 
were paired with sentences supposedly completed by 
these patients, Both the descriptive statements and 
the completed sentences were devised especially for 
the experiment. The problem descriptions employed 
were: 


1. He has difficulty controlling his aggression. 

2. He fears taking positive action. 

3. He is disturbed about the strong sexual 
urges he feels for other men. 

4. He feels he has nothing to live for. 

5. He complains of perpetual fatigue and 
illness, 


The ISB sentence stems and their completions were: 


1. The happiest time—can never be. 
2. I regret—nothing I have done. 
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3. At bedtime—I cry myself to sleep. 

4. The best—way to avoid trouble is to do 
nothing. 

5. My greatest fear—is too bad to talk about. 


The descriptions and the sentences were typed on 
IBM punch cards. The descriptive statements were 
prefixed with the letter *D" followed by a colon, 
and the completed sentences with “S” followed by a 
colon. All possible pairs of sentences and descriptions 
were then combined and stapled together with the 
descriptions immediately below the completions. 
Twenty-five different pairs resulted. 

Two different random orders of presentation were 
generated. The final stimulus deck was constituted by 
placing the pairs in the two random orders together. 
Each stapled pair was separated from the succeeding 
pair with a pink IBM card in order to insure the 
reading of only one pair at a time. The result was 
50 pairs of descriptive statements and completed 
sentences, each pair having an equal frequency of 
occurrence in the deck (ie., twice). 


Apparatus 


Five card stackers were placed on a table in a 
small room. The stimulus decks were located to the 
right of the stackers. The instructions and prear- 
ranged signals (employed to assure synchronous 
reading by all Ss) were presented on a magnetic tape 
which had previously been recorded by E. 


Procedure 


On entering the room, a group of Ss was asked to 
seat themselves in front of the stackers. The groups 
consisted of three, four, or five persons at each test- 
ing session. They were told that during the first part 
of the experiment they would be reading through 
the decks of cards in front of them. The following 
instructions were then administered by tape re- 
corder: 


As some of you already know, the Rotter Incom- 
plete Sentences Blank is a projective test used to 
help in psychological assessment. We want to 
test your powers of observation. You will be 
shown sentences which have been completed 
by men with various emotional problems. With 
each completed sentence you will receive a 
statement describing what is, in our opinion, 
the most important problem of the man who has 
completed this sentence. Many of these men have 
the same problems. Please study each pair— 
that is, the completed sentence and the descriptive 
sentence—very carefully. When you have looked 
at all of the pairs, I am going to ask you about 
the sentences completed by men with each type 
of problem. In front of you there are decks of 
IBM cards. When you hear the word “Start,” 
remove the rubber band and the first pink card. 
Study the first pair of sentences. The cards are 
stapled together. The completed sentences are 
labeled with an S and the descriptive statements 
with a D. Do not go on until you hear the word 
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“Turn.” Then remove the next pink card and 
read the new pair. Read one pair at a time and 
do not look back at previous pairs. Are there any 
questions? 


The tape recorder was then stopped and relevant 
portions of the instructions were repeated in answer 
to any questions which might have arisen. The E 
then said: “Please regulate your reading speed in time 
with the tape recorder.” Following this E played the 
recording of the timed intervals. The Ss had 10 sec. 
in which to read each stimulus pair. 

After Ss had finished reading the stimulus decks, 
they were instructed to familiarize themselves with 
the material on the first page of forms which had 
been distributed to them. The top line of the form 
required each S to fill in his major field and year 
level. The form stated: 


Some of the sentences were completed by men 
with the following problem: He has difficulty 
controlling his aggression. Below are five com- 
pleted sentences. Choose one which was most 
often associated with the above problem. Indi- 
cate at least a first and second choice. If you feel 
that some of the sentences were associated 
equally frequently with this problem, please 
indicate this. [Immediately below these instruc- 
tions were the five completed sentences used in 
the present study. These were followed by 
further instructions.] Please rate below how sure 
you are that the above chosen sentence(s) is 
(are) representative of the data you observed 
for patients with this type of problem. 


Positive fairly sure guessing 


[After Ss had read the form, E answered their 
questions.] There are five pages to this form 
with two types of items to be answered—a 
choice of a sentence or sentences and a confidence 
rating. The confidence rating refers to your first 
choice or choices. Please be sure to answer both 
types of items on each page. 


Succeeding pages in the form referred to different 
descriptive statements, The total time for each of 
the experimental sessions was less than 30 min. A 
short debriefing session in Which Ss were told about 
the experiment and asked not to talk about it 
followed. 


Survey of Clinical Psychologists 


In order to facilitate interpretation of any illusory 
correlation which might emerge, it was necessary to 
poll a group of clinical Psychologists who were 
familiar with the ISB. The primary Purpose of this 
Survey was to ascertain whether Ss would “discover” 
the sentence-description relationships to which ex- 
perienced clinicians subscribe. The clinicians were 
given mimeographed forms with the following 
instructions at the top: 


Below are 5 sentences which you should assume 
have been completed by male patients, The 
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sentences are followed by five descriptions of the 
emotional problems of these patients, Please 
indicate which description best goes with each 
sentence. Please do not sign your name, 


Below the instructions were the five ISB completions 
and the five descriptive statements employed in this 
study. At the bottom of the form were two questions 
designed to assess the utility of the ISB for the 
respondent. A third question asked the clinician to 
indicate the number of years in which he had been 
doing diagnostic work. 

Eight doctoral clinical psychologists employed by 
the State University of New York at Buffalo partici- 
pated in this phase of the experiment. The anonymity 
of the respondent was preserved to facilitate the 
clinician’s candor (Chapman & Chapman, 1967). 
The data obtained from the clinicians remained 
unanalyzed until after the data collection in other 
parts of the study was complete. 


RESULTS 
Response Measures on Clinicians 


The clinical psychologists had a mean of 
9.25 yr. of experience involving diagnostic 
work. Completed sentences which the majority 
of the clinicians saw as being related to each 
of the descriptive statements are presented in 
Table 1. The percentage of agreement among 
members of this group is reported in the 
second column, 


Illusory Correlation in the Experimental 
Groups 


All sentence completions selected as the 
single “most frequent” correlate for a par- 
ticular description by at least one-fourth of 
the Ss indicating only one “first” choice are 
also listed in Table 1. The last column of 
Table 1 indicates the percentage agreement 
computed on the data for the combined 
groups. The first column in that table contains 
the expected Percentages under the null hy- 
pothesis. Random distribution of choices (i.e, 
observed frequencies approximating the ex- 
pected frequencies) implies an absence of 
illusory correlation. Cochran Q analyses were 
computed for the descriptive statements to 
determine if the choices were randomly dis- 
tributed among the five sentences from which 
Ss had had to pick (Siegel, 1956). All three 
groups were significantly nonrandom (i.e. 
Consistent) in their choices for the second 
and fourth descriptions. Beyond this, the 
third descriptive statement attained signifi- 
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TABLE 1 


PERCENTAGE OF CLINICIANS AND EXPERIMENTAL GROUPS REPORTING THE Most 
POPULAR CORRELATES OF THE Five DESCRIPTIVE STATEMENTS 


Groups 
fare Expected per- 
epee Clinicians* CGS» NCGS? Us* Duos uH 
Description 1: Difficulty controlling aggression 
Regrets nothing 20 78 29 33 32 31 
Avoids troubles 20 11 41* 27 21 29 
9 17 15 19 51 

Description 2: Fears taking action 

Avoids troubles 20 83 61*** 67*** 67*** (a 
8 18 15 18 51 

Description 3: Strong sexual urges 
Cries to sleep 20 0 41* 27 32 93^ 
Greatest fear 20 100 41* 33 42* 39* 
N 7 17 15 19 51 

Description 4: Nothing to live for 
Never happy 20 60 ose so** 50** 53" 
Cries to sleep 20 40 24 38 22 27 
N 10 17 16 18 51 

Description 5: Perpetual fatigue and illness 

Never happy 20 33 29 25 25 27 
Regrets nothing 20 0 35 25 20 27 
Cries to sleep 20 50 24 31 25 27 
N 6 17 16 20 53 


i , Note.— The criterion for inclusion in this table required that an item be chosen by more than 25% of all the Ss included 
in this analysis. CGS = graduate students in clinical psychology; NCGS = graduate students in nonclinical areas of psychol- 


ogy; US = undergraduate students. 


of whether or not an item had previously been used. 


* Intragroup variation of N is due to the clinicians choice of the best combination of sentence and description, irrespective 


H idee fae variation of N is due to the exclusion of Ss who had subscribed to two or more items as equally ‘‘most frequent.” 


cance for the CGS group and fell just short 
of significance (.05 < p < .10) for the US 
group. Five Q analyses were calculated from 
the data by combining the three experimental 
groups. These analyses revealed some con- 
sistency in the choice-behavior of the Ss for 
each of the descriptions. The first description 
attained a p< .01; the second, third, and 
fourth descriptions each attained a p < .001; 
and the fifth description attained a p < .02. 
Five Ss, four in the NCGS group and one in 
the CGS group, correctly maintained that the 
five sentences were associated equally fre- 
quently with each of the descriptive state- 


ments. Three other Ss, two in the CGS group 
and one in the NCGS group, detected the 
uniform frequencies with regard to one or 
more (but not all five) of the descriptions. 
All data in which two or more sentences had 
been chosen as the most frequent correlates of 
a description were excluded from considera- 
tion in subsequent statistical tests. 
Inasmuch as the majority of the Ss re- 
ported a single sentence as the most common 
correlate of a particular description, a more 
specific test of the illusory correlation hypoth- 
esis was possible. Since the response measures 
for these Ss were obtained in a forced-choice 
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TABLE 2 


PERCENTAGE OF SusjEcrs REPORTING THE Most 
POPULAR CORRELATES OF THE FIVE DESCRIPTIVE 
STATEMENTS As A First OR SECOND CHOICE 


Groups 
Item 


ccs | NCGS| US 


Description 1: Difficulty controlling aggression 


T—— 
Regrets nothing | 40 47 60 60 56* 
N 17 15 20 52 
Description 2; Fears taking action 
Avoids trouble 40 77** | 87*** | 85***| gss 
N 18 15 20 53 
Description 3: Strong sexual urges 
T— 
Greatest fear 40 65* | 60 74** | G7*** 
N 17 17 20 54 


Description 4: Nothing to live for 
eee 
Never happy 40 76** | 76** | g5***| goess 
Cries to sleep 40 65* | 76** | 55 65r 
N 17 17 20 $54 


Description 5: Perpetual fatigue and illness 


40 41 
17 


69* |45 51 
16 20 |53 


Cries to sleep 
N 


by at least. SO E the Ss as a first or second choice. The table 
uc Ose Ss who picked two sentences as “most 
frequent,” one of these being a popular correlate. See Table 1 


type of situation any one of the five sentences 
should be chosen by 20% of the Ss. Thus, 
binomial tests could be applied to these data. 
The significance levels reported in the middle 
three columns of Table 1 were based on the 
binomial test, while those in the last column 
were based on the Vates corrected z formula 
for approximating binomial probabilities, 
The percentage of Ss Picking the most 
popular correlates as either a first or a second 
choice is shown in Table 2. The objective 
probability of an S selecting any critical 
sentence as either a first or a second Choice is 
two-fifths as shown in the first column, Given 
the sample sizes in this study, the z approxi- 
mation of the binomial test can be applied to 
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all of the data in Table 2 (Hays, 1963). It 
may be seen that the data in Table 2 are quite 
comparable with the data in Table 1. Further, 
it is apparent that a number of illusory 
correlations have emerged from the present 
investigation, and that there is a striking simi- 
larity among the clinicians choices and the 
illusory correlates. 


Analysis of Confidence Ratings 


Unfortunately, analysis of the confidence 
ratings was vitiated because most Ss avoided 
the use of extreme categories. However, a chi- 
square test (x? = 14.961, df=4, p « 01) 
revealed that Ss in the three groups did not 
employ the confidence categories in the same 
way. The frequency of category choices 
among the three groups is shown in Table 3. 
The major component of the chi-square was 
derived from the use of the “positive” cate- 
gory. The CGS group used this category 
infrequently—about half as often as the 
NCGS group. The last-mentioned group, in 
turn, was much less “positive” than the US 
group. Another large component of the chi- 
Square can be traced to the relatively infre- 
quent use of the “guessing” category by 
the US group when compared to the other two 
groups. 


Discussion 


The binomial and z tests applied to the 
data revealed a number of illusory correla- 
tions. In addition, collapsing the data from 
the three experimental groups permitted the 
inference of a trend toward illusory correla- 
tion for each of the descriptive statements 
used in this study. These inferences are 
grounded in the fact that S’s choices were 
significantly nonrandom for each description. 


TABLE 3 


FREQUENCY OF CATEGORY CHOICES IN CONFIDENCE 
RATINGS BY EXPERIMENTAL GROUPS 


Category 
Group 
Positive | Fairly sure Guessing 
CGS 5 58 37 
NCGS 12 52 36 
US 21 57 22 


Note.—See Table 1 for explanation of abbreviations. 


CLINICIAN AS AN ABERRANT ACTUARY 


A significant Q in the absence of a specific 
illusory correlate would seem to indicate that 
two or more sentences together (neither by 
itself significant) were most commonly chosen 
as frequent associates for a particular descrip- 
tive statement. Thus, this study confirms the 
phenomenon described by Chapman and 
Chapman (1967). 

The confirmation of the Chapman and 
Chapman (1967) findings in the present study 
underscores the hardy nature of the illusory 
correlation. Perhaps one of the most striking 
findings is how few of the 60 Ss said that the 
sentences appeared with equal frequency. The 
present investigation also verifies the paral- 
lelism between the choices of the experienced 
clinicians and experimental Ss. Especially sup- 
porting here are the responses to the fifth 
description. Both the clinicians and Ss had 
difficulty agreeing on a sentence which they 
could ascribe to this statement. 

Perhaps the most parsimonious explanation 
of the analysis of the confidence ratings would 
be that the CGS and the NCGS groups have 
learned to be less extreme in subscribing to 
“facts.” They have probably overlearned the 
notion that there are few unequivocal psycho- 
logical relationships. Further, the purposes of 
the experiment could easily be suspect for 
members of the CGS group. Having been 
trained in sentence completion methodology, 
this group might well have perceived the situ- 


675 


ation as evaluative and threatening. If this 
were so, it might have seemed easier for these 
Ss to make an erroneous choice to which they 
were not firmly committed. The Ss might also 
have felt that an observer would be less criti- 
cal of S's error, if S had been unsure of 
himself. 
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VISUAL PERCEPTION OF SEQUENTIAL NUMEROSITY 


BY NORMALS AND RETARDATES 


DONALD H. THOR? anv EDWARD A. HOLDEN, Jr.? 
Edward R. Johnstone Training and Research Center, Bordentown, New Jersey 


Fifty-millisecond light stimuli were separated by intervals of 100, 200, 300, 400, 
500, and 600 msec, Sequences of 2 to 10 flashes were presented at each inter- 
stimulus interval for multiple- and single-locus conditions to 20 educable 
retardates and 20 equal mental age normals, Numerosity discrimination of the 
younger, normal children was more accurate at all interstimulus intervals 
(p< 01) and at all flash numerosities (p € 01). Locus condition interacted 
significantly with groups ( < .01), the normals making more accurate discrimi- 
nations with single locus presentation over a wider range of interstimulus dura- 


tions than did the retardates. Locus condition also interacted significantly with 
numerosity (p< 05), normals making more accurate discriminations over a 


wider range of flash sequences with single 


locus presentation. The results suggest 


that retardates process sequential visual stimuli at a slower rate than do normals 


of equal mental age. 


Repetitive visual stimuli, such as flashes of 
light, must be separated by a minimum dura- 
tion of time to be perceived as successive; a 
longer duration is required for accuracy in 
specifying the number of flashes. Investiga- 
tions involving intermittent stimulation have 
been numerous and extensive (Brown, 1965; 
Misiak, 1967). Variables such as age (Cross, 
1963; Misiak & Loranger, 1961), diurnal 
variation (Walsch & Misiak, 1966), and 
drug effects (Misiak, Zenhausern, & Salafia, 
1966) have been demonstrated to influence 
critical flicker frequency (CFF). 

Retinal location stimulated has been inves- 
tigated in normal young men by Forsyth and 
Chapanis (1958) and in brain-injured chil- 
dren by Mark, Meier, and Pasamanick 
(1958). The latter study was conducted to 
compare foveal and peripheral (15? nasal) 
CFF in children with diagnosed pyramidal- 
tract involvement and normal children. Mean 
foveal CFF thresholds of the brain-injured 
group were lower, and magnitude of reduction 
of threshold values from foveal to peripheral 

1 Requests for reprints should be sent to Donald 
H. Thor, Johnstone Training and Research Center, 
Bordentown, New Ji ersey 08505. 

? The authors thank William Gretsch, principal, 
East Garfield School, Willingboro, New Jersey, and 
members of his staff for permission and assistance in 
obtaining normal school children. The authors are 
also indebted to H. H. Spitz, John J. Winters, jr. 
and other members of the Johnstone research staff 
for their support and critical reading of the 
manuscript. 
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conditions were significantly greater in the 
normal group. 

The present approach was suggested by 
the Mark et al. data as well as a more recent 
Study by Spitz and Thor (1968) comparing 
performance of normals and retardates on à 
visual backward masking task. The method 
of multiple sequential stimuli in a numer- 
osity task was imposed to magnify the demand 


upon the visual processing system and thus ! 
possibly reveal discrepant perceptual behavior 


not observed in a two-flash stimulus presenta- 
tion task. 

Retardates with no evidence of brain dam- 
age or other neurological impairment (familial 
type) and normals of equal mental age were 
given a sequential numerosity task under two 
conditions of stimulus presentation: single- 
locus (SL) and multiple-locus (ML) light 
flashes. A 2-10 flash range allowed an analysis 
for difficulty in terms of flash numerosity as 
well as interstimulus interval (ISI). Although 
a number of previous studies (Zigler, 1966, 
1967) have failed to discriminate between 
these populations, it was hypothesized that 
differential performance within groups on the 
SL and ML conditions would discriminate 
between the retarded (R) and the equal 
mental age normals (EMA). 


METHOD 
Subjects 


The Ss were 10 male and 10 female edic 
retardates (mean chronological age, CA, = 17.66, 


SD =1.08; mean IQ = 68,95, SD = 5.62; mean men- 
tal age, MA, — 12.17, SD — 1.16) in residence at the 
Edward R. Johnstone Training and Research Center 
and 10 male and 10 female equal MA normals (mean 
CA=10.93, SD -— 28; mean 1Q— 116.50, SD— 
7.05; mean MA = 12,73, SD = .72) from a neighbor- 
ing elementary school. 

Inspection of institutional case history files on all 
retardates revealed no firm neurological evidence 
of brain damage. In two cases there was suspicion of 
mild neurological involvement and in one case a 
history of motor seizures, Typically, the retardates 
came from impoverished and broken homes, had been 
neglected in infancy and early childhood, and had 
long histories of intellectual impairment. 


Apparatus and Stimuli 


The task required Ss to count the number of light 
flashes presented successively in sequences varying 
from 2 to 10. In the SL condition the same light 
flashed repetitively, whereas in the ML condition 
sequential flashes were distributed among lamps in 
different positions. The stimulus display source con- 
isted of a 10 X 10 matrix of neon lamps (NE-2s) 
i paced 1 in. horizontally and vertically between 
‘centers. Horizontal and vertical visual angle between 
adjacent lamps was 48’; visual angle of the matrix 
of lamps was 7°7'. The array of lamps was covered 
with translucent black cloth so that the unilluminated 
lamps could not be seen. The matrix was mounted 
vertically with its frontal plane surrounded on the 
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top and sides by a 3 X 4 ft. black facade. This com- 
posite was set upon a table 31 in. high and was 
iluminated by a 73-w. írosted incandescent lamp 
located 7 ft. in front of its center and 7 ft. 
above the floor. 

Positions of the lamps to be illuminated for each 
numerosity sequence in the ML condition were 
selected by use of a random numbers table. In the 
SL condition the same lamp, approximately in the 
center of the matrix, was illuminated for all numer- 
osity sequences. Control of numerosity in this con- 
dition was effected by manual operation of a switch 
composite. 

Length of ISI was controlld with Gerbands 
Model 135 recycling timer, and in all presentations 
each lamp remained illuminated for 50 msec. 

A 65-db. white noise background (measured from 
S's position relative to the source) effectively masked 
auditory components of the switching operations. 


Procedure 


The Ss were tested individually in sessions requir- 
ing approximately 40 min. Half of the males and 
females in each group received the SL condition fol- 
lowed by the ML condition, the remaining Ss receiv- 
ing these conditions in reverse order. A 10-20 min. 
rest period intervened between the two halves of the 
session. Except for the rest period, S sat in a chair 
6 ft. in front of the lamp matrix. Prior to each condi- 
tion S was instructed to count the number of flashes 
which would appear and report the number observed 
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Fic. 1, Error frequency (A) and absolute magnitude of error (B) at varying interstimulus 
intervals by normals and retardates on single locus and multiple locus viewing conditions. 
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Fro. 2. Error by groups and viewing condition at varying number of light flashes. 
(Same data points as that of Figure 1.) 


when the flashes ceased. Several pretest trials at 
varying flash numerosities were then given at a 
l-sec. ISI to establish that S could count and follow 
instructions, Three retardates were excused from 
participation for failing this pretest, and another 
retardate was replaced for having a history of 
psychosis. After pretesting, S was further instructed 
that every time he gave a correct answer an M&M 
candy would drop from a dispenser into the recepta- 
cle at his side and that all winnings could be retained 
at the end of the experiment. Retardates were given 
the alternative of exchanging candy for: cigarettes, 

Each numerosity sequence was preceded by a 1- 
sec., 65-db. 500-Hz. warning signal, the termination 
of which was followed within .6 sec. maximum by 
the first stimulus light. Under each locus condition 
all Ss received the same order of 54 trials consisting 
of all possible combinations of numerosities 2, 3, 4, ie 
6, 7, 8, 9, and 10 with ISIs of 100, 200, 300, 400, 
500, and 600 msec. Within the 54 trials one random- 
ized order of numerosity was repeated three times, 
with ISI order varying randomly across the entire 
trial sequence. Intertrial intervals varied from 3 to 
10 sec. 


RESULTS 


Sex and order variables within the R and 
EMA groups were nonsignificant and were 
collapsed for subsequent analyses, In addition, 
no significant correlations were found between 
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6 7 8 10 


binocular visual acuity scores and number of 
correct responses on SL and ML conditions 
within groups. 

Figure 1 describes the decrease in error with 
increase in ISI by change in percentage of 
error (A) and by change in mean magnitude 
of error (B). Percentage of error (sum of 
errors/total trials X 100) and mean magni- 
tude of error (sum of absolute deviation 
error/n) describe essentially similar relation- 
ships. : i 

A three-factor repeated-measures analysis 
of variance (Winer, 1962, p. 319) was com- 
Puted with error frequency data and with 
error magnitude data. Normals made signifi- 
cantly fewer errors (F = 8.32, dj = 1/38, ? 
S 01) and errors of lesser magnitude (F = 
7.39, dj = 1/38, p < .01). Within groups, SL 
and ML presentations did not differ signifi- 
cantly in error frequency or magnitude. The 
interaction of Locus Condition x Groups was 
Significant for error frequency (F = 7.40, df 
= 1/38, p < 01) but nonsignificant for error 
magnitude, Expected significance of ISI was 
apparent with F = 144.83 and 159.86, df= 
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5/190, p < .001 for error frequency and mag- 
nitude, respectively. ISI also interacted sig- 
nificantly with groups for error magnitude 
(F=3.19, df = 5/190, p<.01) but was 
nonsignificant for error frequency. The Locus 
Condition X ISI interaction was significant in 
both analyses with F = 6.16, df = 5/190, p 
< .001 for error magnitude and F = 3.73, df 
= 5/190, p< .01 for error frequency. The 
triple interaction of Locus Condition x ISI x 
Groups was nonsignificant for the error fre- 
quency and significant for error magnitude (F 
— 3.02, df = 5/190, p<.05), and can be 
attributed to the greater magnitude of error 
at the shorter ISIs. 

An analysis of variance was also computed 
to determine the effect of numerosity upon 
accuracy (Figure 2) and in this case the data 
were collapsed over ISI and expanded over 
the nine flash sequences. Groups differed sig- 


nificantly (F = 8.54, df= 1/38, p< 01), 


retardates making more errors at each numer- 
osity level. The interaction of Groups X 
Numerosity was not significant, indicating no 
group differences in performance with increase 
in number of flashes. Locus condition was not 
significant. The interaction of Locus Condi- 
tion X Groups was significant (F = 6.78, df 
= 1/38, p < .05). Numerosity (F = 71.46, 
df = 8/304, p< .001) and the interaction 
Locus Condition x Numerosity (F — 4.35, df 
= 8/304, p « .001) were both highly signifi- 
cant. The triple interaction was not signifi- 
cant. 

Interactions of the single and multiple locus 
conditions with groups can be seen in the 
crossing of dashed and solid lines connecting 
open and closed data points (Figures 1 and 
2). More difficult trials at the 100-msec. ISI 
(Figure 1) reveal a multiple locus contribu- 
tion to accuracy in both groups that declines 
with increasing duration of ISI. The point 
where the SL condition no longer contributes 
to a greater mean error is reached at approxi- 
mately 200-msec. ISI by normals and at ap- 
proximately 400-msec. ISI by the retardates. 
A similar interaction is also apparent in Fig- 
ure 2 in terms of numerosity. Normals are 
more accurate on the SL condition over a 
greater span of numerosities than are the 
retardates, who are more accurate on the 
ML condition. 
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LOCUS CONDITION 


Fic. 3. Direction of error at single and multiple 
locus viewing conditions by groups. (Algebraic sum 
of frequency of positive error and frequency of nega- 
tive error for each S, summated by group and con- 
dition, and divided by 7.) a 


Correlations computed with IQ and number 
of correct responses were r = .45, p < .01 and 
r=.53, p<.01 for the retarded group 
(Wechsler full scale) on the SL and ML con- 
ditions, respectively; for the equal MA group 
(Kuhlmann-Anderson) r = .08, ns (SL), and 
r= 44, p< 05 (ML). 

Figure 3 describes direction of error for SL 
and ML condition by R and EMA groups. 
The difference between positive errors (fre- 
quency overestimates) and negative errors 
(frequency underestimates) were obtained for 
each S at SL and ML conditions of presenta- 
tion. A constant was added to eliminate nega- 
tive numbers and an analysis of variance was 
computed for Groups X SL-ML Condition 
(Winer, 1962, p. 302). Groups differed signifi- 
cantly (F = 10.88, df = 1/38, p< 01) as 
did the interaction Groups X Condition (F — 
4.80, df = 1/38, p < .05). Normals made sig- 
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nificantly fewer negative errors, t = 4.33, df 
= 38, p < .001, than did retardates on the 
ML condition and the groups did not differ 
significantly, t= 1.81, df = 38, p> .05, on 
the SL condition. 


DISCUSSION 


The gross differential in accuracy of per- 
Tormance between groups was distributed uni- 
formly over all ISI and flash numerosity con- 
ditions. On each stimulus condition, the 
retardates made more errors at every duration 
and numerosity than did the normals. 

Comparison of SL and ML conditions by 
group mean error and magnitude of error 
(Figure 1, A and B) demonstrates that nor- 
mals are able to use SL more accurately than 
ML over a range of ISI while retardates do 
not use the SL condition to a distinct ad- 
vantage at any ISI. The interaction Groups X 
SL-ML Condition may be interpreted in refer- 
ence to a fusion of images when presented in 
the same locus, If such a fusion occurs at a 
longer ISI in retardates than in normals, one 
would expect an interaction similar to that 
described. That is, the SL condition would 
contribute to greater error with decreasing 
ISI sooner for retardates than for normals. 

Critical flicker fusion (CFF) has been re- 
lated to brain damage by Mark, Meier, and 
Pasamanick (1958). Performance of children 
with pyramidal-tract damage was compared 
with that of normals and “the change in 
threshold from foveal to peripheral conditions 
discriminated between the two groups (p< 
0.01) [p. 685].” Consistent with the present 
data was their finding that peripheral thresh- 
old decreased in the normals and remained 
constant in the brain injured, 

Relatively greater difficulty with longer se- 
quences of light flashes by retardates does not 
explain their greater overall error as the inter- 
action Groups X Numerosity is not significant 

(Figure 2). It may be noted, however, that 
the interaction Groups x Locus Condition 
over numerosities is significant and compar- 
able with the same interaction over ISI. With 
longer sequences of light flashes normals 
make fewer errors in the SL than in the ML 
condition. Retardates do not reach a point at 
any numerosity where the SL condition leads 
to significantly fewer errors, 
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The significant interaction Groups X Locus 
Condition described in Figure 2 is open to 
several interpretations. The fusion theory is 
only admissible if numerosities of five or less 
are discounted since one would expect fusion 
of 3 and 4 flash sequences on the SL condition 
as readily as with sequences of 6 or 7 flashes, 
An alternative may be that normals are more 
capable of retaining their fixation for a longer 
duration on a single locus than are the re- 
tardates. Another alternative may be that 
retardates simply do not have a foveal cone 
of visual acuity (Thomas, 1968, p. 93) com- 
parable with normals. In any case, the signifi- 
cant interaction appears to be due to the 
atypical curve of the normals on the SL 
condition, 

The change from negative toward positive 
error in the ML condition by the normals 
(Figure 3) was further interpreted by inspect- 
ing a tabulation of error by distribution ac-9* 
cording to sign. Normals shifted to positive 
along the entire distribution, but the total 
number of trials correct decreased only from 
737 to 701. Retardates shifted slightly toward 
more negative errors on ML, with an increase 
in total number of trials correct from 563 to 
616. Although significant in terms of the 
2 X 2 analysis of variance, change in direction 
of error showed little effect in the previous 
analyses for percentage of error and magni- 
tude of absolute error. Accuracy within groups 
was not influenced by the shift in error direc- 
tion; thus the normals may be considered 
as more sensitive to the ML condition in giv- 
ing higher estimates of flash numerosity in 
more difficult discriminations than did the 
retardates. Interpretation of the shift toward 
higher numerosity estimates for normals un- 
der the ML condition is highly speculative; 
but it can be conjectured that lower counting 
rates (Vega, 1965) in the retardates handi- 
capped any additional perceptual advantage 
accruing from the spatially distributed stimuli. 

Nonsequential numerosity discrimination in 
R, equal MA, and equal CA groups has re- 
cently been investigated by Spitz, Hoats, and 
Holden (1968). Dot patterns presented tach- 
istoscopically did not reveal a significant dif- 
ference in performance between R and equal 
MA groups although a significant difference 
was obtained between R and equal CA groups. 
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Evidently; sequential presentation of stimuli 
involves behavior that discriminates equal MA 
and retarded Ss. 

In conclusion, retardates appear to differ 
from equal MA normals in their rate of visual 
processing of sequential stimuli. The experi- 
mental design served to treat the stimulus 
condition of single or multiple locus as a 
within-group variable, thus excluding the 
source of variance attributable to normal-re- 
tardate differences. Emotional, motivational, 
or attentional differences between groups were 
thus statistically separated from the variables 
of primary consideration. 

Sequential and nonsequential presentation 
of stimuli to other sense modalities may serve 
to further delineate behavioral distinctions of 
retardates and normals. Etiology of such dis- 
tinctions would appear to be related to central 
nervous system trauma or malformation in 
the retarded. 
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ALCOHOLISM AND CLINICAL SYMPTOMS 


ALLEN GOSS: Aw» THOMAS E. MOROSKO 


Texas Research Institute of Mental Sciences, Houston 


The MMPI was employed in an investigation of the symptom patterns of 200 
male alcoholics. The Psychopathic, Depression, and Psychasthenia scales singu- 
larly and in combination were the most frequently observed T scores with 
elevations above 70. Indications for drug usage to alleviate clinical symptoms 
versus techniques to alter drinking behavior were discussed. 


Weingold, Lachin, Bell, and Coxe (1968) 
have reported on the assertion that depression 
is a characteristic feature of alcoholism. Hy- 
pothesizing that alcoholics exhibit clinical 
signs of depression they selected the Zung 
(1965) scale as a measure of depression. 
Also of interest in their study was the evalu- 
ation of standard drug’ therapy effects in 
reducing depression, Their results indicated 
that 53 of 73 alcoholic Ss exhibited scores 
which fell above the normal range of Zung’s 
depression scale. The purpose of this study 
was to investigate symptom patterns of alco- 
holics with an instrument which permits 
greater variability and completeness of symp- 
tomatic expression. 


METHOD 


Subjects. The Ss were 200 male consecutive admis- 
sions to the Texas Research Institute of Mental Sci- 
ences Alcoholic Treatment Unit. All of the patients in 
this sample were outpatients. 

Procedure. The Ss on the first day at the Institute 
complete a rather extensive admittance procedure 
which included psychological testing. The Minnesota 
Multiphasic Personality Inventory (MMPI), Ed- 
wards Personal Preference Schedule (EPPS), Pea- 
body Picture Vocabulary Test (PPVT), Internal-Ex- 
ternal Scale (I-E), and the Bender-Gestalt Test 
constituted the psychological test battery. Results 
from the MMPI receive primary attention in this 
article. 

The population was divided into two groups of 
100 each for the data analysis. The sample size was 
restricted to an N of 100 in order to increase the 
meaningfulness of differences and to have a cross- 
validation population for comparison purposes. 


1 Also at the University of Saint Thomas and the 
Baylor College of Medicine. Requests for reprints 
should be sent to Allen Goss, Texas Research Institute 
of Mental Sciences, 1200 Moursund Avenue, Texas 
Medical Center, Houston, Texas 77025. 


RESULTS 


Table 1 gives age, IQ, and education in- 
formation for the population. These demo- 
graphic figures appear to be comparable with 
other investigations with alcoholic patients 
(Fitzgerald, Pasewark, & Tanner, 1967; Goss, 
Morosko, & Sheldon, 1968; Miller, Pokorny, 
& Hanson, 1968). Table 2 provides mean and 
t ratio MMPI information for the two sam- 
ples, and Table 3 presents a code analysis of 
MMPI T scores above 70 for single areas as 
well as for several pattern configurations. The 
patterns included represent the most fre- 
quently observed patterns and pattern com- 
binations. Over half of each population con- 
tains patients who exceed a T score of 70 on 
Depression and on Psychopathic. Nearly a 
third exceed a T score of 70 on Hypochon- 
driasis, Hysteria, Psychasthenia, Schizo- 
phrenia, and Mania. The most interesting 
profile pattern is the 427 and the 427 stem 
combinations. These patterns indicated the 
presence of a lack of interpersonal warmth, 
subjective feelings of alienations, depression, 
and anxiety. 


Discussion 
The results clearly indicate the presence 
of multiple symptoms in addition to the speci- 


TABLE 1 


AGE, IQ, AND EDUCATION INFORMATION 
FOR 200 ALCOHOLIC OUTPATIENTS 


Age IO Education 
Group 
SD 
M SD M SD M E 
1 43.16 8.86 | 110.67 | 18.05 | 10.90 | 287^ 
2 44.19 | 10.01 | 112.49 | 14.78 | 11.72 | 2.72* 


Note.—N — 100 in each group. 3 
* These two groups are tignifeantly different at .05 level on 
education, 


682 


ALCOHOLISM AND CLINICAL SvMPTOMS 


TABLE 2 


Comparison OF MMPI T Score ror Two 
GROUPS OF ALCOHOLIC OUTPATIENTS 


Scale FE Group 1 Group 2 t ratio 
Lie 47.73 41.67 .06 
Validity 61.24 63.33 1.34 
Suppressor 50.14 49.68 34 
Hypochrondriasis 62.18 62.90 32 
Depression 72.53 75.22 1.26 
Hysteria 62.90 65.24 1.39 
Psychopathic 73.09 75.85 1.66 
Feminine Interests 59.87 60.05 42 
Paranoia 59.72 62.27 1.55 
Psychasthenia 66.84 68.57 84 
Schizophrenia 67.33 65.88 58 
Mania 61.74 62.34 34 
Social Ability 56.58 57.00 28 


Note,—N = 100 in each group. 


fied problem of alcoholism. The conclusion 
of Weingold et al. (1968), which suggests 
increased use of antidepressant therapeutics 
with alcoholics, appears questionable in the 
light of multiple symptoms. If the notion of 
providing drug therapy for indicated symp- 
toms was followed, antidepressant, antianx- 
lety, antipsychotic, and impulse control com- 
pounds would seem necessary for approxi- 
mately one-fourth (4278) of the population. 
In the light of recent reviews regarding the 
effectiveness of drugs with alcoholics, this 
policy seems questionable (Ditman, 1968; 
Kissin & Gross, 1968). Pre-post MMPI anal- 
ysis of alcoholics in a nonmedication treat- 
ment program (Sikes, Fabish, & Valles, 1965) 
indicate that these subjective symptoms de- 
crease without medication. 

Since over half of the population exceeds 
a T score of 70 on the Psychopathic scale a 
possible conclusion from the MMPI analysis 
supports an increased use of antabuse (Wal- 
lerstein, 1958) with patients who have T 
scores above 70 on Scale 4. This conclusion 
makes the assumption that continual drinking 
is more related to impulse control problems 
than to feelings of depression. Further work 
with aversive conditioning procedures also 
seems promising (Morosko & Baer, 1970). 
Other possibilities include the use of treat- 
ment procedures appropriate for particular 
subgroups of the alcoholic population. Wal- 
lerstein (1958) has reported on the effective- 
ness of different treatment procedures for 
particular subgroups of alcoholics. Further 
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TABLE 3 
FREQUENCY oF MMPI T SCORE AREAS 
AND COMBINATIONS ABOVE 70 
T scores and T score 
combinations Group 1 Group 2 Total 
above 70 
4 69 58 127 
2 64 50 114 
24 49 38 87 
7 37 35 72 
1 34 30 64 
3 39 25 64 
27 37 27 64 
47 33 25 58 
8 31 36 57 
9 33 24 57 
427 32 24 56 
49 27 20 47 
4278 24 22 46 
13 25 20 45 
123 26 18 44 
6 21 17 38 
4271 19 17 36 
4273 20 14 34 
42781 15 17 32 
15 16 31 
42783 lx 14 14 28 
0 14 13 27 
19 13 14 27 
427813 12 13 25 
39 14 10 24 
427819 8 9 17 
427839] 7 7 14 
4278139 7 6 13 


work in refining subgroups and increasing the 
effectiveness of treatment procedures seems 
essential in the light of the present low suc- 
cess rate with alcoholics. 
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Familial retardates who were to be released from institutional care (release 
group, R; N=120) were compared with familial retardates who were not 
candidates for discharge (institution group, I; N = 120) with regard to home 
backgrounds, family relationships, and preinstitutional and institutional ad- 
justment. Group R presented a preinstitutionalization history of milder retarda- 
tion than Group I. Group R evidenced cognitive and behavioral improvement 
from time of admission, whereas Group I evidenced stagnation and deterioration. 
Group R was subdivided into those who were successful versus unsuccessful in 
adapting to the community. Prior to institutional release, the successful group 
evidenced fewer cases of psychiatric disturbance. They evidenced superior per- 


formance in community-job situations. 


In Windle's (1962) review of the litera- 
ture with regard to prognosis in mental sub- 
normals, he identified three kinds of varia- 
bles which are associated with release from 
institutionalization and/or successful adap- 
tion to the community after release: those 
which have consistently shown predictive 
value for successful adjustment after release 
from institutionalization (e.g., age at release, 
institutional sanction for release, the attitudes 
in the home to which the retardate is being 
released); variables which predict release but 
which do not seem to be related to successful 
adjustment after release (e.g., IQ, age at ad- 
mission, “good” personality, diagnosis); and 
variables which are commonly assumed to re- 
late to successful adjustment, but which have 
not been demonstrated to have prognostic 
value (e.g., length of public institutionaliza- 
tion, organic versus psychosocial etiology, 
family interest in the retardate, working abil- 
ity). Research reported since Windle's article 
generally supports his conclusions (Gage & 
Wolfson, 1963; Gralicker, Koch, & Hender- 
son, 1965; Strickland & Arrell, 1967), al- 
though additional evidence suggests that IQ 
may have predictive value for adjustment 

1 This work was supported by Research Grant HD 
01798 from the Institute of Child Health and Human 
Development, United States Public Health Service, to 
K. J. Kun. 

2 Requests for reprints should be sent to William 
Vogel, Department of Psychology, Worcester State 
Hospital, Worcester, Massachusetts 01604. 


(Appell, Williams, & Fishell, 1966; Baller, 
Charles, & Miller, 1967; Jackson & Butler, 
1963; Madison, 1964). 

Windle noted several methodological prob- 
lems which make evaluation of the literature 
difficult: (a) a failure to study homogeneous 
subgroups of retardates so that meaningful 
differences are obscured; (5) a failure to 
employ large enough numbers of Ss so that 
meaningful generalizations might be made; 
(c) a failure to employ suitable control sam- 
ples; (d) inadequate statistical treatment; 
and (e) concern with individual variables, 
rather than the predictive value of indexes. 
which employ several variables. 

This study is part of a programmatic in- 
vestigation of factors which are associated 
with behavioral improvement in institutional- 
ized retardates (Vogel, Kun, & Meshorer, 
1967, 1968, 1969). The current report is an 
attempt to determine, first, in the light of 
Windle’s critique of earlier work, which varia- 
bles are associated with the administrative 
decision to release institutionalized retardates 
into the community. Second, the authors at- 
tempt to ascertain which variables are asso- 
ciated with successful adjustment after re- 
lease. 


METHOD 
Factors Associated with Institutional Release 


Subjects. The first problem was to ascertain the 
factors associated with institutional release, as con- 
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trasted with continued institutionalization. Conse- 
quently, the last 120 familial retardates were identi- 
fied who were released with institutional sanction 
from Wrentham State School into the community 
(release group, R). Only those Ss were chosen who 
were released to job placements in the community; 
none were chosen who were released to parent or 
foster parent care. These retardates were individually 
matched, for chronological age (CA) on admission 
and for current CA, with 120 familial retardates who 
were not considered to be candidates for release 
(institutional group, I). The two groups were com- 
pared with regard to psychological, social, and bio- 
graphical variables, described in the following para- 
graphs. Groups I and R were individually matched 
for CA and age at admission, first, because the im- 
portance of these variables has been amply demon- 
strated previously (Windle, 1962) and, second, be- 
cause they are related to other variables (e.g., mental 
age, MA) to be considered below. 

For Group R Ss, intelligence test data were avail- 
able for the year of admission and for the year of 
release. For each Group I Ss, similar data were avail- 
able for the year of admission, and for the year 
which corresponded to the year of release of the 
matched partner in Group R, 

For the test and retest years, ratings were ob- 
tained of the everyday behaviors of Ss, as described 
in a previous publication (Vogel, Kun, & Meshorer, 
1968). In brief, perusal of the records of Wrentham 
State School suggested that the available behavioral 
information could be placed into two classifications 
which were termed personal skills (PS) and social 
and emotional behavior (SEB). The items cate- 
gorized as PS reflect acquisition of those Perceptual, 
motoric, and verbal skills which enable a retardate 
to assume personal responsibility for himself and his 
possessions and enable him to assist the staff mem- 
bers in caring for the persons and Possessions of 
other retardates. The SEB items reflect acquisition of 
self-control over emotional expression and ability to 
relate socially to others as individuals and in groups. 
Two inventories were constructed, designated PS and 
SEB, with 15 items in each, Each item was scaled so 
that a clear order of behaviors was described ranging 
from absence of a behavioral Skill, through partial 
possession of it, through total mastery. The rater 
had to decide which of the multiple choices most 
adequately characterized a retardate's customary level 
of behavior in respect to that area of activity. One 
item each from the PS and SEB inventories is shown 
below; the other 14 items Írom each inventory 
were similarly constructed. 


Dressing: 
a. Unable to dress self at all, 
b. Can put on simple garments without assist- 
ance (eg., socks, underwear, shorts). 


c. Can dress self, but requires assistance with 
buttons, zippers, and laces, 

d. Can dress self, including buttons, 
and laces without assistance. 

e. Assists in dressing of others, 


zippers, 
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Relationships to ward personnel: 

a, Actively uncooperative and obstructive. 

b. Passively uncooperative. 

c. Passively cooperative and helpful. 

d. Actively cooperative and helpful. 

e. Helpful in promoting or gaining the cooper- 
ation of other retardates with the ward 
personnel, 


The development of the PS and SEB inventories 
was influenced by the TMR Performance Profile test 
of DiNola, Kaminsky, and Sternfeld (1965).9 Specifi- 
cally, their technique was adopted of assessing re- 
tardate behavior by means of multiple-choice 
items arranged to describe a progression of behaviors 
ranging from no mastery through mastery of a given 
Skill. The published content of the TMR test was 
generally inapplicable for the authors’ purposes. 
Its detailed scoring requires that the scorer have 
personal acquaintance with the retardate, rather 
than the use of institutional records which consti- 
tuted the source of information in this study. 
Nevertheless, content from seven TMR items was 
applicable to the purposes of the present study. The 
PS and SEB scores discussed and tabulated in the 
proceeding paragraphs are the result of combining 
scoring from these TMR items with the items 
which were prepared. 

Two research assistants, trained by the first author, 
scored the PS and SEB items with an interscorer 
reliability of .90, Further description of the PS and 
SEB scales, of their scoring, of the results of a 
factor analysis of the items, and of interscorer 
reliability and validity studies are presented in an 
earlier report (Vogel, Kun, & Meshorer, 1968) in 
which it was demonstrated that the scoring of the 
scales was sensitive to, and accurately reflected, 
changes in the behaviors of subnormals. 

Social history. Groups I and R were also com- 
pared on a number of variables, reflective of the 
social, educational, and medical histories of the 
patients, thought to be relevant in light of Windle's 
(1962) review. These were (a) socioeconomic status 
of the parents as reflected in paternal education and 
occupation, (5) parental interest, as reflected in the 
number of visits the retardate received and number 
of visits home, (c) CA at which S's mental deficiency 
was first noted, (d) the highest educational level 
achieved by the patient, (e) the highest occupational 
level at which the patient was able to perform satis- 
factorily at Wrentham State School, and (f) quality 
of occupational performance (Ss were rated excellent, 
satisfactory, or unsatisfactory for the highest occu- 
pational level they achieved). 

Groups I and R were also compared in regard to 
Dsychiatric disability other than mental defect (no 
Psychiatric disturbance, diagnosis of character dis- 
Order or neurosis but not psychosis, diagnosis of 
Dsychosis). Finally, Groups I and R were compared 
in regard to sexual composition. 


5 The PS and SEB items prepared by the authors 
and the identification numbers of the relevant TMR 
items are available from the first author upon 
request. 
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TABLE 1 


DIFFERENCES BETWEEN RELEASE AND INsTITUTION GROUPS IN INITIAL AND RETEST 
CA, MA, IQ, PERSONAL SKILIs, AND SOCIAL AND EMOTIONAL BEHAVIORS 


| Initial test Retest 
Measure | 

| Group R Group I Group R Group I F 
CA (yr.) 12.3 A 6.3 12.3 + 7.4 == 28.5 + 10 28.5 + 9.9 = 
2D A (yr.) 5.00 27.05* 8.84 5.9 142.1* 

s d 45.9 60.9* 60.3 41.8 103.0* 
T B 63.3 ns 844 56.1 221.9* 
69.0 251* 95.4 79.1 132.58* 


PS | 81 


Note.—R = release group; I = institution group; PS = personal skills; SEB = social and emotional behaviors. 


*p <.001 


Factors Associated with Success after Release 


The second problem was to ascertain which factors 
differentiate retardates who make a successful ad- 
justment to the community from those who must be 
returned to the institution. Accordingly, from Group 
R, two subgroups of 30 Ss each were formed, a suc- 
cessful release group (Group Rs) and an unsuccessful 
release group (Group Ru). Members of Group Rs 
had successfully maintained themselves outside of the 
institution for a period of 1 yr., while members of 
Group Ru had been reinstitutionalized prior to that 
‘time. The members of these groups were individually 
matched for CA, age at admission, year of admission, 
and year of release. Groups Rs and Ru were com- 
pared on initial and retest MA, IQ, PS, and SEB 
scores, as well as on the social history variables de- 
scribed in the preceding paragraphs. 


RESULTS 
Factors Associated with Institutional Release 


Indexes of adaptive behavior. The results 
of the F-test comparisons between the two 
groups on initial and retest CA, MA, IQ, PS, 
and SEB scores are shown in Table 1. In 
Table 1, it may be seen that MA, IQ, and 
PS scores differentiate the two groups on 
initial test at very high levels of statistical 
significance. However, for each index, the 
differences between the two groups are much 
greater on retest. The SEB inventory, which 
did not differentiate between the groups ini- 
tially, differentiated on retest to a greater 
degree than any other index. 

In Table 2, it is apparent that on retest 
there is little overlap between the two groups 
on the PS and SEB distributions, and little 
overlap between the two groups on the ex- 
tremes of the MA and IQ distributions. Thus, 
in Group R, only 1 S evidenced an MA on 
retest of less than 6 yr., as contrasted with 
51 Ss in Group I; 70 Ss in Group R evi- 


denced an MA of 8.1 or greater, as contrasted 
with 14 Ss in Group I. The overlap between 
Groups I and R was somewhat greater with 
IQ than with MA. 

On the retest SEB inventory, no Ss in 
Group R, as contrasted with 67 Ss in Group 
I, evidenced scores of 60 or less; 79 Ss in 


TABLE 2 


DISTRIBUTIONS FOR RELEASE AND INSTITUTION GROUPS 
or INITIAL AND RETEST MA, IQ, PERSONAL 
SKILLS, AND SOCIAL AND EMOTIONAL 
BEHAVIOR SCORES 


Initial test Retest 
Measure a 3 E 
roup TOU! 
G Lh 20 S59 (N= Boe (N= 120) 
MA 
0.1-2 1 11 0 7 
2.1-4 15 38 0 13 
4.1-6 35 29 1 31 
6.1-8 34 28 39 46 
8.1-10 27 12 42 13 
10.1-12 8 2 28 1 
I 
0-20 1 5 0 9 
21-50 23 67 28 80 
51-70 81 43 66 27 
71-80 14 5 22 3 
81-90 1 0 4 1 
PS 
0-50 2 19 0 7 
51-60 2 9 0 4 
61-70 18 32 0 4 
71-80 47 29 3 42 
81-90 38 23 13 44 
91-100 13 8 104 19 
SEF 50 5 15 0 45 
51-60 26 23 0 22 
61-70 46 49 5 24 
71-80 39 32 36 23 
81-90 4 0 55 6 
91-100 0 1 24 0 


Note.—See Table 1 for explanation of abbreviations. 
*TheN for each distribution is 120 Ss, except for MA retest 
for Groups R and I, where the respective Ns are 110 and 
111 Ss. 


688 W. Vocet, K. J. Kun, AND E. MESHORER 
TABLE 3 
PEARSON CORRELATIONS AMONG IQ, MA, CA, PERSONAL SKILLS AND SOCIAL AND 
EMOTIONAL BEHAVIOR SCORES ON ‘TESTING AND ON RETESTING FOR 
RETARDATES SELECTED FOR RELEASE FROM ÍNSTITUTIONALIZATION 
CA MA IQ | SEB 
Measure 
Initial Retest Initial Retest Initial Retest | Initial Retest 


SEB 13 24M 28 24" 34 | 
(22* (26)* | Q5" (28)** | 

PS 65** gi" FT “40"* 34 aare | agno 
(.65)** (.40)** (.62)** (.33)** (A8)** (.40)"* 


Note,—See Table 1 for explanation of abbreviations. All correlations are based on N = 120, except the retest correlations 
involving M fen which N = 110, The correlations in parentheses are with CA partialed out. 
< 


5$ 0h 
Group R, as contrasted with only 6 Ss in 
Group I, evidenced scores of 81 or higher. 

On retest on the PS inventory, 104 of 120 
Ss in Group R evidenced a score of 91 or 
better, as contrasted with 19 of 120 Ss in 
Group I. 

The Pearson correlations among IQ, MA, 
CA, and PS and SEB scores for Group R 
are shown in Table 3, and those for Group I 
are shown on Table 4. The “retest” correla- 
tions are not based on the raw scores at re- 
testing, but on difference scores from the 
first to second testing; that is, the retest val- 
ues in the table represent the intercorrelations 
of the increments in CA, MA, IQ, PS, and 
SEB scores from test to retest. The correla- 
tions are presented both with and without CA 
partialed out. The correlations with CA par- 
tialed out are shown in parentheses, 

Thus presented, Tables 3 and 4 are com- 
parable to previously published tables of inter- 


correlations of test and retest MA, IQ, PS, 
and SEB scores (Vogel, Kun, & Meshorer, 
1968). Several observations may be made in 
this regard. (a) Here, as in the earlier work, 
PS scores are very highly associated with ini- 
tial MA scores, even with CA partialed out. 
(5) Also, as in previous work (Vogel, Kun, 
& Meshorer, 1968), the correlations of the 
intelligence measures (MA, IQ) with PS 
Scores are appreciably higher, both on test 
and retest, than are the correlations with 
SEB scores. (c) In both groups, the correla- 
tions of PS and SEB scores are statistically 
significant, (d) In both groups the patterns 
of initial correlations are highly similar in 
Groups I and R. On retest, the patterns of 
correlations are again similar, although not 
to the same extent as on initial test. 
Prediction of institutional release from 
combinations of scores. An attempt was made 
to determine the extent to which combina- 


TABLE 4 


PEARSON CORRELATIONS AMONG IQ, MA, CA, PERSONAL Skitts, AND SOCIAL AND 


ON INITIAL TEsTING AND ON RETESTING 
FOR RETARDATES REMAINING INSTITUTIONALIZED 


EMOTIONAL BEHAVIOR SCORES 


CA MA ; 
Measure a fe 
Initial Retest Initial Retest Initial Retest Initial |. Retest 
MA 63 .09 
IQ .00 —.13 | 
SEB 16 —.15 32** 4 A1** 42** | 
.29' +e b bád * * 
PS ze —.08 Cie Che Bu Ce 155f* 
(75)** (.80)** (.68**) (:78)** (.57)** 


Note.—See Table 1 for explanation of abi 


dod; tions 
involving MA, for which N= 111, The Moor abbreviations, All correlations are based on N = 120, except the retest correlati 
$ . 


h tions in parentheses are with CA partialed out. 
**p c.l. 


.68** 
(.68)** 
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tions of variables could differentiate, at the 
time of each retardate’s initial admission, 
those retardates who were eventually to be 
released from those who were to remain 
institutionalized. 

All Ss in Group R were assigned an arbi- 
trary score of 2, and all Ss in Group I an 
arbitrary score of 1, so that all 240 retarded 
Ss were “scored.” These scores (1 or 2, for 
each S) were collectively called group mem- 
bership scores (GMS). Point biserial corre- 
lations were obtained between these GMSs 
and those variables which significantly ($ < 
.05) differentiated the two groups at the time 
of admission, that is, MA, IQ, SEB, and PS 
scores, and age at which abnormality was first 
noted, The correlations were as follows (posi- 
tive correlations indicate that Group R evi- 
denced higher scores than Group I on a given 
test): MA X GMS, .318; IQ x GMS, .457; 
SEB x GMS, .176; PS x GMS, .317; and 
Age at Which Abnormality Was First Noted 
X GMS, .244. 

Using all 240 Ss, a multiple correlation 
was then obtained between these five varia- 
bles and GMS. The value of the multiple 
correlation was .49, p < .001. It is apparent 
that this multiple correlation is not apprecia- 
bly higher than the correlation between IQ 
at admission and GMS (r — .457). 

Social history data. The retardates uni- 
formly were derived from lower-class back- 
grounds: 9496 of 140 fathers of Groups I 
and R retardates for whom occupational data 
were available were employed at no higher 
than the semiskilled level; only two of the 
fathers had finished high school. There were 
no differences between Groups I and R in 
regard to education or occupation. 

When the members of the two groups were 
compared by F test in regard to the CA at 
which mental subnormality was first noted, 
the difference was statistically significant (F 
= 7.18, p < .01), the mean age for Group R 
being 4.19 yr., and that for Group I, 3.22 yr. 

The retardates were sharply differentiated 
in regard to accomplishments. When the two 
groups were combined and divided at the 
median on the basis of the number of school 
grades completed, it was found that 7376 of 
Group R but only 3276 of Group I com- 
pleted the second grade (x?= 40.1, p< 
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.001). In regard to occupation, all of the 120 
members of Group R, but only 12 of the 
120 members of Group I were gainfully em- 
ployed off the school grounds (x? = 192, $ 
<.001). 

Finally, the two groups differed signifi- 
cantly in regard to sex; 81 of 120 Ss in 
Group R but only 60 Ss in Group I were 
female (x? = 6.88, p < .01). 


Factors Associated with Success after Release 


Neither initial nor retest MA, IQ, PS, or 
SEB scores differentiated Groups Rs and Ru. 
The only social history variables which dif- 
ferentiated the groups were psychiatric dis- 
ability, and job performance. 

Of the Ss in Group Rs, 22 had no history 
of psychiatric diagnosis while 8 had been 
diagnosed at some earlier period as character 
disorder, neurotic, or psychotic; the corre- 
sponding figures for Ss in Group Ru were 
10 and 20 (y?—8.11, p<.01). Thus, Ss 
who successfully adapted to the community 
were less likely to evidence a history of psy- 
chiatric disorder. 

In addition, when Ss’ job performances for 
the highest level jobs they held in the com- 
munity were rated as excellent, satisfactory, 
or unsatisfactory all 30 of the Group Rs Ss 
were rated as excellent, while not one of the 
Group Ru Ss achieved an excellent rating (x? 
= 56.1, p < .001). 


Discussion 
Factors Associated with Institutional Release 


Differences between Groups I and R on 
cognitive and adaptive behaviors. Even in a 
diagnostically homogeneous group of retard- 
ates, those Ss chosen for release from the 
institution (Group R) form a subgroup that 
is different in character from those who re- 
main institutionalized (Group I). Further, 
significant group differences are present from 
the first year of institutionalization, many 
years prior to release. The initial differences 
between the two groups became accentuated 
over time until, on retest, there is little over- 
lap between the groups in terms of their re- 
spective distributions of PS and SEB scores, 
and very little overlap at either extreme of 
both the MA and the IQ distributions. 

The question arises of whether the massive 
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differences on retest were attributable to Ss 
being administratively directed on different 
institutional courses because of staff decisions 
which were made on the basis of the initial 
IQ scores. The data do not seem to support 
this "staff bias" hypothesis. First, at Wrent- 
ham State School, all familial retardates, re- 
gardless of IQ, are automatically considered 
at potential “release” candidates, as reflected 
in the fact that all Ss in this study were 
assigned to the institutional elementary school, 
Only 25% of the retardates who are institu- 
tionalized at Wrentham State School are so 
enrolled, representing those retardates who 
are considered to have the best prognostic 
potential. Since 3296 of the members of 
Group I completed the second grade or more, 
they were considered as release candidates 
for a considerable time. 

Second, 20% of the retardates in Group R 
obtained initial IQ scores of less than 50 
so that they would conventionally be consid- 
ered as “trainable” rather than “educable,” 
that is, incapable of maintaining themselves 
at large in the community. On the other 
hand, 40% of Group I retardates initially 
obtained IQ scores of above 50, so that they 
would conventionally be considered as the 
highest quality retardates, ordinarily capable 
of maintaining themselves in the community. 
Thus, while Group R evidenced a higher 
mean initial IQ score than Group I, there 
were too many individual exceptions to sup- 
port the hypothesis that the group differences 
on retest were primarily a reflection of ad- 
ministrative decisions to give favorable treat- 
ment to persons with an initial high IQ. 

It is also apparent that the differences in 
institutional adjustment between Groups I 
and R cannot be attributed to socioeconomic 
differences in the retardates home back- 
grounds. There were no differences between 
Groups I and R in terms of parental educa- 
tion or occupation, Similarly, the differences 
cannot be attributed to differences in parental 
interest after the retardates’ institutionaliza- 
tion, as there were no differences between 
the groups in terms of parental visits received, 
or in the number of visits to the parental 
home. 

Thus, the differences between the groups 
on retest probably reflect factors other than 
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staff bias based on the initial IQ scores, par- 
ental interest, or parental socioeconomic 
status. The data suggest two primary sources 
of variance. A 

First, the initial test differences in favor 
of Group R (high IQ scores, high PS Scores, | 
mental retardation not noted until an average 
of 1 yr. after the mental retardation of the 
members of Group I became apparent) sug- 
gest that a majority of the members of Group | 
R were intellectually and behaviorally less 
deficient than a majority of the members of 
Group I from the time prior to institutionali- 
zation. 

Second, it appears that the members of | 
Group R had a far more favorable adapta- 
tion to institutionalization than did the mem- 
bers of Group I. Group I developed far more 
psychiatric symptomatology, finished fewer 
school grades, and did much more poorly in 
terms of occupational accomplishments. As 
may be seen from Table 1, the members of 
Group I actually declined in IQ and SEB 
scores from the time of admission to a period 
in time which averaged 16 yr. later. 

It is not possible, from the current data, 
to determine the cause of the poor institu- 
tional adjustment. It cannot be ascertained, | 
for example, if the members of Group I had 
a poor educational record and low SEB scores 
because they developed psychiatric symptom- 
atology; or whether they developed psy- 
chiatric symptomatology as a by-product of 
general neglect and subsequent failure of so- 
cial and emotional development once they 
were no longer enrolled in the school program. 

The data raise intriguing questions in re- 
gard to MA, Only 1 of 120 members of R 
group evidenced an MA under 6 (see Table 
2). Careful inquiry among the administrative — » 
and medical staff indicated that this does not | 
reflect an administrative decision; that 1S, 
there was no policy of releasing only re- 
tardates with an MA of greater than 6. In- 
sofar as intelligence scores had figured in the 
decision whether or not to release a retardate, 
it had been the IQ that had been attended 
to, and even here, no cutoff score had ever 
been established. As may be seen from Table 
2, on retest, 28 of 120 Ss in Group R( that 
is, 23%) had IQ scores under 50, despite the 
fact that retardates with IQ scores under 50 
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are normally considered as incapable of main- 
taining themselves with any degree of inde- 
pendence outside the institution. Thus, the 
IQ score had been employed as a general 
guide for release, rather than as an inviola- 
ble criterion. The data raise the question of 
whether retardates with an MA under 6 are 
incapable of mastering the skills which serve 
as a requisite for institutional release, and 
whether the MA may not be a more valuable 
index of retardate functioning than IQ. In 
earlier work it has been suggested that MA 
relates more successfully to biological and 
psychological behaviors than does IQ (Vogel 
& Broverman, 1964) because MA reflects the 
absolute level of mental ability of an indi- 
vidual, whereas IQ reflects mental ability 
relative to CA, Consequently, MA should be 
a better reflection of the absolute level of 
mental activity and of brain functioning than 
is IQ. 

The most dramatic difference between the 
groups over the course of hospitalization 
came not in terms of intellectual functioning, 
but in terms of SEBs. Although the SEB did 
not differentiate the two groups on initial 
test, it was the best single differentiator on 
retest, there being very little overlap be- 
tween the two groups. Possibly, the sophisti- 
cated skills with which the SEB inventory 
is concerned are not gained until the later 
years of institutionalization, so that the de- 
vice does not successfully differentiate the 
groups on admission. The PS inventory also 
sharply differentiated the two groups, on both 
initial test and retest. These data confirm 
previous findings (Vogel, Kun, & Meshorer, 
1968, 1969) which suggest that PS and SEB 
scores significantly differentiate groups of re- 
tardates along meaningful lines, perhaps to a 
greater extent than do IQ scores. 

It would seem that the SEB scores tap 
different aspects of retardate development 
than do the intelligence scores. Thus, on 
initial test, SEB scores are significantly re- 
lated to MA in both groups, but the size of 
the correlations (Group R, .22; Group I, .29) 
indicates that the two sets of scores have 
Sufficient independence to suggest that they 
measure different, if related, phenomena. 

On the other hand, PS scores are massively 
correlated with MA on initial test, even with 
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CA removed (Group R, .65; Group I, .75). 
In both groups, both initially and on retest, 
the PS scores have much higher relationships 
with MA and IQ than do the SEB scores. 
These interrelationships of PS and SEB 
scores, on the one hand, to MA and IQ scores, 
on the other, are highly similar to patterns 
that have been previously obtained (Vogel, 
Kun, & Meshorer, 1968). The authors have 
interpreted them to mean that MA and IQ 
are related to behaviors which can be gained 
through instruction (i.e., PS scores) but are 
less related to behaviors which reflect com- 
plex social and interpersonal skills (i.e., SEB 
scores). 

Paradoxically, these data support both 
those who argue for the efficacy of intelligence 
tests in evaluating the adaptive potential of 
retardates (McAndrew & Edgerton, 1964, 
1965) and those who argue against (Bauer- 
meister, 1965; Masland, Sarason, & Gladwin, 
1958). It is apparent from Table 2 that the 
MA and IQ scores differentiate the groups 
at a very high degree of significance on re- 
test, although neither differentiates as well as 
the SEB inventory. 

The finding that, despite individual excep- 
tions, IQ on admission is related to institu- 
tional adjustment and to chances of release 
is in accord with previous research (Windle, 
1962). Similarly, the findings that absence 
of psychiatric illness, good academic records, 
and vocational placements were associated 
with institutional release are in accord with 
previous observations that these are the most 
common criteria for institutional discharge 
(Humphreys, 1937; Shafter, 1954; Windle, 
1962). The fact that no relationship was ob- 
served between parental interest and chance 
of release is also in support of Windle’s 
(1962) conclusions. Finally, Windle reported 
that there was no conclusive evidence of a 
relationship between parental socioeconomic 
status and chance of release, and suggested 
that there was a need to investigate this re- 
lationship for homogeneous groups of retard- 
ates. No relationship between release and 
socioeconomic status was found. 


Factors Associated with Success after Release 


It was found that retardates who make a 
successful adjustment to the community after 
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their institutional release can be differentiated 
from those who do not, on the basis of infor- 
mation available prior to institutional release. 
The retardates who eventually make a suc- 
cessful community adjustment evidence sig- 
nificantly less psychiatric difficulty while still 
institutionalized at Wrentham. They present a 
history of superior job performance in the 
community, as contrasted with the unsuccess- 
ful retardates. It seems likely that the two 
findings are associated, that is, that poor job 
performance may be a function of a history of 
psychiatric disorder. 

The fact that these two indexes relate to 
adjustment to the community at such a high 
level of confidence would appear to be of 
prime importance. In his review, Windle 
(1962) reported that the empirical research 
dealing with work history is scarce, although 
he cited a study by Shafter (1957) which sug- 
gested that custodians’ judgments in regard 
to retardates’ work habits were positively re- 
lated to future adjustment in vocational place- 
ment settings, 

Overall, the following results of the present 
study suggest promising areas for future re- 
search, especially those which indicate that 
PS and SEB scores may hold considerable 
value; the finding that release into the com- 
munity cannot be expected with an MA of 
less than 6; the findings that institutional 
adjustment more than initial intellectual ca- 
pability or preinstitutional experience may be 
crucial in determining institutional release; 
and the findings that Ss who are successful in 
adapting to the community showed less psy- 
chiatric disturbance and superior job per- 
formance. 
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The magnitudes of the Ponzo and Poggendorff illusions among 10 hypnotically 
age-regressed college students were typical of normative data obtained from 
young children, This effect was almost complete for the Ponzo but not as 
pronounced for the Poggendorff illusion. A control group under task motivation 
instructions failed to produce comparable results. It appears as if age regression 
facilitates the use or nonuse of visual cues and mechanisms in the manner typical 
of earlier stages of perceptual development. It is suggested that these illusions 
may serve as an objective indicator of hypnotic age regression, and that 
hypnotic age regression may provide a methodological approach to the experi- 


mental study of perceptual development. 


The use of standard-hypnotic-induction 
techniques has been reported to facilitate the 
retrieval of responses, verbal and nonverbal, 
presumably not emitted for long periods of 
time. The retrieval of past events by means 
of hypnosis is usually accomplished by sug- 
gesting that S is actually the age at which 
the event is assumed to have occurred. This 
technique, referred to as “age regression," has 
been applied to the retrieval of memory for 
specific events, such as dates (True, 1949), 
performance on intelligence and projective 
tests (Mercer & Gibson, 1950; Orne, 1951), 
physiological responses (Gidro-Frank & Bow- 
ersbuch, 1948; Kupper, 1945), and numerous 
other behaviors reviewed by Barber (1962), 
Gebhard (1961), and Yates (1961). 

According to Barber, the research gen- 
erated by the concept of age regression may 
be questioned on methodological grounds 
(Barber, 1967). The primary criticisms are 
poor controls and unconvincing dependent 
variables. Reviews by Gebhard (1961) and 
Yates (1961), while less critical than Bar- 
ber, concur in the need for rigor in experi- 
mental methodology if the researcher is to 
establish the validity of age regression. 


1This study was supported by Grants MH 08061 
and HD 00151 from the National Institute of Mental 
Health. A preliminary report of this study appeared 
in Science, 1968, 159, 1375-1376. 

? Requests for reprints should be sent to Richard 
M. Lundy, Department of Psychology, Pennsylvania 
State University, University Park, Pennsylvania 
16802. 


An approach to the description and under- 
standing of hypnosis is to identify those re- 
sponses elicited under hypnosis which cannot 
easily be made under other kinds of sugges- 
tions. If responses can be identified which 
cannot be made at all under other kinds of 
suggestions, then in the description of those 
responses may be found the key to under- 
standing the nature of hypnosis. Specifically, 
the validity of age regression can be better 
established when responses typical of children 
but not of adults are produced under age 
regression and when these same responses are 
not produced under a waking suggestion. Of 
course, the possibility of the validity of age 
regression is increased if those behaviors at 
the regressed “age” are in fact similar to the 
actual responses given by children at that age. 

It was recently determined that the Ponzo 
(Figure 1) and Poggendorff (Figure 2) illu- 
sions both vary systematically as a function 
of age (Leibowitz & Gwozdecki, 1967; Leibo- 
witz & Judisch, 1967). The magnitude of the 
Ponzo illusion increases with age from child- 
hood through adolescence, approaches an 
asymptote, and decreases again in old age. 
The Poggendorff illusion decreases in mag- 
nitude with age from 5 to approximately 10 
yr., and then remains stable. 

The S’s responses to these illusions could 
serve as an adequate indication of the facili- 
tation of retrieval of age-appropriate percep- 
tual responses while age regressed. This would 
not be a matter of memory, since it is highly 
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unlikely that most Ss were exposed to the 
specific experimental stimuli at the ages of 
5 and 9. In addition, this task seemed espe- 
cially well suited inasmuch as the age func- 
tions of these particular illusions develop in 
opposite directions, It is virtually certain that 
Ss had no previous knowledge of these norma- 
tive functions. Therefore, the possibility of 
acquiescence to expectations is effectively 
eliminated. The purpose of the present study 
was to determine whether retrieval of child- 
hood responses to the Ponzo and Poggendorff 
illusions would be facilitated by age regres- 
sion by means of a standard-induction tech- 
nique. 


METHOD 


Subjects. Undergraduate students from the intro- 
ductory psychology courses at the Pennsylvania State 
University, who answered positively to the state- 
ment “I would be willing to participate in a hypnosis 
experiment” were subjected to a standard-hypnotic- 
induction procedure and then tested on Barber's 
Suggestibility Scale (BSS, Barber & Glass, 1962). 
The first 10 students obtaining a score of 8 on the 
scale were selected for the present experiment. There 
were 4 females and 6 males. Further selection of 2 
female and 3 male Ss for a control group was deter- 
mined by the same procedure. The Ss ranged in age 
from 18 to 21 with a mean of 19,1 yr., and had 
normal vision either with or without correction. 

Apparatus. The apparatus and Procedure used in 
this experiment were identical to that used by Leibo- 
witz and Judisch (1967) for the Ponzo illusion and 
by Leibowitz and Gwozdecki (1967) for the Poggen- 
dorff illusion, For the Ponzo illusion, the stimuli 
consisted of a series of 21 cards similar to the insert 
in Figure 1, but with the line nearer to the point of 
convergence varying in length from 23 to 5 in. in 3- 
in. steps. The other vertical line was constant in 
length at 4 in. The Poggendorff figure was made of 
metal with the right half of the diagonal adjustable 
(see the insert in Figure 2). The center of the figure 
was 13 X 4 in. and the diagonal bars were each 73 X 
3% in. with a 45° angle formed between the center 
bar and the diagonal bar. Details of the construction 
of the stimuli are discussed in the original publica- 
tions, 

Procedure, Each S was presented the stimuli for 
the Ponzo and Poggendorff illusions under four con- 
ditions: no hypnosis, hypnosis-no regression, hypnosis 
and regression to age 9, and hypnosis and regression 
to age 5. Only one condition was presented on a given 
day, at weekly intervals, as determined by random 
order. In each hypnosis session a standard-induction 
procedure and the BSS were administered and then 
the stimuli were presented. The illusion presented first 
on a given day was randomly determined. 

The Ponzo stimuli were presented in a random 
order, one card at a time, horizontally on a table 
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before S who sat approximately 18 in. from the card. 
The S was given the following instructions: “I am 
going to show you some cards with lines on them, 
I want you to point to, or tell me, which of the 
two vertical lines is longer or bigger.” The procedure 
was adapted from the Leibowitz and Judisch study 
(1967) which was designed for administration with 
children. The age-regressed Ss were able to follow 
these instructions without difficulty. After presenta- 
tion of the first and second cards in the series E 
gave no further instructions and simply recorded 
S's responses without comment. Inasmuch as the 
order of the cards was random and E was uninformed 
as to the method for determining the equality value, 
he was naive regarding the significance of S's re- 
sponses for any given card at the time of presenta- 
tion. The equality value was determined later by 
interpolation from the transition point at which S's 
response changed. 

The S was given six trials each session with the 
Poggendorff apparatus. The right diagonal bar was 
movable and was set, in an alternating order, signifi- 
cantly out of line above or below the nonmovable, 
standard left bar. Instructions were given to S to 
"adjust the right-half of the bar so that it seems to 
be in a straight line with the left bar." The E demon- 
strated how the bar could be moved and S was asked 
if he understood the task. When necessary the in- 
Structions were repeated. S's line of regard was kept 
on a vertical plane relative to the center bar of 
the illusion apparatus. 

Between presentation of the Ponzo and Poggen- 
dorff illusions, hypnotized Ss were asked to close 
their eyes and the age-regression suggestion was 
reemphasized, The entire presentation time for both 
illusions was approximately 15 min. for the hypnosis 
conditions, 

A control group of five Ss, selected by the same 
procedures as the experimental group,* were pre- 
sented the same stimuli under two conditions: (a) 
routine instructions without any age regression or 
function suggestion, and (b) task motivation instruc- 
tions in which they were told “I want you to be five 
years of age; you are to regress to age five and we 
will still be able to talk to each other. You can do 
this for me and I am going to help you do this by 
asking you some questions." To facilitate the task- 
motivation instructions, control Ss were then pre- 
sented the same list of questions given to the age- 
regressed experimental Ss. These questions were 
presented with the suggestion to S that “you are nom 
five years old and I want to ask you some questions, 
and included inquiries about where he lived, places 
he visited, his birthday party, and his attitudes 
toward school and playmates, The S was also asked 


3 Even though E was naive regarding the method 
for determining the equality value and thus did nof 
know the significance of any single response, he did 
know whether S was age regressed or not. Thus; 
the possibility of E bias has not been completely 
eliminated. i 

* The Ss in both experimental and control condi- 
tions were serving in their first hypnosis experiment. 
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Fic. 1, Ponzo illusion. (The equal vertical lines in the Ponzo illusion (inset) appear unequal. The graph 
describes the magnitude of this illusion as a function of chronological age, open circles; superimposed are the 
data obtained under hypnosis, solid circles.) (Reprinted from an article by Michael Parrish, R. M. Lundy, 
and H. W. Leibowitz published in the March 1968 Science. Copyrighted by the American Association for 


the Advancement of Science, 1968.) 


to print his name and to make some numbers. The 
handwriting samples and the numbers taken from 
this session and those taken in an earlier S-screening 
session during which these contro] Ss were hypno- 
tized and regressed to age 5 were submitted to a 
faculty member who was asked to determine if the 
samples, for a given S, were or were not made dur- 
ing the same stage of motor development. The report 
of the rater that these samples looked like they were 
made by a “child” during the same period of motor 
development indicated that Ss would write and make 
numbers when not hypnotized that were as “child- 
like" as those when they were hypnotized and re- 
gressed. This report, considered with the fact that 
Ss were willing to answer questions about events 
during this fifth year, seems to indicate that these 
control Ss were motivated to cooperate in trying to 
“be 5 years old." The two control conditions were 
also presented randomly and separated by 1-wk. 
intervals. 


RESULTS 


Ponzo illusion. 'The magnitude of the illu- 
sion was calculated as the difference between 
the physical equality setting and the subjec- 
tive equality value interpolated from S's re- 


* sponses, Since the comparison line varied in 


4-in. steps the interpolation was to the nearest 
jy in. The mean scores and standard devia- 
tions for each condition of the present experi- 
ment, together with the mean scores for the 
appropriate age group in the Leibowitz and 
Judisch (1967) study, are presented in Table 
1. To facilitate comparison with the norma- 
tive data obtained as a function of chrono- 
logical age (CA), the present results are 
superimposed on the function obtained in the 
previous experiment (Figure 1). It will be 
observed that the data in the absence of age 
regression, either with or without hypnosis, 
are similar to those obtained with Ss of com- 
parable CA. With regression to age 9, there 
is a decrease in the magnitude of the illusion 
which is identical to that obtained with vari- 
ation of CA. With further regression to age 
5, there is a further decrease in the size of 
the illusion which is not as marked as that 
obtained with chronological variations. 

An analysis of variance leads to a rejection 
of the hypothesis that the means for the four 
conditions are equal (F = 7.27, p<.01). A 
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TABLE 1 
Mean MAGNITUDE OF Ponzo ILLUSION IN INCHES 


Conditions 
Data 
Hypnosis | Hypnosis | Hypnosis 
No 
hypnosis | rest | ESS | CON 
Present 
M 53 St 39 AT 
SD 11 uty || OSE 16 
Mean age of groups 
19.3 19.3 9.2 52 
Leibowitz and 
Judisch 
M 51 51 32 40 
SD 24 24 A7 22 


Newman-Keuls procedure (Winer, 1962) was 
employed to determine which pairs of treat- 
ment means differed significantly. Table 2 
reveals that, with the exception of the dif- 
ferences between conditions no hypnosis and 
hypnosis—no regression, all mean differences 
are significant. This effect and the observed 
correspondence between the present means 
and the normative means (Figure 1) suggest 
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TABLE 2 


MATRIX OF DIFFERENCE BETWEEN MEANS OF THE 
Four CONDITIONS FOR THE PONZO ILLUSION 


diti Seend | Kester | Hypnosis tN 
s osi 
Condition [e posa ypnosis | hypnosis 0g 
y 

Hypnosis re- 

gressed to 5 ga 34* 36* 
Hypnosis re- 

gressed to 9 12* 14* 
Hypnosis 2 
No-hypnosis 

*p «.05. 


strongly that hypnotic induction facilitated 
the retrieval of the age-appropriate percep- 
tual response to this illusion. The data for 
the control group obtained under the condi- ^ 
tion of no task-motivation were also similar 
to those obtained by Ss of the same CA, 
being .68 in. Under the task-motivation in- 
structions, the value of the illusion increased 
to a mean value of .98 in. This value is in 
the opposite direction from the value obtained 
for children of age 5 and suggests that the 
task-motivation condition did not facilitate 
the retrieval of age-appropriate perceptual re- 
sponses to this illusion. 


13 15 I7 19 


AGE REGRESSION AND PERCEPTION OF ILLUSIONS 


Poggendorf illusion. The magnitude of the 
illusion was calculated as the differences be- 
tween the physical quality setting and sub- 
jective equality value given by S. A scale, 
calibrated in 4-in. steps was attached to the 
back of the center bar. The means for the 
six trials under each condition are presented 
in Table 3 and superimposed on the norma- 
tive data obtained as a function of CA in 
Figure 2, The magnitudes of the illusion in 
the absence of age regression and for the 9- 
yr. age-regression group are essentially iden- 
tical to those previously obtained, while the 
data for the 5-yr. group are in the same di- 
rection but not as marked as in the previous 
study. 

An analysis of variance leads to a rejection 
of the hypothesis that the means of the four 
conditions are all equal (F = 4.10, p < .05). 
A Newman-Keuls procedure computed to de- 
termine which pairs of the treatment means 
differed significantly revealed significant dif- 
ferences between the no-hypnosis and hyp- 
nosis-regressed-to-age-5 conditions, and be- 
tween the hypnosis and hypnosis—regressed- 
to-age-5 conditions (Table 4). The data for 
the control group obtained under the no-task- 
motivation condition were similar to those 
obtained by Ss of the same CA, being .94 in. 
However, for the task-motivation condition 
the mean value was 1.27 in. which corre- 
sponds to the hypnosis-regressed-to-age-9 con- 


TABLE 3 
MEAN MAGNITUDE OF THE POGGENDORFF ILLUSION 
IN INCHES 
Conditions 
Data Hypnosis | Hypnosi: 
s nosis 
hy ois | Hypnosis regressed regressed 
Present 
M. .96 1.03 1.26 1.41 
SD 39 32 38 24 


Mean age of groups 


19 19 8.5 5 
Leibowitz and 
Gwozdecki 
M 1.06 1.06 1.29 1.83 
SD .38 .38 .36 .36 
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TABLE 4 


MATRIX or DIFFERENCE BETWEEN MEANS OF THE FOUR 
CONDITIONS FOR THE POGGENDORFF ILLUSION 


y Hypnosis | Hypnosis No 
Condition cone A eet Hypnosis hypnosis 

Hypnosis re- 

gressed to 5 15 35* 45* 
Hypnosis re- 

gressed to 9 23 30 
Hypnosis if 
No-hypnosis 

* p <.05. 


dition rather than to the hypnosis-regressed- 
to-age-5 condition. 


Discussion 


Reponses to the Ponzo illusion appropriate 
for children were emitted by young adults 
while age-regressed. The conclusion that age 
regression facilitates the use or nonuse of 
visual cues in a manner appropriate to earlier 
stages of perceptual development is supported 
by the correspondence between the present 
data and the normative data based on CA 
variation. Further support is provided by the 
fact that Ss given task-motivation instruc- 
tions were unable to respond in a manner 
appropriate to 5-yr.-old children. These Ss 
were selected in the same manner as the ex- 
perimental groups and were presumably as 
suggestible as the experimental Ss. Careful 
postsession interviews indicated that these 
Ss were surprised to learn that their responses 
for the task-motivation task were considera- 
bly different from those actually obtained 
from children. Handwriting samples taken 
while Ss were hypnotically regressed to age 5 
and while under task-motivation instructions 
to write as a 5-yr.-old were not noticeably 
different, This is of interest in that it sug- 
gests that Ss did attempt to follow the task- 
motivation instructions. However, further in- 
vestigation regarding this condition should be 
undertaken. It is difficult to assess the per- 
ceived differences in the regression instruc- 
tions between Ss who have just been given 
the standard-hypnotic-induction technique 
and Ss who have had no induction instruc- 
tions, even when the regression instructions 
are held constant. It would be of interest to 
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investigate the parameters of the age-regres- 
sion instructions independently of the stand- 
ard-hypnotic-induction technique by encour- 
aging Ss to “really feel like you are five years 
old” within the task-motivation condition. 

The retrieval of perceptual responses for 
the Poggendorff illusion was not as marked 
as that seen for the Ponzo illusion. The mean 
obtained for the hypnosis-regressed-to-age-5 
group was considerably lower than the norma- 
tive data for the 5-yr.-olds. While it is evi- 
dent that the mean for the hypnosis- 
regressed-to-age-9 condition was similar to 
the normative data for this group, the New- 
man-Kuels procedure revealed no significant 
differences between this condition and the 
hypnosis-regressed-to-age-5 condition. These 
results, not seen with the Ponzo illusion, sug- 
gest that the induction procedure did not 
facilitate the retrieval of the perceptual cues 
for this illusion for age 5 to the same degree 
that it did for the Ponzo illusion. 

A satisfactory explanation of this differ- 
ence is not obvious. However, it should be 
pointed out that these illusions are most prob- 
ably mediated by different mechanisms. For 
example, it has been hypothesized that the 
Ponzo illusion is a result of inappropriate 
constancy scaling and is related directly to the 
individual's environmental experiences (Fil- 
ehne, 1898; Gregory, 1963; Tausch, 1954; 
Von Holst, 1957). A similar explanation does 
not appear to be appropriate for the Poggen- 
dorff illusion. 

More importantly, whatever processes of 
perceptual development are responsible for 
the observed changes in the magnitude of 
these illusions as a function of CA, they are 
apparently not completely irreversible. These 
results further suggest the possible use of 
hypnotic age regression as a methodological 
approach to the study of developmental 
changes in perception. 
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LEARNING AND USE OF LOGICAL SYMBOLS BY 
DEAF AND HEARING SUBJECTS* 


SOLIS L. KATES? 


University of Massachusetts 


Deaf Ss were as able as hearing Ss matched on age, IQ, and socioeconomic 
background in learning and using logical concepts, and superior to hearing Ss 
matched on written language comprehension, IQ, and socioeconomic background 
in learning these concepts. Verbal language comprehension apparently does not 
possess an exclusive determining influence in learning and grasping the signifi- 
cance of these concepts. It seems justifiable that the ability to identify 
correspondences between logical symbolic expressions and their stimulus 
combinations, to verbalize accurately the rules of these correspondences, and to 
use learned symbols in new contexts constitute increasingly more difficult levels 
of conceptual behavior. Supplementary verbalization of rules leads to more 
successful transfer than learning the rules without correct verbal formulations. 


Are deaf adolescents able to match logical 
symbols with their appropriate referents in a 
series of corrected trials? Are they able to de- 
scribe the significance of these symbols cor- 
rectly after these acquisition trials? And are 
they able to employ these symbols in a test 
situation with new stimulus features? These 
questions are explored in this study, which 
was essentially a replication of an investiga- 
tion by Furth and Youniss (1965). 

The logical symbols, -, /, and - , repre- 
sent three different connectives—“and,” “or,” 
and “not,” respectively. These symbols were 
combined with the letters, B and R—B for 
blue and R for red—to form three different 
symbolic expressions representing three dif- 
ferent concepts—the conjunctive (B-R), the 
exclusive-disjunctive (B/R), and the conjunc- 
tive-absence (B‘R). In the acquisition phase, 
where corrections were given, each trial re- 
quired the matching of certain color combi- 


1 This research was supported, in part, by Research 
Grant No. HD 00336 from the United States Public 
Health Service. This paper was presented at the 
meeting of the Eastern Psychological Association, 
Washington, D. C., April 20, 1968. The author ex- 
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schools in Northampton and Easthampton, Massa- 
chusetts, 

2 Requests for reprints should be sent to the au- 
thor, Psychology Department, University of Mas- 
sachusetts, Amherst, Massachusetts 01002. 


nations with one of the three expressions 
incorporating the letters B and R and the 
logical symbol representing the connective. In 
the test phase, these same logical symbols, 
standing for the same connectives, were used 
but with new letters, C and D, referring to 
combinations of the new stimulus features, 
circular and dark. 

Furth and Youniss (1965) have identified 
two different modes of learning the relation- 
ships between the symbolic expressions, con- 
sisting of the logical connectives and the let- 
ters B and R, and the corresponding stimulus 
combinations. In the first, learning this con- 
nection is probably a result of remembering 
their perceptual correspondence. In the sec- 
ond, the connection between the symbolic 
expression and the stimulus combination is 
learned, but this learned connection is or can 
be divorced from the concrete data—that is, 
from the stimulus features on which it was 
originally built. Such learning approximates 
the formal operations that Piaget (1957) has 
described. When learned in the latter way, 
this connection may be successfully applied 
to new stimulus combinations when it is rele- 
vant. Tn order to evaluate whether this second 
mode of learning has occurred, the successful 
Ss from the acquisition phase were examined 
on the test phase. 

Deaf Ss tested by Furth and Youniss 
(1965) could not apply these logical symbols 
to new stimulus combinations. Only 796 of 
their deaf Ss, aged 12 through 20 yr., who at- 
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tained criterion on the acquisition task were 
successful in transfer. However, 64% of the 
urban, hearing Ss aged 12 through 14 yr. 
successfully matched these logical connectives 
with the new stimulus combinations in the 
test phase. Furth and Youniss ascribed the 
poor transfer ability to the deaf Ss’ failure to 
look spontaneously for an underlying basis 
linking logical connectives with their appro- 
priate referents. Satisfied from previous re- 
search that deaf Ss are able to comprehend 
and use these logical symbols when specifically 
taught, Furth (1966) believed that they 
learned by rote, with a lack of intellectual 
curiosity in tasks where comparable hearing 
Ss attempt to discover the significance of 
these symbols. Furth believes that such im- 
pairment of spontaneous learning in a dis- 
covery task is related, in part, to the re- 
stricted experience of the deaf Ss and not 
simply to limited verbal language. 

Furth and Youniss used deaf Ss who were 
unable to comprehend written language at 
the difficulty level of ordinary conversation; 
possibly this language factor may have played 
some role in their findings, Are deaf Ss who 
can understand written connected language at 
some minimal level, such as the fifth-grade 
level, able to learn and then employ the 
learned logical symbols in new situations? It 
was thought that deaf Ss, matched with 
hearing Ss on the relevant variables, would 
be as able to learn and then employ the 
learned logical symbols in new stimulus con- 
texts, provided that they had this minimal 
language comprehension. The enrichment of 
their experiences which may result, in part, 
from their language ability should help the 
deaf in learning the connections between 
symbolic expressions and stimulus combina- 
tions. In addition, the support given by a 
correct verbal formulation should be of con- 
siderable value in using the logical symbol 
in a new context (Inhelder & Piaget, 1958, 
p. 253). 

Consequently, the author obtained verbal 
statements describing the significance of the 
symbolic expressions following the completion 
of the acquisition phase. Tt is believed that, 
for both deaf and hearing Ss, attaining cri- 
terion on the acquisition task and giving cor- 
rect verbal descriptions of the symbolic ex- 


Kates 


pressions would be associated with greater 
success on the test phase. These verbal form- 
ulations, when present with the learned con- 
nections, should help direct the responses of 
Ss to relevant stimulus combinations and 
should help resist the interference of irrelevant 
stimulus features. One can say that there is a 
more “abstract” attitude, in the Goldstein 
sense, when the learned connection and its sup- 
plementary verbal formulation are present 
than for the learned connection alone. 

It is possible that this study may have 
some relevance to the contribution of age and 
language to the learning of logical symbols. 
When deaf Ss are matched on written lan- 
guage comprehension with hearing Ss, they 
are usually older and may show the impact of 
more contacts with the physical and social 
environment on their learning of logical sym- 
bols. If the deaf are more able than these 
language matched, but younger, hearing Ss 
to acquire the connection between the logical 
symbols and their respective instances, and to 
transfer this learning to a new task, it may 
provide evidence for the role of age-related 
adaptations on the development of logical 
thinking. It would seem to show that mental 
operations basic to logical thinking may be 
refined or extended by language but are not 
dependent on language for their emergence. 
For example, in discussing the logical opera- 
tion of inclusion, Piaget (1957) stated that 
"the child has not simply to carry out a verbal 
or symbolic translation of the perceptual data, 
but an operational composition or decomposi- 
tion of its elements . . . [p. 5]." Further- 
more, Piaget (1957, p. 6) continued on to 
state that such perception involves knowledge 
of logical operations, which require age- 
related contacts with the environment. But if 
the deaf and language matched hearing Ss are 
not different in their learning of these logi- 
cal relations, the effect of language on logical 
behavior may have to be given somewhat 
additional consideration. 

The conjunctive concept has been described 
as easier than the exclusive-disjunctive or con- 
junctive-absence concepts for adolescents 
(Youniss & Furth, 1964). Yet Neisser and 
Weene (1962) have indicated that the con- 
junctive-conjunctive-absence, and exclusion 
(similar to the exclusive-disjunctive concept) 
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LEARNING BY DEAF AND HEARING SUBJECTS 


concepts are not significantly different for col- 
lege students. This question can be further 
examined in the context of this investigation. 


METHOD 
Subjects 


Sixteen deaf Ss were selected from the Clarke 
School for the Deaf in Northampton, Massachusetts, 
who were between the ages of 11 and 17 yr. who 
had been deaf from birth or before the age of 3, and 
could not have learned connected English language 
without special instruction. There were 8 girls and 
8 boys. 

A group of 16 hearing Ss were selected from the 
Kennedy Junior High School in Northampton, 
Massachusetts, and from the Northampton High 
School in Northampton, Massachusetts. They were 
between 11 and 17 yr. of age and were matched with 
the deaf Ss on chronological age, sex, socioeconomic 
level, and IQ (hereafter known as hearing-age Ss). 
Another 16 hearing Ss between the ages of 9 and 11 
yr. were selected from the fourth, fifth, and sixth 
grades of the Easthampton, Massachusetts, ele- 
mentary school system; they were matched with the 
deaf Ss on IQ, sex, socioeconomic background, and 
comprehension of written language (hereafter known 
as hearing-achievement Ss). The IQ scores were 
obtained by using the Picture Arrangement, Block 
Design, and Coding subtests of the appropriate 
Wechsler Intelligence Scales for children. The aver- 
age of the word and paragraph meaning scores on the 
Stanford Achievement Test were used to match 
hearing and deaf Ss on written language compre- 
hension. The Hollingshead scale (1957) was used 
to provide a numerical rating of the fathers" 
occupations for matching groups on socioeconomic 
level. Table 1 presents the means and standard 
deviations for Ss on these independent and control 
variables. 


Stimulus Materials 


Phase 1: Acquisition, In the acquisition task 48 
stimulus cards, each measuring 4 X 6 in., were used. 
Sixteen stimulus cards exemplified the conjunctive 
concept, 16 cards the exclusive-disjunctive concept, 
and 16 cards the conjunctive-absence concept. Each 
of the 16 conjunctive concept cards was colored half 
red and half blue (with the left-right color position 
counterbalanced). The correct response for the con- 
junctive card was B-R, representing a conjunction 
of blue and red. 

The 16 exclusive-disjunctive concept cards consisted 
of 4 red and yellow cards, 4 red and green cards, 4 
blue and yellow cards, and 4 blue and green cards. 
(Again the position of these colors was counter- 
balanced.) The disjunctive response of B/R was to 
be given to any card containing blue and another 
color besides red, or containing red and another 
color besides blue. 

Each of the 16 conjunctive-absence concept cards 
was colored yellow and green. (Again the position 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS FOR AGE, IQ, 
WRITTEN LANGUAGE COMPREHENSION, AND 
SOCIOECONOMIC BACKGROUND 


Stanford 
Group Age 1Q:.|| Tagine | see 
menta 
Deaf 
2x 15-0 109.3 5.3 3.8 
SD 27 mo. 10.0 8 1.6 
Hearing age 
X 15-1 |1099| — 44 
SD 25 mo. 9.5 13 
Hearing achieve- 
ment 
10-11 | 109.9 5.3 43 
SD 10 mo. 14.9 9 13 
Note.—N = 16. 


a In grade level. 
b Hollingshead Socioeconomic Index. 


of these colors was counterbalanced.) The correct re- 
sponse for this class was B-R, which was to be given 
to any card which was neither blue nor red. 

Phase 2: Test. Here 18 cards were used, with 5 
cards illustrating the conjunctive concept, 5 cards the 
conjunctive-absence concept, and 8 cards the ex- 
clusive-disjunctive concept. In the test task, there 
was a change in the two critical elements; instead 
of blue and red, the two elements were now 
circular form and dark color. The 5 conjunctive 
cards contained dark colored circular figures (circular 
and dark, C-D); the 8 disjunctive cards included 4 
cards having dark colored angular figures and 4 
cards with light colored circular figures (either circu- 
lar or dark, C/D); and the conjunctive-absence 
cards contained light colored angular figures (neither 
circular nor dark, C-D). 


Procedure 


Phase 1: Acquisition. Each S was tested individ- 
ually in both the acquisition phase and the test 
phase. In the acquisition phase S was instructed to 
look carefully at colored cards shown one at a time, 
and to choose which of the three answers (B-R, B/R, 
B-R) he thought belonged with that colored card. 
He was told that in the beginning he would have 
to guess, but that corrections given by E would 
help him find the correct answer for each card. He 
was not allowed to make any marks or changes in 
previous answers. There were a total of 48 trials, 
or eight blocks of 6 trials each; within each block, 
there were two instances of each concept shown in 
random order. After the 48 trials, S was asked to 
write the meaning of each of the three logical sym- 
bolic expressions. 

Phase 2: Test. The test phase consisted of 18 
stimulus instances with circular and dark as the 
critical elements presented in random order. The S 
was again instructed to look carefully at the cards 
and, using what he had learned before (in the 
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TABLE 2 


CHI-SQUARE VALUES AND FISHER f VALUES FOR COM- 
PARISONS OF THE NUMBER OF SuBJECTS WHO 
ATTAINED CRITERION IN THE 
ACQUISITION PHASE 


Number | Number 


Condition | whoat- | who did Statistic 
tained | not attain 

Deaf 16 0 x? = 13.9, p <.001 

a eve- 

ment 5 1 > (Fisher) = .0001 
Deaf 16 0 ns 
Hearing age 13 3 
Hearing age 13 3 x? = 6,22, p «.02 
Hearing 

achieve- 

ment 5 11 p (Fisher) = .005 


acquisition task), decide which answer (now C-D, 
C/D, C-D) belonged with each card. The test phase 
was administered in the same way as the acquisition 
phase except that no corrections were given, and S 
circled the correct answer instead of writing it in a 
blank. Immediately after the last trial, S was asked 
to write the meaning of the three response symbols. 
Throughout the testing, E was careful not to ver- 
balize the three symbols or their meanings. 

Each S was considered to have attained criterion 
on the acquisition task if he fulfilled either of the 
following two standards: (a) he had to achieve 10 
consecutively correct responses with no more than 
one error following these 10 responses or (b) he had 
to respond correctly in the last seven trials. On the 
test task, S was considered to have attained cri- 
terion if he fulfilled one of the following two condi- 
tions: (a) he had to have 10 consecutively correct 
responses with no more than a total of 3 wrong 
responses and no more than 1 wrong response after 
the 10 consecutively correct responses or (b) he 
had to have the last 8 (or more) consecutively 
correct with no more than a total of 3 wrong 
responses. 

Finally, in order to evaluate the written descrip- 
tions given to each of the three symbolic expressions, 
a scoring system was devised with illustrative ex- 
amples for assigning a score of 3, 2, or 1 to each 
description. Essentially, the description of a concept 
received a score of 3 if it included both the appro- 
priate critical elements and the proper logical con- 
nective or its equivalent. Examples are blue and red 
for B-R, circular form and dark color for C-D, 
circular | or dark for C/D, and neither blue nor red 
for B-R. A score of 2 was given to descriptions that 
mentioned the two critical elements without specify- 
ing the proper logical connective, such as blue-red 
for B-R, a circle, dark color for C-D, blue with 
yellow for B/R, green and yellow for B-R, and a 
pink triangle for C-D. A score of 1 was given when 
the proper critical elements were not specified and 
the appropriate logical connective was not included, 
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such as blue for B-R, circle for C/D, and triangle for 
C-D. A maximum score of 9 was possible for each 
phase. Scoring was done independently by two judges, 
reliability was .98. All protocols were scored without 
knowledge of S's identity or group. 


RESULTS 


The data indicate that deaf Ss and hear- 
ing-age Ss were not significantly different in 
the number who attained criterion in the 
acquisition phase (Table 2). However, the 
number of hearing-achievement Ss who at- 
tained criterion in the acquisition phase was 
significantly smaller than either the number 
of deaf Ss or the number of hearing-age Ss 
(Table 2). 

When the data for each S group of 16 
were analyzed for the correct verbal descrip- 
tion of the three concepts after the acquisi- 
tion phase (Table 3), 7 of the deaf Ss, 6 of 
the hearing-age group, and none of the hear- 
ing-achievement group were able to describe 
the three concepts correctly. 

Further, of those 16 deaf Ss who attained 
criterion in the acquisition phase, just 5 Ss 
were able to attain criterion in the test phase. 
Similarly, 4 of the 13 hearing-age Ss who at- 
tained criterion in the acquisition phase were 
able to achieve criterion in the test phase. 
None of the 5 hearing-achievement Ss who 
attained criterion in the acquisition phase 
were able to succeed in reaching criterion in 
the test phase. Statistical tests evaluating 
the differences in the proportion of successful 
Ss in each of the three groups who reached 


TABLE 3 


CHI-SQUARE AND FISHER p VALUES FOR COMPARISONS 
OF THE NUMBER OF SUBJECTS WHO VERBALIZED 
CORRECTLY THE THREE CONCEPTS 
FOLLOWING ACQUISITION 


Number 
Number | who did 
balized not ver- 
Condition correct]; balize Statistic 
the three | correctly 
the three 
concepts concepta 
Deaf 7 9 2266 p < -02 
Hapis j 
a jeve- 
ment 0 16 p (Fisher) = .003 
Deaf 7 ns 
Hearing age 6 10 
Hearing age 6 10 2 = 5,1, p <.05 
Hearing s 
achieve- 
ment 0 16 > (Fisher) = .007 
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criterion in the test phase did not produce any 
significant findings (Table 4). It should be 
recalled that about one-third of the Ss in the 
deaf and in the hearing-age groups who were 
successful in the acquisition phase were also 
successful in the test phase. 

The author also examined how correct ver- 
balizations together with achieving criterion 
on the acquisition phase affected performance 
in the test phase. Table 5 shows the distribu- 
tion of Ss who attained criterion on the acqui- 
sition phase together with correct descriptions 
of the three concepts, and of Ss who succeeded 
in attaining criterion on the acquisition phase 
but were not able to describe the three con- 
cepts correctly. Chi-square and Fisher proba- 
bility tests show that there was a significant 
finding indicating that correct verbalization 
and successful performance in the acquisition 
phase was associated with significantly greater 
numbers of Ss who attained criterion on the 
test phase. A significant relationship was also 
obtained for the hearing-age Ss but there was 
only a trend (at the .10 level) for the deaf Ss 
(Table 5). 

There were no significant differences in 
difficulty of the three concepts for the deaf 
and for the hearing-achievement Ss on the 
acquisition phase. However, on the acquisition 
phase the conjunctive concept was signifi- 
cantly easier for the hearing-age Ss than the 
exclusive-disjunctive and conjunctive-absence 
concepts whereas the latter two were not sig- 
nificantly different. The F (df = 2/45) value 
for the hearing-age group was 5.9, significant 
at the .01 level. Also, the percentage of correct 
trials for those Ss in each of the three groups 
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TABLE 4 


CHI-SQUARE AND FISHER p VALUES FOR COMPARISONS 
OF THE NUMBER OF SuBJECTS WHO WERE 
SUCCESSFUL ON ACQUISITION AND 
WHO ATTAINED CRITERION IN 
THE TEsT PHASE 


Number who | Number who 


Condition Adie ye Cs; attain] p 
test phase | test phase 

Deaf 5 11 RR 
Hearing achievement 0 5 
Deaf 5 11 " 
Hearing age 4 9 
Hearing age 4 9 ns 
Hearing achievement 0 5 


who had been unable to attain criterion on 
the test phase although they had been suc- 
cessful on the acquisition phase was com- 
pared. Analyses of variance showed no sig- 
nificant differences in the difficulty of the 
three concept types on the test phase for the 
three S groups for those Ss who succeeded on 
the acquisition phase but failed the test 
phase. 


DISCUSSION 


The deaf Ss were as able as the hearing-age 
Ss to attain criterion in the acquisition phase. 
Inasmuch as the acquisition phase involved 
establishing the relationships between stim- 
ulus materials which were largely perceptual 
in nature, through an immediate corrective 
feedback procedure, it could be anticipated 
that the deaf Ss would do as well as the 
hearing-age Ss. There was a significantly 
greater number of deaf and hearing-age Ss 


TABLE 5 


RELATION BETWEEN ACQUISITION PHASE AND CORRECT VERBALIZATION AND 
ATTAINING CRITERION ON TEST PHASE WITH CORRESPONDING 
CHI-SQUARE OR FISHER “p” VALUES 


Group in test pi 


Ss (deaf plus hearing ase} in Group 1 
Ss (deaf plus hearing age) in Group 2 


Deaf Ss in Group 1 
Deaf Ss in Group 2 


on eB no 


Hearing-age Ss in Group 1 
Hearing-age Ss in Group 2 


Ss who attained criterion 
hase 


Ss pelt ve Lie cri- Statistic 
5 x? = 6.3, p < 02 
15 P (Fisher) = .001 
3 p (Fisher) = .095 
? p (Fisher) = .025 


who 


` Note.— = Ss who attained criterion in acquisition phase plus verbalizing correctly the three concepts; Group 2 = Ss 
Ma iduirion in acquisition phase but did not verbalize correctly the three concepts. A 
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than hearing-achievement Ss who attained 
criterion in the acquisition phase. This greater 
success of the deaf Ss cannot be attributed 
to the comprehension of written language in- 
asmuch as they were matched on this vari- 
able with the hearing-achievement group. 
Verbal language comprehension, it seems, was 
not the crucial factor mediating the learning 
of the correspondence between symbolic ex- 
pressions and their stimulus combinations. 

When evaluated on the ability to give cor- 
rect verbal descriptions of the three logical 
symbolic expressions, the deaf and the hear- 
ing-age Ss proved to be significantly superior 
to hearing-achievement Ss. Even in the case of 
verbal descriptions of the logical symbolic 
expressions, similar ability to comprehend 
connected written verbal language apparently 
plays a small role. If the grasp of the signifi- 
cance of the logical symbolic expressions were 
solely dependent on similar comprehension of 
written verbal language, then deaf Ss and 
hearing Ss matched on this variable should 
not have been significantly different. But in- 
asmuch as they were significantly different 
on verbalization, then again it may be inferred 
that language ability was not the determin- 
ing influence in understanding the three logi- 
cal symbolic expressions. 

Consequently, on the basis of this finding 
and the one immediately described above, 
age seems to be more vital than written lan- 
guage capacity for the appropriate acquisition 
and verbal description of logical concepts. 
Probably the effects of age include greater 
experience with these concepts in daily life 
and with the component operations necessary 
to the understanding of these concepts, such 
as combining, dissociating, ordering, and set- 
ting up of correspondences. As Piaget (1957) 
has stated, capacity for logical thought de- 
pends more upon knowledge of the many 
necessary component operations than upon 
verbal symbolic skills. If one assumes that 
greater age is related to greater experience 
with and knowledge of these logical opera- 
tions, then it appears that these results offer 
partial support to the aforementioned formu- 
laion of Piaget. The present results emphasize 
that written language knowledge cannot sub- 
stitute for greater experience related to in- 
creased age, particularly at the age levels 
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studied here. The conclusion is that the de- 
velopment of skill in certain mental opera- 
tions must precede the ability to describe 
verbally the logical symbolic expressions cor- 
rectly; and skill in the use of these mental 
operations probably emerges from the active 
age-related adaptation efforts of the growing 
individual in his attempts to cope with en- 
vironmental fluctuations. 

The successful acquisition of the connection 
between logical symbolic expressions and their 
stimulus combinations affords a small guaran- 
tee that the learned symbols representing the 
logical connectives will be employed appro- 
priately in a new context. The small percent- 
age of deaf and hearing-age Ss who were able 
to use these logical symbols appropriately in 
the new context suggests that for most Ss 
their acquisition success took the form of 
perceptual correspondences between logical 
symbolic expressions and stimulus referents. 
In the hearing-achievement group, there were 
no Ss who could make this transfer. While 
no significant differences in deaf, hearing-age 
Ss, and hearing-achievement Ss could be es- 
tablished, one finding is nonetheless of inter- 
est. The substantial similarity in the transfer 
behavior of deaf and hearing-age Ss contra- 
dicts the Furth and Youniss (1965) result. 
One possible reason for this discrepancy is 
that in the present study the deaf and hearing 
Ss were matched as precisely as possible on the 
variables of age, IQ, and socioeconomic back- 
ground whereas Furth and Youniss did not 
match their deaf and hearing Ss. When deaf 
and hearing Ss have been matched and are 
approximately similar on these control vari- 
ables, they demonstrate an approximately 
equivalent capacity for spontaneous efforts 
to discover the significance of the logical sym- 
bolic expressions, 

When one accepts the correct verbalization 
of the three concepts as an alternative cri- 
terion of learning in the acquisition phase, 
one notes that there is greater predictability 
for logical behavior in the new stimulus con- 
text of the test phase. But there is a limiting 
factor even when successful verbal behavior 
characterizes the learning of the logical sym- 
bolic expressions by deaf and hearing-age SS. 
Inasmuch as approximately two-thirds of 
these Ss used these learned symbols success- 
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. fully in the new context, it is likely that the 
correct verbal descriptions of about one-third 
of these Ss were not entirely free of the 
concrete stimulus features of the acquisition 
phase to which they applied. Some of these 
correct descriptions probably represented ver- 
bal statements that could not be transformed 
into more abstract rules about the logical 
symbols applicable to the same logical com- 
binations but composed of different stimulus 
features. It seems justifiable to infer that the 
ability to identify correspondences between 
logical symbolic expressions and their stimu- 
lus combinations, the ability to describe ac- 
curately the rules of these correspondences, 
and the ability to use these learned symbols in 
new contexts constitute increasingly more dif- 
ficult levels of conceptual behavior related to 
age and possibly other factors such as intel- 
ligence. 

Another point of discussion deals with the 
significant relationship between attaining cri- 
terion in the acquisition phase plus correctly 
describing the logical symbolic expressions 
and successful behavior in the test phase. 
When Ss attained criterion in the acquisition 
phase without correct statements about the 
logical symbols, they showed significantly less 
success in employing these symbols in the 
new context of the test phase. Possibly, this 
supplementary verbalization of the rule helps 
by providing Ss with an ordering principle or 
cognitive structure which supports transfer 
behavior. When faced with different stimulus 
combinations that may disturb the employ- 
ment of the learned symbol, Ss who have the 
verbal statement of this rule available may 
possess an additional resource for initiating 
regulative control over their behavior. When 
this ordering principle is available in verbal 
form, there seems less likelihood that changed 
stimulus referents will disturb appropriate 
transfer behavior to any significant degree. At 
this point, the general finding that verbal abil- 
ity per se is not the determining factor in 
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the successful learning of logical symbols 
should be emphasized. However, the ability 
to put these logical symbols, once they have 
been learned, into verbal language apparently 
makes a significant contribution to transfer 
behavior. 

Only the hearing-age group found the ex- 
clusive-disjunctive concept more difficult than 
the other two concepts, and this finding 
relates specifically to the acquisition phase. 
On this acquisition phase, the other two 
groups did not experience any concept as more 
difficult than the others. On the test phase, 
where the relative difficulty of any concept 
or concepts would possibly yield information 
about which ones prevented attainment in 
a new context, there were no differences in 
concept difficulty. One can conclude that these 
three concepts present about the same degree 
of difficulty for most adolescent Ss in a learn- 
ing and correction phase and about the same 
degree of difficulty for all Ss when they have 
to deal with these concepts involving new 
stimulus elements in a phase where no correc- 
tion is given. 
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Homograph measures of vocabulary and semantic interpretation were used to 
test several theories of language disorder in schizophrenics and aphasics. 
Aphasics showed a loss of abstract vocabulary, but this did not appreciably 
affect their interpretation of word meanings in sentences. The raw perform- 
ance of the schizophrenics was consistently worse than the controls’ and 
better than the aphasics', despite comparability among the groups in education 
and verbal intelligence. However, there was no evidence of concretistic 
"regression" in any of the language measures of the schizophrenics. Schizo- 
phrenics were somewhat more inclined than controls to select primary meanings 
(strongest mediating responses) for ambiguous words in sentences, but this was 
not attributable to lack of knowledge of secondary meanings. For the most part, 
the results were consistent with Chapman's theory of primacy bias in 


schizophrenia. 


One reason for the persistent interest in 
the semantic aspects of language in mental 
patients, historically, was the early recogni- 
tion by clinicians that semantic changes take 
place, especially in schizophrenia and certain 
disorders involving known damage to the 
brain. Disturbance of association was re- 
garded by Bleuler (1950) as a primary symp- 
tom of schizophrenia “insofar as it involves a 
diminution or leveling of the number of 
affinities [p. 350]." On the other hand, Piro 
(1958) emphasized referential widening and 
“semantic dispersion" in schizophrenia. Con- 
cretism has long been observed in the problem 
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solving and language of mental patients (Fla- 
vell, 1956; Goldstein & Scheerer, 1941; Mil- 
gram, 1959), The great interest in studying 
these disturbances in schizophrenia is partially 
explained by the prospect of finding mea- 
surable organic causes for it that are common 
to brain disorders (e.g. aphasia) which 
clearly have less mysterious origins. Thus, 
empirical studies of the observed disturbances 
of association and abstraction may have sig- 
nificance for the etiology of schizophrenic 
symptoms. 

Another reason stems from a practical issue 
in clinical assessment: whether vocabulary 
knowledge is the intellectual function most 
resistant to deterioration and, therefore, the 
most stable index of premorbid functioning 
(Terman, 1918). Babcock (1930) developed 
an index of deterioration based on the ration- 
ale that deterioration occurs first in areas of 
new learning and the formation of new asso- 
ciations and last in those areas learned earliest 
in life (vocabulary). Critics have pointed out 
that real deterioration in conceptual level or 
organization can occur without showing an 
appreciable vocabulary loss because, in con- 
trast to tests of other functions, the standards 
for acceptable achievement on most vocabu- 
lary tests are so low (Feifel, 1949; Harring- 
ton & Ehrmann, 1954; Vacorzynski, 1941). 
While this evidence weighs against Babcock's 
hypothesis, it does not disprove its validity. 
Stronger contrary evidence comes from direct 
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tests of vocabulary deficit associated with 
brain damage (Williams, Lubin, & Gieseking, 
1959) or a schizophrenic episode (Lubin, Gie- 
seking, & Williams, 1962). Still, the practical 
need to measure intellectual deterioration re- 
mains, and clinicians continue to use vocabu- 
lary scores in this way, viz., the comparison 
of “hold” versus “don’t hold” subtests of the 
Wechsler Adult Intelligence Scale (WAIS). 

Finally, several theorists (Goldstein, 1948; 
Vigotsky, 1934; von Domarus, 1944; Werner, 
1948) have expounded a regression principle: 
that in certain forms of psychopathology the 
language of adults regresses to a more primi- 
tive, concrete level. Brown (1958) showed 
that this principle is a corollary of a popular 
developmental “law of linguistic progression,” 
which holds that progression from the con- 
crete to the abstract characterizes the evolu- 
tion of the species from animal to man, 
the historical development of contemporary 
languages and the transformation of children 
into adults, It has been inferred that if lan- 
guage declines with mental disorder according 
to the regression principle, then language 
progresses with normal development according 
to the general law, showing increasing mastery 
of abstraction with age. This reasoning as- 
sumes an intermediate premise which should 
itself be tested: that linguistic decline re- 
traces the path of linguistic growth, rather 
than taking some other course. That obstacle 
has not dampened the enthusiasm for the 
inference in the past, and the relevance of 
language disorder for general linguistic theory 
remains an open question. 


Homograph Interpretation 


A homograph is a word that can assume 
two or more different meanings without any 
change in spelling. Usually a homograph is 
likewise a homophone, being pronounced the 
same regardless of its meaning, but there are 
exceptions. It is a promising tool for experi- 
mental psycholinguistics because it offers a 
. means of studying systematically various se- 
mantic and syntactic aspects of written lan- 
guage. Pursuing the line of research on clini- 
cal assessment aforementioned, Willner 
(1965) designed an Unusual Meanings Vo- 
. cabulary Test, comprised of homographs hav- 
3 ing a very familiar meaning and an unusual 
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meaning. Whereas most vocabulary tests mea- 
sure the number of words of progressively de- 
creasing familiarity for which S can give one 
meaning, his test required S to select from 
five alternatives the uncommon meaning of 
a familiar homograph. Willner found that in 
comparison with normal controls, schizo- 
phrenics knew significantly fewer unusual 
meanings than usual meanings (the latter 
measured by a standard WAIS Vocabulary 
test). This discrepancy did not reach signifi- 
cance for brain-damaged patients. Both pa- 
tient groups were significantly impaired in 
abstract reasoning. From these results Willner 
speculated that impaired reasoning by schizo- 
phrenics may be attributable to pathological 
alterations in the meanings of words used in 
the abstraction process, whereas brain damage 
impairs reasoning by diminishing the ability 
to form relationships between words without 
altering their meaning. 

The meaning of a homograph in a sentence 
is ambiguous, and correct (consensual) inter- 
pretation depends on syntactic and referential 
constraints in the sentence, as well as the rela- 
tive “mediating response strengths” for the 
alternative meanings possible. Chapman, 
Chapman, and Miller (1964) found that 
errors of interpretation in sentence contexts 
by both normals and schizophrenics were 
biased toward the primary alternative, that is, 
the meaning which first comes to mind when 
the homograph is encountered out of context. 
The primacy bias for the schizophrenics was 
significantly greater than for the normals. 
Rather than being qualitatively different, this 
schizophrenic deviation was merely an exag- 
geration of a normal language pattern. In a 
related experiment, the same authors found 
no differences between the two groups in 
knowledge of primary and secondary homo- 
graph meanings, so that the stronger primacy 
bias of the schizophrenics could not be at- 
tributed to loss of secondary meanings from 
their vocabulary. This finding opposes the 
results of Willner’s study; however, the for- 
mat of Willner’s vocabulary test was slightly 
different and his items were considerably more 
difficult. 

The present investigation was designed to 
test whether schizophrenics or aphasics differ 
from controls in (a) making more primary, 
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than secondary errors of homograph interpre- 
tation in context, (b) making more concrete 
than abstract errors of homograph interpreta- 
tion in context, (c) knowing fewer secondary 
than primary homograph meanings, (d) 
knowing fewer abstract than concrete homo- 
graph meanings, and (e) making more inter- 
pretational errors in context attributable to 
vocabulary deficiency. Chapman's theory pre- 
dicts that for schizophrenics the first hypothe- 
sis would be confirmed, but the third and fifth 
(as related to primacy bias) would be re- 
jected. Willner’s theory predicts confirmation 
of the third hypothesis for schizophrenics but 
rejection of it for aphasics, The second and 
fourth hypotheses are formulations of the 
popular “principle of linguistic regression” in 
terms of semantic disambiguation and vocabu- 
lary, respectively. 


METHOD 
Subjects 


The experimental Ss were 44 chronic Veterans 
Administration patients with a primary diagnosis of 
schizophrenic reaction and no indications of brain 
damage, and 11 Veterans’ Administration patients 
with acute brain syndrome who showed primary 
aphasic symptoms and no indications of psychosis. 
Almost all of the schizophrenics were diagnosed as 
chronic undifferentiated or paranoid type. Seven 
aphasics had suffered cerebral vascular accidents, 3 
had received head injuries and the etiology in one 
case was unknown, The control groups were 22 
chronic Veterans’ Administration patients with a 
primary diagnosis of alcohol addiction but no indica- 
tions of psychosis or obvious brain damage, and 20 
normal volunteers who were either state hospital 
employees or working-class acquaintances of the state 
hospital personnel officer. The normals were the 
principal control group. The alcoholics provided a 
measure of control for chronic hospitalization and 
drug intake, which might influence the schizophrenics’ 
performance. The schizophrenics and alcoholics had 
been in and out of hospitals for 15 and 8 yr., re- 
spectively, on the average, and 89% of the former 
and 82% of the latter were receiving tranquillizing 
drugs. Seven alcoholics were also taking antabuse. 

No S was mentally deficient. They participated 
without special incentives, except that the normals 
were paid $2.50. The means and standard deviations 
for length of education were 11.25, SD=2.07 for 
the normals, 10.82, SD —2.52 for the alcoholics, 
10.47, SD=2.65 for the schizophrenics, and 10.91, 
SD = 2.87 for the aphasics. On the Shipley-Hartford 
Vocabulary Scale they were 27.95, SD — 5.69 for the 
normals, 26.05, SD=9.86 for the alcoholics, 25.20, 
SD=7.44 for the Schizophrenics, and 24.29, SD— 
9.02 for the seven aphasics tested. The four groups 
did not differ in education, parental social class, or 
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vocabulary scores, although the schizophrenics were 
slightly lower than the normals on the latter (p< 
20). The normals were younger on the average (35 
yr. than the alcoholics (42 yr.) and the schizo- 
phrenics (41 yr.), and the aphasics (54 yr.) were 
older than all other groups, but the few associations 
found between age and language performance scores 
were explained by their joint correlation with educa- 
tion. As the groups were comparable in length of. 
education, the age differences were considered in- 
consequential, 


Measures 


The Shipley-Hartford Vocabulary Scale was used 
as a measure of current verbal intelligence and 
"cognitive presence." The interpretation test consisted 
of 48 sentences, each containing a homograph. For 
example, one item read as follows: 


The policeman took his club to the scene of the 
riot, 
This means: A. He took a heavy stick. 
B. He took a set of keys. 
C. He took a group of people. 


Later in the test the following item occurred: “Mr. 
Jones took his son's club on a picnic." The same 
three alternative interpretations followed, in the 
order of A, C, B. A heavy stick is the consensually 
correct answer in the first item but the error alterna- 
tive in the second. A group of people is the error 
alternative in the first item but the correct answer 
in the second. A set of keys is an irrelevant alterna- 
tive in both items, included to measure random 
Tesponses, 

The homographs were selected from a list of over 
500 drawn from the dictionary.* Since nouns are 
most readily scaled for referential concreteness be- 
cause they usually have the clearest referents, only 
homographs with two or more semantically distinct 
nominal referents were retained. The meanings of 
each homograph were scaled for primacy and con- 
creteness by psychology students. One group of 
judges made the primacy evaluations by writing 
down in order “the first meaning that comes to mind 
when the word is seen [scored 3], then the second 
meaning that comes to mind, and the third last of all 
[scored 1]". Another group of judges rated these 
Meanings on a 4-point scale for concreteness of 
referent, following Brown's (1958) definition: A 
concrete word (scored 3) has a clearer “thing 
character, is more “picturable,” has a more tangible 
or visual contour and a smaller denotation than an 
abstract word (scored 0), for example, apple is more 
concrete than experience. 

On the basis of these judgments 24 homographs 
were selected for the inclusion in the interpretation 
test, 6 having a secondary-concrete meaning pair! 


tAn exhaustive list of noun homographs com- 
piled by Joy Yeager and the first author, and copies 
of the interpretation and repertory tests, may be 
obtained from Norman F. Watt, William James 1260: 
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TABLE 1 


MEAN PERCENTAGES OF REPERTORY ERRORS BY ALL GROUPS FOR 
Various Types or HowocnAPH MEANING 


i Grou; 

Repertory error type No. items COEM MOD ZA GRADU EMI TU combined 
"- 
Secondary-concrete 6 36 44 48 48 45 
Secondary-abstract 7 19 21 33 55 30 
Primary-abstract 5 13 21 28 47 26 
Primary-concrete 6 13 18 20 42 21 
Total errors 24 20 26 33 48 31 
Allsecondary 13 27 32 40 52 37 
All primary 11 13 19 24 45 23 
All concrete 12 25 31 34 45 33 
Allabstract 12 16 21 31 52 28 

with a primary-abstract meaning, and 6 having a RESULTS 


secondary-abstract meaning and a primary-concrete 
meaning. Three homographs each paired the other 
four possible combinations of primacy and con- 
creteness of meaning. The meanings classified as 
primary had average primacy scores of at least 2.25, 
and their overall average (mean of means) was 2.7. 
The secondary meanings were rated no higher than 
2.00 and their mean of means was 1,2. Thus, the 
primary meanings were very salient and the sec- 
ondary meanings distinctly more obscure, typically 
being the third meaning recalled. The meanings classi- 
fied as concrete had average concreteness ratings of at 
least 2.00, and their overall average was 2.7. The 
abstract meanings were rated 1.75 or less, and their 
overall average was .9. 

The repertory test was designed to test knowledge 
of homograph meanings. As in Willner’s (1965) 
format, each of the 24 homographs was followed by 
five nouns from which S had to select the one closest 
in meaning. A distractor alternative was included in 
17 items. Each homograph had two potential mean- 
ings included in the interpretation test, but only half 
were assessed in the repertory test. 

Scoring. There were 12 different types of error pos- 
sible in the interpretation test, depending on the 
type of meaning required for correct interpretation 
and the type of meaning that was incorrect. For 
example, Error Type 1 consisted of selecting a pri- 
mary-abstract meaning when the correct interpreta- 
tion was secondary-concrete, and Error Type 2 was 
the reverse of this. A misinterpretation ratio was 
computed for each error type by dividing the number 
of wrong items by the number of wrong and right 
items combined [W/(W -- R)]. An error score was 
thus expressed as a percentage of the items answered 
meaningfully (i.e. excluding the items where S chose 
the irrelevant alternative). Any S was dropped from 
the study if he chose the irrelevant meaning on more 
than half of the items for any of the 12 error types. 
Four schizophrenics were eliminated for this reason. 
This scoring procedure is different from the one used 
by Chapman et al. (1964), who subtracted the irrele- 
vant choices from the wrong ones to arrive at the 
error score. They did not exclude any S for choosing 
too many irrelevant meanings. 


Semantic Repertory 


The results of the repertory test are pre- 
sented in Table 1 and summarized by analysis 
of variance in Table 2. Through an over- 
sight in constructing the repertory test, the 
number of items was not equal for all four 
types of homograph, so the errors were com- 
puted as a percentage of the items of each type 
in the test (5, 6, or 7, respectively). Table 1 
gives the mean percentages for each group on 
each type. The analysis of variance in Table 2 
was based on the raw scores after adjusting 
them for differences in the number of items 
present. The significant main effect for diag- 
nosis shows that the groups differed in their 
total errors. The aphasics made more repertory 
errors than every other group, and the schizo- 


TABLE 2 


SuMMARY OF UNWEIGHTED-MEANS ANALYSIS OF 
VARIANCE OF REPERTORY TEST ERRORS 


Source of variation SS af MS F 
Between Ss 96 
Diagnosis (D) 11998,34 3 | 3999,45 | 8,22" 
Ss within groups 
(errorp) 45258.10 | 93 | 486.65 
Within Ss 291 
Primacy (P) 4456.14 1 | 4456.14 | 43,37 
xP 370.93 3| 123.64| 1.20 
P X Ss within groups 
(errorpp) 9556.17 | 93| 102.75 
Concreteness (C) 468.1 | 1| 468.11| 4.22** 
DXC 1005.09 3| 335.03] 3.02* 
C X Ss within groups 
(errorpc) 10323.94 | 93 | 111.01 
PXC 1705.82 1 | 1705.82 | 20,73*** 
DXPXC 732.68 3| 24423| 2,97* 
PC X Ss within 3 
groups (errorpc) 7654,06 | 93 82.30 
*p <.10. 
*»* $5 <.05. 
+p c 001. 
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TABLE 3 


TOTAL Error RATES ON THE INTERPRETATION TEST FOR ALL GROUPS 
AS A FUNCTION OF REPERTORY ADEQUACY 


Interpretation test error rate (%) 


Repertory item 


holics 
(220) tee 
Correct 
M 7.40 12.68 
SD 5.16 15.35 
Incorrect 
M 9.95 24.18 
SD 15.63 24.22 


phrenics more than the normals, but the alco- 
holics were not significantly different from 
either the normals or the schizophrenics. The 
significant main effect for primacy shows, as 
expected, that all groups knew more primary 
than secondary word meanings. The signifi- 
cant main effect for concreteness shows that 
more concrete than abstract repertory errors 
were made. This result is primarily due to the 
unusually high error rate for secondary-con- 
crete meanings (cf. the strong Primacy X 
Concreteness interaction), which is probably 
attributable to the presence of distractor al- 
ternatives with high “pull” for the secondary- 
concrete items, Data from an alternate form 
of the repertory test with different Ss showed 
the reverse order of secondary-concrete and 
secondary-abstract errors (a more logical re- 
sult), which suggests that the error rate for 
secondary-concrete meanings in the present 
experiment may have been inflated artifac- 
tually. 

None of the three interactions reflecting 
group comparisons of repertory error types 
reached statistical significance. From the in- 
significant Diagnosis X Primacy interaction 
the authors conclude that neither the schizo- 
phrenics nor the asphasics were relatively de- 
ficient in knowledge of secondary word mean- 
ings. Specifically, the discrepancy between 
Secondary and primary errors was no larger 
for the schizophrenics than the normals ¢= 
.58) nor the alcoholics (¢=.76) and was 
actually smaller for the aphasics than the 
normals (ż = —1.19) and the alcoholics t= 
—1.05). This result confirms Chapman’s ob- 
servation that schizophrenics and normals do 
not differ in their knowledge of secondary 


hasi G 
Paria tina à ha 1 pea combined 
20.73 23.36 16.45 
13.62 11.78 13.88 
27.61 29.91 23.45 
20.88 21.16 21.96 


meanings, rather than Willner's finding of 
the contrary. The result for the aphasics is 
consistent with Willner's finding for brain- 
damaged patients. 

The trend toward significant Diagnosis X 
Concreteness interaction is largely accounted 
for by the performance of the aphasics, whose 
discrepancy between abstract and concrete 
errors was larger than the normals’ (¢ = 2.08, 
P € .05") and the alcoholics’ (t = 2.34, p< 
.05) and slightly larger than the schizo- | 
phrenics’ (£= 1.45, p< .20). The schizo- 
phrenics showed a slight trend in the same 
direction when compared with the alcoholics 
(£ = 1.45, p < .20) but not with the normals 
(t = 1.09, ns). 


Relation of Repertory to Interpretation 
in Context 


It will be recalled that the repertory test 
included half of the meanings used in the in- 
terpretation test, so it was possible to ex- 
amine the role of repertory adequacy in 
determining errors of interpretation for those 
24 items. The results with reference to the 
total error rates on the interpretation test 
(Table 3) were analyzed by unweighted- 
means analysis of variance. There was 4 SIE 
nificant main effect for diagnosis (F = 6.37, 
$ < .001), showing that the groups differed in 
their overall error rates on the interpretation 
test, and a significant main effect for repertory 
adequacy (F = 7.63, p < .01), indicating that 
all groups combined made more errors of 
interpretation when the meaning required for 
correct interpretation was not known. The in- 


5 All significance levels were based on two-tailed 
tests, unless otherwise indicated. 
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teraction of Diagnosis X Repertory Adequacy 
did not approach significance (F = .54), in- 
dicating there were no differences among the 
groups in the proportion of interpretation 
errors due to unknown meanings. 

Table 4 presents the interpretation error 
- rates for all groups combined, broken down 
by the four varieties of homograph meaning 
required and by repertory adequacy. When 
the homograph meaning was correct, the over- 
all error rates for the four types of meaning 
did not differ significantly. But the results in 
the lower half of the table show clearly that 
inadequate vocabulary accounts for more 
errors of interpretation when the correct inter- 
pretation requires a secondary-abstract mean- 
ing than in any other type. The low sec- 
ondary-concrete error rate there corroborates 
the authors’ earlier impression that some 
secondary-concrete repertory errors were due 
to artifacts in the repertory test rather than 
true ignorance of the meaning. When the 
data were broken down further by diagnostic 
groups, some of the error frequencies were too 
small for meaningful analysis, but it was clear 
from examination of the data that there were 
no further differences among the groups. 


Semantic Interpretation in Context 


Table 5 summarizes the overall results for 
primacy bias, concreteness bias, and total 
errors of interpretation in context. The groups 
differed significantly in total errors, with the 
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TABLE 4 


PERCENTAGE OF INTERPRETATION TEsT ERRORS BY 
ALL Groups COMBINED rog EACH TYPE OF 
HOMOGRAPH MEANING REQUIRED AS A 
FUNCTION OF REPERTORY ADEQUACY 


Interpretation test error rate (%) 


Repertory Item | Type of meaning required 


Total 


alcoholics making more errors than the nor- 
mals (t = 2.25, p < .05), the schizophrenics 
more than the alcoholics (¢ = 2.39, p < .02), 
and the aphasics more than the schizophrenics 
(t = 1.74, p < .10). Analysis of variance of 
the primary and secondary errors showed a 
significant main effect for diagnosis (F = 
8.93, p < .001) but only slight evidence of a 
primacy effect (F = 1.66, p = .20) and no 
Primacy X Diagnosis interaction (F — .61, 
ns). Comparing specifically the primacy 
biases (primary-secondary errors), the schizo- 
phrenics slightly exceeded the normals (¢ = 
1.61, ? « .20) and the alcoholics (¢ = 1.24, 


TABLE 5 
Summary OF Error RATES ON THE INTERPRETATION TEST 


Type of Error 


Primary errors 

(Error Types 1, 3, 7, & 9) 
Secondary errors 

(Error Types 2, 4, 8, & 10) 
Primacy bias 

(Primary-secondary errors) 
Concrete errors 

(Error Types 2, 3, 6, & 12) 
Abstract errors 

(Error Types 1, 4, 5, & 11) 
Concreteness bias 


Mean interpretation error rate (%) 


wea) | m2» [ (220 | Qu 

| 18 | 1010 | 1374 21.81 26.63 
9.70 14.21 16.75 23.78 

0.40 —047 5.06" 2.85 

1345 19.18 2486 32.18 

5.50 10.64 1946 23.09 

7.95 8.54 540 9.09 

8.95 14.05 20.36 26.82 


(Concrete-abstract errors) 
Total errors 


macy bias of the schizophrenics showed a slight trend toward significant difference from the normals (p < .20) and 


* The 
the alcoholics ( < .20). 
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p< 20) but not the aphasics. Only if the 
data for the normals and alcoholics are com- 
bined, and the comparison is given the ad- 
vantage of a one-tailed test of significance, do 
the data support Chapman’s theory of greater 
primacy bias in schizophrenics (¢ = 1.69, df 
= 84, p < .05, one-tailed test). 

The analysis of variance of the concrete and 
abstract errors found a significant main effect 
for diagnosis (F = 11.67, p < .001) and for 
concreteness (F = 22.31, p< .001) but no 
Concreteness X Diagnosis interaction (F = 
.24, ns). None of the group comparisons of 
concreteness bias approached statistical sig- 
nificance. 

Examination of the results for Error Types 
1-4, which included half of the 48 test items, 
provided an additional test of the interaction 
Primacy Bias X Concreteness Bias. Again the 
authors found a significant main effect for 
diagnosis and for concreteness, and a signifi- 
cant main effect for primacy, which corrobo- 
rated for the first time Chapman's hypothesis 
of a general human inclination to make more 
primary than secondary errors. However, 
there was no interaction of primacy bias and 
concreteness bias for the groups combined, 
and none of the interactions involving diag- 
nosis approached significance. 


Correlations of Semantic Measures with 
Other Variables 


The correlations of the principal language 
measures with verbal intelligence, education, 
and length of hospitalization were analyzed 
for all groups but the aphasics. The most 
striking finding was that the language mea- 
sures of the schizophrenics and alcoholics cor- 
related higher with verbal intelligence than 
with education in almost every case, whereas 
the performance of the normals consistently 
correlated as highly with education as with 
verbal intelligence, especially on the mea- 
sures of interpretation in context. Thus, when 
the variance for education was partialed out, 
the correlation between total interpretational 
errors and verbal intelligence dropped from 
—.49 (p < .05) to —.12 (ns) for the normals, 
but remained virtually unchanged for the 
schizophrenics (—.75 to —.73) and alcoholics 
(—.63 to —.61), all significant at .01. The 
analysis for total repertory errors was simi- 
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lar, though less extreme. Therefore, it seems 
clear that intellectual training plays a more 
important role in determining these language 
functions for normals than for either of the 
patient groups. Considering that the measure 
of verbal intelligence at the same time assesses 
attentiveness or presence of mind, it could be 
inferred that semantic functioning in mental 
patients is dependent to a great extent on 
current attention to the task. If this is true, 
then it is remarkable that the phenomenon is 
as marked in the alcoholics as the schizo- 
phrenics, which would imply that it is not 
specific to schizophrenia but is general in 
chronic patients, perhaps reflecting the ef- 
fects of drugs, long hospitalization, or varia- 
bility in the motivation to comply with the re- 
quirements of the testing situation. 

The inference about hospitalization effects 
is challenged somewhat by the fact that only 
1 of the 28 correlations with length of hos- 
pitalization was significant. That correlation 
showed that schizophrenics who had been hos- 
pitalized longer had less primacy bias, mainly 
because they made more secondary errors of 
interpretation in context. This result seems to 
be incongruous with Chapman's theory, un- 
less it is assumed that the primacy bias di- 
minishes with increasing chronicity, which 
seems unlikely. 


SuMMARY AND DISCUSSION 


Groups combined. Al groups knew more 
primary than secondary word meanings; that 
is, meanings which come readily to mind are 
known by more people than less salient mean- 
ings. In short, “retrieval readiness" is posi- 
tively related to comprehension. The Ss in 
this experiment knew more abstract than 
concrete meanings, but it is believed that this 
was principally due to a few strong distrac- 
tors among the secondary-concrete items. (It 
would have served the purposes of this study 
better to have no distractors at all on the 
repertory test, but that would deviate from 
Willner's procedure.) Comparison of the pri- 
mary-concrete versus primary-abstract items; 
and results from a separate experiment, 
strongly indicate that if artifacts are absent, 
more concrete than abstract meanings are 
known. More errors of interpretation Were 
made when the homograph meaning was not 
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known than when it was known, especially 
when the correct interpretation was secondary 
and abstract. 

Regarding interpretation of homographs in 
context, there was a strong concreteness bias 
but less clear evidence of an overall primacy 
bias. Further analysis revealed an inverse rela- 
tion between strength of the primacy bias and 
degree of ambiguity in the sentence. (Sen- 
tence ambiguity, or general difficulty in cor- 
rectly interpreting the item, was rated in- 
tuitively by the second author; it usually 
boiled down to the number of misleading cues 
in the sentence.) There was a clear primacy 
bias (+ 4.596) in sentences with low ambig- 
uity, but an even stronger secondary bias 
(— 6.9%) in sentences with high ambiguity. 
It appears that there is a general human 
tendency to err by choosing a more common 
meaning when the interpretation is relatively 
uncomplicated by misleading cues, as if such 
errors were automatic (“by force of habit"). 
However, when the sentence is highly am- 
biguous, Ss err by choosing the less common 
meaning, as if they thought, “an interpreta- 
tion that requires reflection must imply an 
obscure meaning." This account of the pri- 
macy bias is also consistent with Chapman's 
finding a marked primacy bias on an instru- 
ment intentionally constructed with sentences 
of low ambiguity. In the present experiment 
there was no interaction between primacy 
bias and concreteness bias. 

Group differences. Mental patients made 
more total errors on the raw experimental 
measures than the normals, and the magni- 
tude of semantic "impairment" was related 
to severity of language disturbance clinically. 
Consistently the aphasics made the most 
errors, followed by the schizophrenics, the 
alcoholics and the normals, in that order. The 
same order obtained for percentage of irrele- 
vant alternatives chosen on the interpretation 
test, the rates being 6% (aphasics), 2% 
(schizophrenics), 1% (alcoholics), and .3% 
(normals). This implies that a general factor 
of inattention to the task (or inability to 
focus attention meaningfully) accounts for a 
substantial part of the errors in repertory and 
interpretation. The fact that the alcoholics 
differed from the normals on most of the 

, measures indicates that semantic functioning 
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is influenced by chronic hospitalization or 
some clinical aspect of alcoholism. 

With regard to the principal hypotheses, the 
aphasics knew fewer abstract meanings than 
the control groups, which confirms previous 
research and the general clinical impression 
that abstract vocabulary is most affected in 
aphasia. This repertory deficiency did not 
handicap them, especially on the interpreta- 
tion test. A deficiency of abstract meanings 
was not found for the schizophrenics, nor any 
evidence of a stronger bias toward concrete 
errors of interpretation in context among the 
mental patients. Thus, except for the loss 
of abstract meanings in the aphasics, the 
principle of linguistic regression found no 
support in this study. 

The schizophrenics showed a somewhat 
stronger primacy bias than the control groups 
in interpreting words in context. This sup- 
ports Chapman’s (1964) theory, but not very 
strongly. It should be noted that, although 
the homographs were selected by clearly more 
stringent criteria for primacy than Chap- 
man’s, the sentences spanned a much wider 
range of semantic ambiguity than his. The 
authors found that Chapman’s theory held for 
relatively unambiguous items, all groups show- 
ing a primacy bias and the schizophrenics 
more than the others. However, finding a 
secondary bias on highly ambiguous items and 
no group differences suggests that his theory 
may not hold for all types of disambiguation 
measures. 

The present study confirmed Chapman’s 
finding that schizophrenics do not “lose” sec- 
ondary meanings from their vocabulary. Will- 
ner’s (1965) contrary finding that schizo- 
phrenics knew fewer unusual (secondary) 
meanings may be due to his test being more 
difficult. Normals in the present study made 
27% errors on the secondary items, whereas 
Willner’s normals made 45-50% errors on his 
Unusual Meanings Vocabulary Test. Further, 
Willner * does not “really believe that the un- 
usual meanings of words are permanently 
‘Jost,’ but rather that schizophrenic patients 
have significantly more difficulty in retrieving 
these meanings.” This implies that schizo- 
phrenics might perform poorly on Willner’s 


6 A. Willner, personal communication, 1968. 
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test (in which correct answers require weaker 
mediating responses) for the same reason that 
they might show a strong primacy bias when 
interpreting words in context (overreliance on 
strongest mediating responses). Such an ex- 
planation would reconcile Willner's seemingly 
different results with those in this study and 
in Chapman’s, 
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